Open Interpreter is a desktop agent designed to help users read, edit, and create documents—including PDF forms, Excel sheets, and Markdown files. It enables automation such as filling PDF forms, analyzing and editing spreadsheets, and transforming documents interactively through AI. Users can work alongside advanced AI agents for a variety of tasks, integrate AI models of their choice, or use locally run or managed models. The tool is suitable for professionals, businesses, and users looking to streamline complex document workflows with AI assistance.
OpenAI's 'Code Interpreter' plugin lets ChatGPT generate and run programming code, but it has restrictions: it cannot connect to the Internet, and the available packages, upload size, and execution time are all limited. 'Open Interpreter' is an open source AI tool that gets around these limitations by running locally, where it can flexibly generate and execute a wide range of programs, so I actually tried it out.

KillianLucas/open-interpreter: OpenAI's Code Interpreter in your terminal, running locally
https://github.com/KillianLucas/open-interpreter/blob/main/docs/README_JA.md

A demo video shows Open Interpreter in operation. The user enters the prompt 'Can you set my system to dark mode?', and the AI generates code that switches the system to dark mode. Next, the user asks for a simple Pomodoro timer. The AI automatically generates HTML, CSS, and JavaScript, producing a nice timer app. Next, the user specifies a PDF file and enters the prompt 'Can you summarize?', and the AI summarizes the content. With 'Please list the contents of your calendar', the calendar app starts automatically and the schedule is listed. When asked to send it by email, the email app starts up and the email is sent. Finally, a new terminal is started from the folder where Word files are saved. With 'Can I convert the Word file here to PDF?', PDF files are generated automatically one after another.

◆I actually tried using it
Now that we have seen how it works, let's actually run Open Interpreter on a Windows 10 PC. You need Python to run Open Interpreter, so first go to the official Python website, hover your cursor over 'Downloads', and click the 'Python 3.XX.X' button that appears. The installer will be downloaded, so double-click it to run it. Check 'Add python.exe to PATH' and click 'Install Now'. The installation begins, so wait a while.
When finished, click 'Close' to exit the installer. Once the Python installation is complete, open the Start menu, search for 'cmd', and click the Command Prompt entry that appears to start it.

Install Open Interpreter using the command below.

[code]pip install open-interpreter[/code]

Then start it with the command below.

[code]interpreter[/code]

You will be asked to enter an OpenAI API key every time you start it. In addition to OpenAI, 'Azure', 'Anthropic', 'Replicate', 'AI21', 'OpenRouter', 'Cohere', and 'Petals' are available, and it is also possible to run it with a local large language model. However, GPT-4 is reportedly strongly recommended, so I will use GPT-4 this time.

Open the OpenAI API keys screen and click 'Create new secret key'. Give it a meaningful name and click 'Create secret key'. A new API key will be generated, so copy it. Go back to the command prompt, paste it, press Enter, and you're ready to go. If you find it troublesome to enter the API key every time, you can register it as an environment variable.

As a first trial, I entered the prompt 'Plot AAPL and META's normalized stock prices', which asks for a plot of normalized stock price data for Apple and Meta. Open Interpreter shows a plan of execution steps along with the code and asks whether you want to run it. Type 'y' and press Enter. If an error occurs, GPT-4 analyzes the error and corrects the code automatically. New code is generated, so type 'y' and press Enter. Another error appeared and corrected code was generated again, so type 'y' and press Enter once more. That finally completes the first step: importing the necessary libraries. Code is then generated to download historical stock price data. Type 'y' and press Enter.
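The article does not say what 'normalized' means in that prompt. One common reading, assumed here, is that each price series is divided by its first value so that every stock starts at 1.0 and the two can be compared on one axis. The following is a minimal sketch of that idea only, not the code Open Interpreter generated. The `normalize` helper and the sample prices are hypothetical and made up for illustration; no real market data is used.

```python
def normalize(prices):
    """Scale a price series so that its first value becomes 1.0.

    Assumption: "normalized" means "divided by the first price".
    """
    if not prices:
        return []
    base = prices[0]
    if base == 0:
        raise ValueError("first price must be non-zero")
    return [p / base for p in prices]


# Made-up closing prices for two hypothetical stocks (not real data).
sample_a = [100.0, 110.0, 121.0]
sample_b = [300.0, 285.0, 315.0]

for name, series in (("A", sample_a), ("B", sample_b)):
    # Round only for display; the normalized values themselves are unrounded.
    print(name, [round(value, 3) for value in normalize(series)])
```

After normalization both series begin at 1.0, so a plot shows relative growth rather than absolute price, which is why stocks with very different share prices can share a chart.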
Imagine controlling your computer, running code, and fetching data, all by simply typing out natural language commands. Open Interpreter makes it possible!
Visit Open Interpreter's official website for product details and to get started.