Auto-GPT and BabyAGI: How ‘autonomous agents’ are bringing generative AI to the masses
Auto-GPT and BabyAGI: bring generative AI to the masses
An exemplary use case of AI for brainstorming and task management can be observed in the BabyAGI Python script. This task-driven autonomous agent is designed to effectively generate, prioritise and execute tasks by leveraging its past outcomes and a predetermined objective.
BabyAGI is an analogue version of the original Task-Driven Autonomous Agent (released on Twitter by Yohei Nakajima in March). With OpenAI's natural language processing and Pinecones data storage and retrieval capabilities, BabyAGI is able to retain context and generate more intelligent results. At just 140 lines of code, it is both comprehensible and easily adaptable
Over the course of the last week, programmers across the globe have initiated the construction of "autonomous agents".
These agents, functioning in connection with large language models (LLMs), like OpenAI's GPT-4, work toward solving intricate problems. Implementing this new technology could mark a crucial accomplishment in LLMs' practical applications.
Usually, we engage with GPT-4 by carefully typing prompts into it, until we obtain the desired output. However, most of us have neither the skill or the patience to develop multitudes of prompts that adequately guide the LLM toward answering a complex question.
As expected, developers have been exploring methods to automate the majority of this process, which is where autonomous agents enter the equation. These agents are capable of executing a series of tasks in a systematic way, aiming to achieve a predetermined "goal" for the large language models.
The range of tasks an autonomous agent can already perform is quite diverse, including web research, code writing and to-do list creation.
To accomplish this, agents effectively provide a software interface that connects to the front of a large language model. This interface can leverage well-known software practices like loops and functions, guiding the language model towards achieving a general objective, such as "searching for and summarising all YouTube videos related to the Great Recession."
Some people refer to these agents as "recursive," given that they operate in a loop, asking the LLM questions that are based on the previous result, until the model generates a complete answer.
BabyAGI
Yohei Nakajima, a venture capitalist and frequent coder and experimenter is the creator of the autonomous agent BabyAGI, which he describes as an "AI agent containing an AI task manager".
Originally designed to automate some of the tasks he performs regularly as a VC - such as researching new technologies and companies - by replicating his workflow, Nakajima realised that BabyAGI could be adapted to achieve many other objectives as well. BabyAGI manages, adds and reprioritizes tasks for the GPT-4 language model to complete. After stripping the agent down to the essentials (105 lines of code), Nakajima posted it on GitHub for others to adopt as a foundation for their own agents, which could be more specialised.
Nakajima is inspired by the inventive ways in which other developers are improving the BabyAGI agent. Some developers have added moderation functions, he notes, as well as the potential to perform parallel tasks, create additional agents, and implement code-writing and robotics functionality.
Auto-GPT
Auto-GPT appears to possess even greater degrees of autonomy than its predecessors. Created by Toran Bruce Richards, Auto-GPT is described on GitHub as an agent powered by GPT-4 that can search the internet in a structured manner. It can create subtasks and launch new agents to complete them. It uses GPT-4 to write its own code, after which it "recursively debugs, develops, and self-improves" the code.
Auto-GPT is capable of solving a wide range of problems, but a case example on GitHub highlights how it can assist a "chef" in managing and growing a culinary business. In this scenario, the "Chef-GPT" agent "autonomously develops and manages businesses to increase net worth."
According to Richards, his original intention for an AI agent was for it to automatically email him daily AI news. However, in the process of his development, he discovered that existing LLMs struggle with "tasks that require long-term planning" or "cannot autonomously refine their approaches based on real-time feedback."
With this understanding in mind, he created Auto-GPT, which, he claims, "can use GPT-4's reasoning to address more complex and extensive problems that necessitate long-term planning and several steps".
AI Catalog's chief editor