Advanced AI Collaboration: Build a Multi-Agent System with CrewAI

Advanced AI Collaboration: Developing a Multi-Agent System with CrewAI

Hello! I'm Tommy, and I'll be guiding you through the advanced realm of Multi-Agent Systems, a topic that extends the capabilities of individual AI agents into powerful, cooperative units that can tackle complex, real-world problems. In this guide, we'll explore how to coordinate multiple AI agents to solve complex tasks, emphasizing scalability, orchestration, and collaboration. Whether you're developing autonomous customer support systems or complex problem-solving applications, this tutorial will provide you with the tools and knowledge you need to succeed. Stick around to see it all come together with a hands-on implementation in Google Colab at the end!

Overview of Multi-Agent System and Framework

Multi-agent systems represent a significant leap from traditional AI paradigms. Instead of relying on a single AI entity to manage all tasks, Multi-Agent Systems allow for specialized agents, each designed for specific roles. This specialization enables more efficient processing, parallel task execution, and the ability to tackle more complex problems.

Benefits:

Scalability: Each agent can be optimized and scaled independently, allowing the system to handle increasing workloads by adding more agents.
Robustness: If one agent fails, others can continue functioning, providing a failover mechanism that enhances system reliability.
Efficiency: Agents can work in parallel or hierarchy, speeding up the overall task completion time, particularly in scenarios where tasks are independent or can be broken down into smaller subtasks.
Modularity: The modular nature of Multi-Agent Systems means that agents can be reused across different systems, reducing development time for new projects.

Challenges:

Coordination Complexity: Ensuring that agents work together seamlessly can be difficult, especially as the number of agents increases.
Communication Overhead: The need for agents to communicate adds overhead, particularly if they rely on different models or frameworks.
Error Handling: Failures in one agent can propagate or cause issues in others, requiring sophisticated error-handling mechanisms.

Introduction to CrewAI

CrewAI is an excellent framework for managing and orchestrating multiple agents. It simplifies the complex concepts of Multi-Agent Systems into manageable structures, providing tools for building, deploying, and managing multi-agent systems in production environments.

Some key features of CrewAI include:

Sequential, Parallel, and Hierarchical Task Execution: By default, tasks are processed sequentially, but CrewAI also supports parallel and hierarchical execution, which is crucial for large-scale systems.
Custom Tool Integration: CrewAI allows developers to create and integrate custom tools tailored to specific agent tasks, enhancing the versatility and effectiveness of the system for their use case.
Memory Management: CrewAI provides mechanisms for short-term, long-term, and entity memory, enabling agents to learn from past experiences and improve over time.
Role-Based Agent Configuration: By focusing agents on specific roles and goals, CrewAI ensures that each agent is optimized for its task, improving overall system efficiency.

Setup and Dependencies

Before defining the agents, let's ensure that your environment is correctly set up. For this tutorial, we'll be using Google Colab. Follow these steps to install the necessary dependencies and set up your environment variables:

Install Dependencies:

Since we're working on Google Colab, installing dependencies is straightforward. We'll be using the crewai, crewai_tools, langchain_community, and pymongo packages. These libraries provide the core functionality for creating and managing AI agents, integrating external tools like those from LangChain, and connecting to a MongoDB database.

The command above was run in a Google Colab notebook, but if you are running it locally, remove the exclamation mark (!).

Set Environment Variables:

Next, you'll need to set up your environment variables. For this tutorial, we'll use the gpt-3.5-turbo model from OpenAI, as it's widely accessible. If you have access to GPT-4, you can skip this step or modify the environment variable accordingly.

Add the following code to your Colab notebook, replacing the placeholder values with your actual API keys and credentials. This setup allows your agents to interact with external services and databases securely.

Designing a Multi-Agent System

Designing a multi-agent system begins with clearly defining the roles and responsibilities of each agent. Let's walk through a practical example: building a Customer Support System where different agents handle distinct tasks such as data retrieval, inquiry resolution, and quality assurance review.

STEP 1: Define the Agents

When creating AI agents, it's crucial to establish a strong mental framework. Start by asking yourself key questions that mirror the thought process of a manager:

Goal Orientation: What is the agent's primary objective? What processes will the agent need to accomplish this goal effectively?
Team Building Analogy: If this were a human task, what type of people would you hire to get the job done? Consider the roles and expertise needed, then map these qualities onto the AI agent's capabilities.

Each agent can run on a different language model (LLM) in a multi-agent system. Since we're using CrewAI for this tutorial, it's worth noting that agents can also integrate models from Hugging Face Hub. This flexibility lets you fine-tune the agents to meet specific needs, providing more customized responses.

For example, you can fine-tune models like phi-3, tinyLLama, or Llama-3 to better suit your use case. If you're unfamiliar with this process, you can refer to my previous tutorials on fine-tuning these models:

Fine-tuning Phi-3
Fine-tuning tinyllama
Fine-tuning Llama-3

To use a model from Hugging Face Hub, you can load it into your agent as follows:

Understanding the Data Retrieval Agent

Role: This agent is defined as a "Data Retrieval Specialist," focused on fetching data.
Goal: The agent's objective is to retrieve all relevant information about a customer from the database.
Backstory: The backstory provides context, helping the agent understand its role within the broader system.
Allow Delegation: Set to False, meaning this agent will not delegate its tasks to others.
Verbose: Enables detailed logging of the agent's actions.

Understanding the Support Agent

Role: The "Senior Support Representative" is responsible for delivering exceptional customer support.
Goal: The agent aims to provide friendly and helpful support.
Backstory: This agent uses the data provided by the Data Retrieval Specialist to assist the customer.
Allow Delegation: Set to False to keep the responsibility of the task within this agent.
Verbose: Detailed logging helps you track how this agent performs.

Understanding the Support QA Agent

Role: The "Support Quality Assurance Specialist" ensures the quality of the support provided.
Goal: This agent's goal is to achieve recognition for maintaining high support quality.
Backstory: It focuses on ensuring the Senior Support Representative's responses are thorough and accurate.
Verbose: As with the other agents, verbose logging is enabled to monitor the agent's activities.

Step 2: Define the Tasks

With our agents defined, the next step is to create the tasks they will perform. Tasks are central to how agents operate, providing a clear set of actions to be carried out using specific tools. The tools parameter is key, as it dictates which resources or utilities the agent will use to accomplish the task. There are various tools available, including those from LangChain, but it's important to select the one that best suits the agent's role and objective. Avoid overloading your agents with too many tools - focus on those that are most effective for the task at hand.

Key Elements of Effective Tools:

Versatility: The tool should handle different types of inputs from the agent, adapting to various scenarios.
Fault Tolerance: It should fail gracefully, possibly by querying further, sending error messages, or prompting specific input ranges.
Caching: This prevents unnecessary repeated requests by using a cross-agent caching layer, optimizing efficiency even when the same query is used by different agents.

Before we define the tasks, let's initialize the tool we'll use.

Tool Initialization

Tools can be built-in or custom-made, depending on the task's requirements. In this tutorial, we'll use several tools, including DirectoryReadTool and FileReadTool for reading files from a specified directory, and a custom tool for database retrieval.

First, let's initialize the built-in tools:

Next, we'll define a custom tool for retrieving data from a MongoDB database. While creating this tool, I encountered an issue where initializing the MongoDB client in the usual __init__ constructor caused errors. After some research, I found that initializing it as a class variable with the appropriate type annotation resolved the issue.

your_code_here

Defining the Tasks

Tasks represent specific objectives that agents must achieve. Each task is defined by a description, an expected_output, the tools it will use, and the agent responsible for carrying out the task.

Data Retrieval Task

This task is assigned to our data_retrieval_agent, whose job is to gather all relevant information about the customer from the database. The data collected here will be crucial for addressing the customer's inquiry in subsequent tasks.


data_retrieval_task = Task(
    description=dedent("""
        Gather all relevant {customer} data from the database, focusing on crucial data which will be great to know when addressing the customer's inquiry.
    """),
    expected_output=dedent("""
        A comprehensive dataset of the customer's information. Highlighting key info of the customer that will be helpful to the team when addressing the customer's inquiry.
    """),
    tools=[retrival_tool],
    agent=data_retrieval_agent,
)

In this task, the retrival_tool fetches the necessary data from the database, which the agent will then process to ensure it's relevant and complete.

Inquiry Resolution Task

Once the data is retrieved, the support_agent will use this information to address the customer's inquiry. This task involves searching through relevant files and generating a detailed response.


inquiry_resolution = Task(
    description=dedent("""
        {customer} just reached out with a super important ask:
        {inquiry}
        {customer} is the one that reached out. Make sure to use everything you know to provide the best support possible. You must strive to provide a complete, clear and accurate response to the customer's inquiry.
    """),
    expected_output=dedent("""
        A detailed, informative response to the customer's inquiry that addresses all aspects of their question.
        The response should include references to everything you used to find the answer, including external data or solutions. Ensure the answer is complete, leaving no questions unanswered, and maintain a helpful and friendly tone throughout.
    """),
    tools=[directory_read_tool, file_read_tool],
    agent=support_agent,
)

Here, the directory_read_tool and file_read_tool help the support_agent sift through stored documentation, ensuring that the response to the customer is well-informed and accurate.

Quality Assurance Review Task

Finally, the support_quality_assurance_agent reviews the response generated by the support_agent. This task ensures that the response meets the high standards of the company, is comprehensive, and is customer-friendly.


quality_assurance_review = Task(
    description=dedent("""
        Review the response drafted by the Senior Support Representative for {customer}'s inquiry. Ensure that the answer is comprehensive, accurate, and adheres to the high-quality standards expected for customer support.
        Verify that all parts of the customer's inquiry have been addressed thoroughly, with a helpful and friendly tone.
        Check for references and sources used to find the information, ensuring the response is well-supported and leaves no questions unanswered.
    """),
    expected_output=dedent("""
        A final, detailed, and informative response ready to be sent to the customer.
        This response should fully address the customer's inquiry, incorporating all relevant feedback and improvements.
        Don't be too formal, we are a chill and cool company but maintain a professional and friendly tone throughout.
    """),
    agent=support_quality_assurance_agent,
    output_file="response.md",
    human_input=True,
)

This task adds a layer of quality control, ensuring that the customer's needs are fully met and that the response aligns with the company's standards.

Step 3: Initialize the Crew

Now that we've defined our agents and tasks, it's time to bring everything together by initializing a Crew. The Crew is the central entity that manages the execution of tasks by the agents. It orchestrates how tasks are processed and whether agents can remember previous interactions.

Process Options

The process parameter controls how tasks are executed by the crew. The options include:

Sequential (Default): Tasks are performed one after the other in a specific order.
Hierarchical: One agent acts as a manager, delegating tasks to other agents while maintaining an overarching memory of the tasks.
Parallel: Tasks are executed concurrently, allowing multiple tasks to run at the same time.

Memory Types

Memory enhances the ability of agents to recall past interactions, improving the quality of responses over time. The memory parameter, when set to True, activates various types of memory:

Short Memory: Only available during the crew's runtime. Once the crew finishes, this memory is cleared and won't be accessible in subsequent runs.
Long Memory: Stores responses in persistent storage, allowing the crew to recall past interactions even after the session has ended.
Entity Memory: Groups and recognizes entities (e.g., customer names, products) within the conversation to provide more contextual and meaningful responses. It's also short-lived and cleared after the session ends.

By default, memory is set to False, but when enabled (memory=True), all types of memory are activated for better performance.

Here's how we initialize the crew with our agents, tasks, and memory configuration:

your_code_here

Key Parameters Explained

agents: A list of agents that will participate in the crew. Each agent is assigned to specific tasks based on their defined roles and goals.
tasks: A list of tasks that the agents need to complete. These tasks are designed to be executed sequentially by default, but you can customize the execution flow by changing the process parameter.
verbose: Setting this to 2 enables detailed logging, so you can monitor how the agents interact with each other and how they progress through the tasks.
memory: By setting memory=True, we enable all types of memory, ensuring the agents can recall previous interactions and provide more informed responses.

Execution Flow

With this configuration, the crew will execute the tasks in sequence:

Data Retrieval: The data_retrieval_agent retrieves relevant customer information from the database.
Inquiry Resolution: The support_agent uses the retrieved data, along with additional information from directories and files, to provide a comprehensive response to the customer's inquiry.
Quality Assurance Review: The support_quality_assurance_agent reviews the response, ensuring it meets quality standards, and then saves it to a file.

Step 4: Kick Off the Crew

After initializing the crew with your agents and tasks, the next step is to kick it off by providing the necessary inputs. These inputs will guide the agents as they work through their assigned tasks.

In this example, we'll be using the following inputs:

customer: The name of the customer inquiring.
inquiry: The specific question or issue the customer wants to be addressed.

Explanation of the Inputs

customer: "Tommy Ade" is the customer whose information will be retrieved and whose inquiry will be addressed by the agents. This is also used to personalize the responses and interactions.
inquiry: The customer is asking about how to fine-tune the Llama 3 model and also requesting some background information about the large language model (LLM). This input will drive the tasks performed by the agents, from data retrieval to inquiry resolution and quality assurance.

Step 5: Reviewing the Crew's Execution

Once the kick off method is invoked, the crew begins executing the tasks. The verbose setting ensures detailed logs are produced, showing how the agents collaborate to achieve the goal. This is particularly useful for understanding the decision-making process and how each agent contributes to the final result.

You should see the Data Retrieval Specialist begin the retrieval task with detailed logs and outputs. A minor error occurs when using the custom tool we defined earlier; since CrewAI helps us manage errors gracefully, it didn't affect the execution and made the agent pass in the right variable which gave us the customer details we wanted.

After the agent is finished working, it returns the finished output and gives it to the next agent who will be taking on the next task. The Support Agent starts and makes use of the directory and file tools to read all the files and prepare a response to a customer query. After the Support Agent has provided its response, the next task kicks off, and the QA agent starts. It then delegates a task to the Support agent again to review its response.

Then the QA agent receives the feedback from the Support Agent, and now seeks our feedback for a better response if need be. The QA agent then receives the feedback and works on its response based on what was said and gives a suitable response.

Final Output in Markdown Format

After the crew completes all tasks, it generates the final result in a markdown format. This format is particularly useful for documentation purposes or for sharing the results in a structured and readable way.

Since we set an output_file for our QA task, the file containing the response will be in our current directory. The content of the output file holds the final response from the Support QA agent. You can view the Google Colab Notebook used for this tutorial HERE.

Generalizing Across Use Cases

Adaptation to Multiple Domains

While this tutorial focuses on customer support examples, the principles of multi-agent systems can be adapted to various industries, from supply chain management to personalized AI-driven services.

Modular and Reusable Design

When designing your system, prioritize modularity. Structure agents and their interactions so that they can be easily adapted or reused in different projects, saving time and resources in future developments.

Conclusion

In this tutorial, we've built a sophisticated multi-agent system using CrewAI, demonstrating how to automate customer support tasks effectively. We started by setting up the environment, defining specialized agents, and creating tasks that leverage various tools, including a custom DatabaseRetrivalTool. By initializing and kicking off the crew, we saw how agents collaborate to retrieve data, draft responses, and ensure quality, all while utilizing memory and human input to produce a refined final output.

To dive deeper into the capabilities and possibilities of multi-agent systems, check out the CrewAI documentation.