Skip to content

Research Agent

The ResearchAgent is a built-in utility agent in the AI Refinery SDK for handling complex research queries. It retrieves information from web resources and generates comprehensive responses. For instance, if asked "How do interest rate changes by the Federal Reserve impact the stock market?", the ResearchAgent will

  • search predefined web sources,
  • gather the relevant information,
  • generate the answer based on the relevant information.

This documentation outlines the workflow and configurations needed to use the ResearchAgent.

Workflow Overview

The workflow of ResearchAgent consists of six components:

  1. Source selection: The ResearchAgent retrieves information related to a user query from one or more pre-selected sources (e.g., Google, vector database indices). These sources must be specified before project creation. The ResearchAgent will then utilize all selected sources for its search.

  2. Query transformation: Directly using the user's query to search all selected sources may not be optimal, especially if the query is lengthy or complex. In such cases, the ResearchAgent splits the query into multiple, more manageable queries to ensure relevant results. Thus, the ResearchAgent transforms the user's query into one or more queries tailored for each selected source.

  3. Retrieval: The ResearchAgent sends transformed queries to the selected sources and retrieves relevant data, ensuring it aligns with the original user query.

  4. Reranking: After retrieval, the ResearchAgent uses the reranker API to reorder the retrieved documents based on their relevance to the user's query. This ensures that the most pertinent information is prioritized in the response.

  5. Compression: The ResearchAgent then utilizes the prompt compression API to reduce the size of the top-ranked documents while retaining essential information. This step optimizes the prompt length for efficient processing.

  6. Answer generation: After retrieving, reranking, and compressing data from all sources, the ResearchAgent aggregates the information into a coherent and concise response, which is then presented to the user.

Usage

As a built-in utility agent in the AI Refinery SDK, ResearchAgent can be easily integrated into your project by adding the necessary configurations to your project YAML file. Specifically, ensure the following configurations are included:

  • Add a utility agent with agent_class: ResearchAgent under utility_agents.
  • Ensure the agent_name you chose for your ResearchAgent is listed in the agent_list under orchestrator.

Quickstart

To quickly set up a project with a ResearchAgent, use the following YAML configuration. This setup includes two retrievers (i.e., sources): one for web search (via Google) and another for vector search (over a specified vector database index). You can add more agents and retrievers as needed. Refer to the next section for a detailed overview of configurable options for ResearchAgent.

utility_agents:
  - agent_class: ResearchAgent
    agent_name: My Research Agent # A name that you choose for your research agent. This needs to be listed under orchestrator.
    config:
      reranker_top_k: 15      # Optional: Defines the top-k most relevant evidence pieces to retain post-reranking. Set to a negative value to skip reranking.
      compression_rate: 0.4   # Optional: Specifies the compression rate for retrieved content. Lower values increase compression, discarding more information (default: 0.6). Set to 1 to retain all content with no compression.

      retriever_config_list: 
      # The list of configurations of the retrievers i.e., sources used by your research agent to search for the relevant information related to a user query.
        - retriever_name: "Internet Search" # A name you choose for your retriever
          retriever_class: WebSearchRetriever # WebSearchRetriever is the type of retriever that performs web search via Google. 
          description: "This data source can collect the latest news / information from the open internet to answer any queries." # Optional. A description of the retrievar

        - retriever_name: "Knowledge Brain" # A name you choose for your retriever
          retriever_class: AzureAISearchRetriever # AzureAISearchRetriever is the type of retriever that performs web search via Google. 
          description: "Knowledge base for all Accenture internal documents and publications such as briefs, plans, and reports. This is expecially useful for studing previous marketing campaigns and Accenture-related topics." # Optional. A description of the retrievar

          aisearch_config:
          # The details of the vector database index that the `ResearchAgent` can search over
            base_url: azure.com # base url of where your vector database index is hosted>
            api_key: "xyz" # your azure api key for the vector database
            index: "my-knowledge-brain-2024" # name of the vector database index

            embedding_column: "outputMistralVector" # The column of the index you stored your embedded data that you want the `ResearchAgent` to search over
            embedding_config: 
              model: "intfloat/e5-mistral-7b-instruct" # Embedding model for the retrieval, must match the config you used for embedding your data in the embedding_column
            top_k: 5 # Number of documents to retrieve

            content_column: 
              - "output" # The content that we are going to retrieve

orchestrator:
  agent_list:
    - agent_name: "My Research Agent" # The name you chose for your ResearchAgent above.

Template YAML Configuration of ResearchAgent

In addition to the configurations mentioned for the example above, the ResearchAgent supports several other configurable options. See the template YAML configuration below for all available settings.

agent_class: ResearchAgent
agent_name: <name of the agent> # A name that you choose for your research agent
agent_description: <description of the agent> #Optional
config:
  reranker_top_k: <number of top k>      # Optional: Specifies the number of top results you want from the reranked output.
  compression_rate: <compression rate>   # Optional: Defines the desired level of compression. Set to 1 to retain all content without any compression.


  retriever_config_list:
  # Required. Here, you list the configurations of the retrievers i.e., sources to be used by your ResearchAgent.
    - retriever_name: <name of the retriever>
      retriever_class: WebSearchRetriever # The retriever_class must be "WebSearchRetriever" if you want your research agent to search the web
      description: <description of the retriever>
      query_transformation_examples:
      # Optional. Examples of how the research agent should convert the user query to a query or list of queries that is suitable for the web search
        - user_query: <an example query from the user>
          query:
          - <transformed query 1>
          - <transformed query 2>
      source_weight: <a source weight> # The importance weight of the information retrieved by this particular retriever if you are using multiple retrievers. 

    - retriever_name: <name of the retriever>
      retriever_class: AzureAISearchRetriever #Required. The retriever_class must be "AzureAISearchRetriever" if you want your research agent to perform vector search over an index hosted on Azure
      description: <description of this retriever> #Optional.

      aisearch_config:
        # Required. Vector database index
        base_url: <base url of where your vector database index is hosted>
        api_key: <your azure api key for the vector database>
        index: <name of the index you want your AzureAISearchRetriever to search>

        # Required. Embedding column / model
        embedding_column: <the column of the index you stored your embedded data that you want to search over>
        embedding_config:  # Embedding model for the retrieval, must match the config you used for embedding your data in the embedding_column
          model: <name of the model>
        top_k: <number of documents to retrieve>  # Optional (top k, default 10)

        # Required: Content that we are going to retrieve
        content_column: # a list of the content columns
          - <name of your content column 1>
          - <name of your content column 2>
          - <name of your content column n>

        # Optional:
        # Each document can be chunked into multiple rows. The aggregate_column
        # column stores the name of the document If aggregate_column is not
        # specified, we will treat each chunk as a document
        aggregate_column: <your aggregate column>

        # Optional: meta data that we want to include besides the chunk column
        meta_data: 
          - column_name: <"name of a column">
            load_name: <"load name of the column">

      query_transformation_examples:
      # Optional. A list of examples of a query from the user and their transformed versions for efficient vector search.
        - user_query: <an example query from the user>
          query:
            - <transformed query 1>
            - <transformed query 2>

      source_weight: <a source weight> # The importance weight of the information retrieved by this particular retriever if you are using multiple retrievers.

  output_style: <"markdown" or "conversational" or "html">  # Optional field
  contexts:  # Optional field
    - "date"
    - "chat_history"
    - "env_variable"
  llm_config:
  # Optional. Customized llm config (if you want the research agent to use a different LLM than the on in your base config)
    model: <model_name>