Autonomously retrieves, classifies, embeds, and performs semantic search to extract structured data from PubMed for clinical trials.
The BioRAG AI Agent is an intelligent system designed to automate the semantic retrieval and structured data extraction of clinical trial information from PubMed. It functions as a Retrieval-Augmented Generation (RAG) system, taking natural language prompts and converting them into optimized PubMed queries. Leveraging a Model Context Protocol (MCP) server for robust data fetching, the agent then classifies, embeds, and stores relevant articles in a FAISS vector database. Finally, it conducts semantic searches and utilizes large language models to extract key biomedical metadata such as drug names, indications, sponsors, and NCT IDs, providing a structured biomedical knowledge base.