Google has introduced a new tool called File Search, designed to simplify retrieval-augmented generation (RAG) workflows in enterprise environments. This innovation directly challenges OpenAI’s Assistants API by offering a more streamlined approach to data retrieval and processing.
RAG technology enables applications and agents to find the most relevant and reliable information for user queries. However, traditional RAG setups often present engineering obstacles and inefficiencies, requiring developers to integrate multiple tools manually, such as storage systems and embedding generators.
To address these issues, Google released the File Search Tool through its Gemini API — a managed RAG solution designed to simplify the process. This service “abstracts away the retrieval pipeline,” reducing the complexity of building custom systems and eliminating the need for manual orchestration.
“File Search provides a simple, integrated and scalable way to ground Gemini with your data, delivering responses that are more accurate, relevant and verifiable,” Google stated in a blog post.
The new File Search solution positions Google as a direct competitor to enterprise-grade RAG systems from OpenAI, AWS, and Microsoft. Google claims its product is more self-sufficient, requiring minimal coordination between different components.
Some key features, including storage and embedding generation, are accessible for free during query time. Enterprises start paying for embeddings when their files are indexed, at a fixed rate of $0.15 per one million tokens.
Author’s summary: Google’s File Search integrates and simplifies RAG workflows, challenging competitors by offering a managed, cost-efficient, and developer-friendly enterprise solution.