Cloudflare Launches AutoRAG: Simple, Fast, and Managed AI With Your Own Data

On April 7, 2025, Cloudflare introduced AutoRAG, a product that lets developers build AI applications that answer from their own data, with minimal setup and no complex infrastructure to manage.

Now available in open beta, AutoRAG is a fully managed Retrieval-Augmented Generation (RAG) system built on Cloudflare’s developer platform.

It takes care of every part of the RAG process, from reading your documents to generating answers with AI, so developers can build data-driven applications without assembling the pipeline themselves.

What AutoRAG Does

AutoRAG makes it easy to build AI tools that give accurate answers based on the latest information from your own content.

It automatically reads your files, converts them into a searchable format, stores them, and uses them to give context-aware AI responses.

You don’t need to build or maintain your own storage, indexing, embedding, and retrieval systems. AutoRAG manages all of them for you.

Why Retrieval-Augmented Generation (RAG) Matters

Large language models like Meta’s Llama 3.3 are powerful, but they only know what was in their training data, which leaves out recent events and company-specific information.

Training a model on your own information is expensive and time-consuming. Adding all of your data to every prompt doesn’t work well either, because prompts are limited by the model’s context window.

RAG solves this problem by retrieving relevant data from your files at query time, then combining it with the question before generating an answer. This grounds the response in your own content and keeps it as current as your files.
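
The "combine it with the question" step can be sketched as plain string assembly. This is a minimal illustration, not AutoRAG's actual prompt: the `RetrievedChunk` shape, the `buildAugmentedPrompt` name, and the instruction wording are all assumptions made for the example.

```typescript
// Hypothetical shape for a retrieved passage; not an AutoRAG type.
interface RetrievedChunk {
  source: string; // e.g. the file the passage came from
  text: string;
}

// Combine retrieved passages with the user's question into a single prompt.
function buildAugmentedPrompt(question: string, chunks: RetrievedChunk[]): string {
  const context = chunks
    .map((chunk, i) => `[${i + 1}] (${chunk.source}) ${chunk.text}`)
    .join("\n");
  return [
    "Answer the question using only the context below.",
    "Context:",
    context,
    `Question: ${question}`,
  ].join("\n");
}

const examplePrompt = buildAugmentedPrompt("What is the refund window?", [
  { source: "policy.md", text: "Refunds are accepted within 30 days." },
]);
console.log(examplePrompt);
```

Because the context is fetched per query, changing the files changes the answer without retraining the model.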

How AutoRAG Works

AutoRAG has two main parts: Indexing and Querying.

1. Indexing (Done in the background)

File Ingestion: AutoRAG reads files from Cloudflare R2 (Cloudflare’s object storage service). It supports many formats, including PDF, HTML, images, and plain text.
Markdown Conversion: All files are converted into structured Markdown for consistency.
Image to Text: Images are processed using AI to turn them into descriptive text.
Chunking: Text is split into smaller sections for better search accuracy.
Embedding: Each section is turned into a vector (a list of numbers that represents its meaning, so similar content can be found by comparing vectors).
Vector Storage: These vectors are stored in Cloudflare’s Vectorize database, ready to be searched later.
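
To make the chunking step concrete, here is a minimal sketch of fixed-size chunking with overlap. It is a generic technique shown for illustration; the article does not say how AutoRAG chunks internally, and the `chunkText` name and the sizes used are assumptions.

```typescript
// Split text into fixed-size chunks that overlap, so content cut at a
// boundary also appears at the start of the next chunk.
// chunkSize and overlap are illustrative values, not AutoRAG defaults.
function chunkText(text: string, chunkSize: number, overlap: number): string[] {
  if (chunkSize <= 0 || overlap < 0 || overlap >= chunkSize) {
    throw new Error("Require chunkSize > 0 and 0 <= overlap < chunkSize");
  }
  const chunks: string[] = [];
  const step = chunkSize - overlap;
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // this chunk reached the end
  }
  return chunks;
}

const sampleChunks = chunkText("abcdefghij", 4, 1);
console.log(sampleChunks); // ["abcd", "defg", "ghij"]
```

Each chunk would then be passed to an embedding model and stored as a vector. That part is omitted here because it depends on the model used.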

2. Querying (Happens live when someone asks a question)

Query Embedding: The user’s question is also turned into a vector.
Vector Search: AutoRAG searches the database to find the most relevant pieces of your content.
Response Generation: The relevant content and the question are sent to a language model, which generates an answer grounded in that content.

This whole process happens automatically, and the index is updated in the background as your data changes. Your AI responses stay current without manual updates.
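
The vector search step can be illustrated with a brute-force nearest-neighbor search over a toy index. This is only a sketch: cosine similarity is one common metric, the article does not state which metric or search algorithm AutoRAG's index uses, and the three-dimensional vectors and the `IndexedChunk` type are invented for the example.

```typescript
// Hypothetical in-memory record: a chunk of text plus its embedding.
interface IndexedChunk {
  text: string;
  vector: number[];
}

// Cosine similarity: 1 means same direction, 0 means orthogonal (unrelated).
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("Vectors must have equal length");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // avoid dividing by zero
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the k chunks most similar to the query vector, best match first.
function topK(queryVector: number[], index: IndexedChunk[], k: number): IndexedChunk[] {
  return index
    .map((chunk) => ({ chunk, score: cosineSimilarity(queryVector, chunk.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((scored) => scored.chunk);
}

const toyIndex: IndexedChunk[] = [
  { text: "Refund policy", vector: [1, 0, 0] },
  { text: "Shipping times", vector: [0, 1, 0] },
  { text: "Returns and refunds", vector: [0.9, 0.1, 0] },
];
const nearest = topK([1, 0, 0], toyIndex, 2);
console.log(nearest.map((c) => c.text)); // ["Refund policy", "Returns and refunds"]
```

A production vector database typically replaces the full scan with an approximate index, but the ranking idea is the same.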

From Website to Smart Search in Minutes

If your content isn’t already in Cloudflare R2, you can still use AutoRAG. With Cloudflare’s Browser Rendering API, you can crawl your website, capture the content, and upload it into R2 for AutoRAG to use.

Here’s how it works:

Set up a Cloudflare Worker using Puppeteer to visit and capture your web pages.
Upload the HTML content to R2.
Create an AutoRAG instance and connect it to your R2 bucket.
AutoRAG will automatically process and index your content.
You can now ask questions and get accurate answers based on your site.

The entire process can take less than 5 minutes.
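
The first two steps above (render each page, then upload its HTML) can be sketched with the two dependencies hidden behind small interfaces. Everything named here is an assumption for illustration: `PageRenderer`, `ObjectStore`, `urlToObjectKey`, and `crawlToStore` are hypothetical. In a real Worker, rendering would likely go through a Puppeteer page from Browser Rendering and `put` would be called on an R2 bucket binding.

```typescript
// Simplified stand-ins for a headless-browser page and an R2 bucket.
interface PageRenderer {
  renderHtml(url: string): Promise<string>;
}
interface ObjectStore {
  put(key: string, value: string): Promise<unknown>;
}

// Turn a page URL into a stable object key, e.g. "example.com/docs/index.html".
function urlToObjectKey(url: string): string {
  const withoutScheme = url.replace(/^https?:\/\//, "");
  const withoutQuery = withoutScheme.split(/[?#]/)[0];
  const trimmed = withoutQuery.replace(/\/+$/, "");
  return trimmed.endsWith(".html") ? trimmed : `${trimmed}/index.html`;
}

// Render each URL and store its HTML so the bucket can be indexed afterwards.
async function crawlToStore(
  urls: string[],
  renderer: PageRenderer,
  store: ObjectStore,
): Promise<string[]> {
  const keys: string[] = [];
  for (const url of urls) {
    const html = await renderer.renderHtml(url);
    const key = urlToObjectKey(url);
    await store.put(key, html);
    keys.push(key);
  }
  return keys;
}

// In-memory fakes so the sketch runs without a browser or a bucket.
const uploaded = new Map<string, string>();
const fakeRenderer: PageRenderer = {
  renderHtml: async (url) => `<html><body>Content of ${url}</body></html>`,
};
const fakeStore: ObjectStore = {
  put: async (key, value) => {
    uploaded.set(key, value);
  },
};

crawlToStore(["https://example.com/docs/intro?ref=nav"], fakeRenderer, fakeStore)
  .then((keys) => console.log(keys)); // ["example.com/docs/intro/index.html"]
```

Keeping the key derivation separate makes re-crawls overwrite the same object, so the bucket does not accumulate duplicates of a page.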

Adding AutoRAG to Your App

Once you’ve created an AutoRAG instance and indexed your content, you can:

Test it directly in the Cloudflare dashboard’s Playground by asking sample questions.
Integrate AutoRAG into your app using a Cloudflare Worker or API.
Use the aiSearch() method to get AI-generated answers or search() to get relevant documents only.

This makes it easy to add smart search or AI chat features to your app with just a few lines of code.
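
As a rough sketch of how the two methods differ, the example below wraps `aiSearch()` and `search()` behind a small client interface and runs against an in-memory fake. The request and response shapes (`query`, `response`, `data`, `filename`, `score`) are simplified assumptions, so check the AutoRAG documentation for the actual types. The helper names `answerWithSources` and `findDocuments` are hypothetical.

```typescript
// Simplified, assumed shapes; the real API may return more fields.
interface AutoRagResult {
  filename: string;
  score: number;
}
interface AutoRagClient {
  aiSearch(params: { query: string }): Promise<{ response: string; data: AutoRagResult[] }>;
  search(params: { query: string }): Promise<{ data: AutoRagResult[] }>;
}

// Ask a question and return the generated answer plus the files it drew on.
async function answerWithSources(rag: AutoRagClient, question: string) {
  const result = await rag.aiSearch({ query: question });
  return {
    answer: result.response,
    sources: result.data.map((item) => item.filename),
  };
}

// Retrieval only: list matching files without generating an answer.
async function findDocuments(rag: AutoRagClient, question: string): Promise<string[]> {
  const result = await rag.search({ query: question });
  return result.data.map((item) => item.filename);
}

// In-memory fake so the sketch runs outside a Worker.
const fakeRag: AutoRagClient = {
  aiSearch: async ({ query }) => ({
    response: `Stub answer to: ${query}`,
    data: [{ filename: "faq.md", score: 0.9 }],
  }),
  search: async () => ({ data: [{ filename: "faq.md", score: 0.9 }] }),
};

answerWithSources(fakeRag, "What is AutoRAG?").then((out) => console.log(out));
```

In a Worker, the client would come from the AI binding rather than a fake, for example `env.AI.autorag("my-autorag")`, where the binding name `AI` and the instance name are placeholders for your own configuration.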

Costs and Limits

AutoRAG itself is free to enable during the open beta. You can create up to 10 AutoRAG instances per account and upload up to 100,000 files per instance.

The underlying Cloudflare services it uses (storage, compute, etc.) are still billed as usual under your account.

What’s Coming Next

Cloudflare has made it clear that AutoRAG is just getting started. Over the coming months, the company plans to expand its capabilities in several important ways.

First, it is adding support for more data sources beyond Cloudflare R2, including structured databases like Cloudflare D1 and direct website URL ingestion without manual crawling. This means developers will be able to pull in data from more locations with less effort.

Second, Cloudflare is working on improving the quality of responses with techniques like reranking (to prioritize the most relevant results) and recursive chunking (to split large documents along progressively finer boundaries, such as paragraphs, then lines, then words).
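
For readers unfamiliar with recursive chunking, here is a minimal sketch of the general technique. It is not Cloudflare's implementation, which the article does not describe. The `recursiveChunk` name and the default separator list are assumptions.

```typescript
// Split on the coarsest separator first; recurse with finer separators only
// for pieces that are still longer than maxLength.
function recursiveChunk(
  text: string,
  maxLength: number,
  separators: string[] = ["\n\n", "\n", " "],
): string[] {
  if (maxLength <= 0) throw new Error("maxLength must be positive");
  if (text.length <= maxLength) return text.length > 0 ? [text] : [];
  if (separators.length === 0) {
    // No separators left: fall back to hard cuts of maxLength characters.
    const pieces: string[] = [];
    for (let i = 0; i < text.length; i += maxLength) {
      pieces.push(text.slice(i, i + maxLength));
    }
    return pieces;
  }
  const [separator, ...finer] = separators;
  const chunks: string[] = [];
  let current = "";
  for (const part of text.split(separator)) {
    const candidate = current === "" ? part : current + separator + part;
    if (candidate.length <= maxLength) {
      current = candidate; // keep merging small neighbors into one chunk
    } else {
      if (current !== "") chunks.push(current);
      if (part.length <= maxLength) {
        current = part;
      } else {
        chunks.push(...recursiveChunk(part, maxLength, finer));
        current = "";
      }
    }
  }
  if (current !== "") chunks.push(current);
  return chunks;
}

const sections = recursiveChunk("Intro paragraph.\n\nSecond paragraph that is longer.", 20);
console.log(sections); // ["Intro paragraph.", "Second paragraph", "that is longer."]
```

The short first paragraph survives intact, while only the oversized second paragraph is broken at word boundaries. A fixed-size splitter cannot make that distinction.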

A new “crawl endpoint” is also in development, which will make it easier to scan and ingest entire websites automatically. These features aim to make AutoRAG more powerful, flexible, and useful across a wider range of applications.

Tips for Getting Started

Here’s how to get the most out of AutoRAG:

Organize your content: Use well-structured files for better results.
Start small: Test with a few documents or pages to learn how it works.
Use the Playground: Try out queries to fine-tune your setup before going live.
Keep it fresh: AutoRAG re-indexes automatically as your data changes, and you can monitor indexing progress in the dashboard.
Use AI Gateway: Track usage and control costs easily.

Key Takeaways

AutoRAG simplifies building AI apps that use your own content.
It handles everything from data ingestion to semantic search and response generation.
You can use it with documents, websites, images, and other formats, all processed automatically.
Integration into applications is fast and uses standard Cloudflare tools and APIs.
It’s free during open beta and designed to scale, with more features coming in 2025.

Dileep Thekkethil

Author

Dileep Thekkethil is the Director of Marketing at Stan Ventures, where he applies over 15 years of SEO and digital marketing expertise to drive growth and authority. A former journalist with six years of experience, he combines strategic storytelling with technical know-how to help brands navigate the shift toward AI-driven search and generative engines. Dileep is a strong advocate for Google’s EEAT standards, regularly sharing real-world use cases and scenarios to demystify complex marketing trends. He is an avid gardener of tropical fruits, a motor enthusiast, and a dedicated caretaker of his pair of cockatiels.
