Transforms any website's documentation into an intelligent, conversational assistant. Provide a documentation URL, then ask questions about the library or framework it covers.
🎯 For developers who want to:
- Quickly understand new frameworks and libraries
- Get instant answers from complex documentation
- Save time searching through lengthy docs
- Have contextual conversations about code
Built with:
- Firecrawl - LLM-ready web crawling
- Unstructured - Document processing and intelligent chunking
- MongoDB Atlas - Vector database with similarity search
- OpenAI - Embeddings (text-embedding-3-large) and generation (gpt-4o-mini)
- AWS S3 - Temporary storage for crawled content
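The retrieval step behind "vector database with similarity search" can be illustrated with a small in-memory sketch. This is not the project's code: MongoDB Atlas runs the search server-side, and the function names (`cosine_similarity`, `top_k`) and the three-dimensional toy vectors below are assumptions made purely for illustration. Real embeddings from text-embedding-3-large have far more dimensions.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction, so treat it as unrelated
    return dot / (norm_a * norm_b)


def top_k(
    query: Sequence[float],
    chunks: List[Tuple[str, Sequence[float]]],
    k: int = 2,
) -> List[Tuple[str, float]]:
    """Score every chunk against the query and keep the k most similar."""
    scored = [(text, cosine_similarity(query, vector)) for text, vector in chunks]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy "embeddings" (hypothetical values, chosen by hand for the example).
chunks = [
    ("useState lets a component remember state.", [0.9, 0.1, 0.0]),
    ("FastAPI uses dependencies for authentication.", [0.0, 0.2, 0.9]),
    ("useEffect synchronizes a component with external systems.", [0.8, 0.3, 0.1]),
]
query_vector = [1.0, 0.2, 0.0]  # stands in for an embedded question about React state

results = top_k(query_vector, chunks, k=2)
for text, score in results:
    print(f"{score:.3f}  {text}")
```

In a full pipeline, the highest-scoring chunks would be passed to the generation model as context for the answer.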
git clone <repository-url>
cd rag_documentation
pip install -r requirements.txt
cp .env.example .env
# Add your API keys to the .env file
python main.py
Popular examples:
- 🔗 React: https://react.dev/learn
- 🔗 FastAPI: https://fastapi.tiangolo.com
- 🔗 LangGraph: https://docs.langchain.com/langgraph-platform
- 🔗 Next.js: https://nextjs.org/docs
💭 "How do I create a state graph in LangGraph?"
💭 "What are React hooks and how do I use them?"
💭 "How do I handle authentication in FastAPI?"
💭 "What's the difference between SSR and SSG in Next.js?"
| Parameter | Description | Default | 
|---|---|---|
| limit | Number of pages to crawl | 20 | 
| max_depth | Maximum crawl depth | 5 | 
| embedding_model | OpenAI embedding model | text-embedding-3-large | 
| generation_model | OpenAI chat model | gpt-4o-mini | 
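
One way to carry the parameters in the table above through the code is a small settings object. The sketch below is an assumption about how that could look, not how the project actually stores its configuration: the `PipelineConfig` class and its validation rules are hypothetical, while the field names and defaults are taken from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PipelineConfig:
    """Hypothetical container for the crawl and model settings."""

    limit: int = 20  # number of pages to crawl
    max_depth: int = 5  # maximum crawl depth
    embedding_model: str = "text-embedding-3-large"
    generation_model: str = "gpt-4o-mini"

    def __post_init__(self) -> None:
        # Reject values that could never produce a useful crawl.
        if self.limit < 1:
            raise ValueError("limit must be at least 1")
        if self.max_depth < 0:
            raise ValueError("max_depth must not be negative")


default_config = PipelineConfig()
small_crawl = PipelineConfig(limit=5, max_depth=2)

print(default_config)
print(small_crawl)
```

Lowering `limit` and `max_depth` as in `small_crawl` is a reasonable way to try the tool on a small documentation site before committing to a larger crawl.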
- Python 3.8+
- OpenAI API key
- MongoDB Atlas cluster
- AWS S3 bucket
- Firecrawl API key
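
Because the requirements above involve several credentials, a quick check before the first run can save a failed crawl. The sketch below is only an illustration: the variable names in `REQUIRED_VARS` are guesses, so use whatever names `.env.example` actually defines. It also assumes the values are already exported into the process environment.

```python
import os
from typing import List, Mapping, Sequence

# Hypothetical variable names; check .env.example for the real ones.
REQUIRED_VARS = (
    "OPENAI_API_KEY",
    "FIRECRAWL_API_KEY",
    "MONGODB_URI",
    "AWS_S3_BUCKET",
)


def missing_settings(
    env: Mapping[str, str],
    required: Sequence[str] = REQUIRED_VARS,
) -> List[str]:
    """Return the required names that are unset or blank in env."""
    return [name for name in required if not env.get(name, "").strip()]


missing = missing_settings(os.environ)
if missing:
    print("Missing settings: " + ", ".join(missing))
else:
    print("All required settings are present.")
```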