✨ Key Features
📁 Rich File Format Support
Section titled “📁 Rich File Format Support”OpenRag supports a comprehensive range of file formats for seamless document ingestion:
- Text Files: txt,md
- Document Files: pdf,docx,doc,pptx- Advanced PDF parsing with OCR support and Office document processing
- Audio Files: wav,mp3,mp4,ogg,flv,wma,aac- Audio transcription and content extraction
- Images: png,jpeg,jpg,svg- Vision Language Model (VLM) powered image captioning and analysis
All files are intelligently converted to Markdown format with images replaced by AI-generated captions, ensuring consistent processing across all document types.
🎛️ Native Web-Based Indexer UI
Section titled “🎛️ Native Web-Based Indexer UI”Experience intuitive document management through our built-in web interface.
Indexer UI Features
- Drag-and-drop file upload with batch processing capabilities
- Real-time indexing progress monitoring and status updates
- Admin Dashboard to monitor RAG components (Indexer, VectorDB, TaskStateManager, etc)
- Partition management - organize documents into logical collections
- Visual document preview and metadata inspection
- Search and filtering capabilities for indexed content
🗂️ Partition-Based Architecture
Section titled “🗂️ Partition-Based Architecture”Organize your knowledge base with flexible partition management:
- Multi-tenant support - isolate different document collections
💬 Interactive Chat UI with Source Attribution
Section titled “💬 Interactive Chat UI with Source Attribution”Engage with your documents through our sophisticated chat interface:
Chat UI Features
- Chainlit-powered UI - modern, responsive chat experience
- Source transparency - every response includes relevant document references
🔌 OpenAI API Compatibility
Section titled “🔌 OpenAI API Compatibility”OpenRag API is tailored to be compatible with the OpenAI format (see the openai-compatibility section for more details), enabling seamless integration of your deployed RAG into popular frontends and workflows such as OpenWebUI, LangChain, N8N, and more. This ensures flexibility and ease of adoption without requiring custom adapters.
Summary of features
- Drop-in replacement for OpenAI API endpoints
- Compatible with popular frontends like OpenWebUI, LangChain, N8N, and more
- Authentication support - secure your API with token-based auth
⚡ Distributed Ray Deployment
Section titled “⚡ Distributed Ray Deployment”Scale your RAG pipeline across multiple machines and GPUs.
Distributed Ray Deployment
- Horizontal scaling - distribute processing across worker nodes
- GPU acceleration - optimize inference across available hardware
- Resource management - intelligent allocation of compute resources
- Monitoring dashboard - real-time cluster health and performance metrics
See the section on distributed deployment in a ray cluster for more details
🔍 Advanced Retrieval & Reranking
Section titled “🔍 Advanced Retrieval & Reranking”OpenRag Leverages state-of-the-art retrieval techniques for superior accuracy.
Implemented advanced retrieval techniques
- Hybrid search - combines semantic similarity with BM25keyword matching
- Contextual retrieval - Anthropic’s technique for enhanced chunk relevance
- Multilingual reranking - using Alibaba-NLP/gte-multilingual-reranker-base