✨ Key Features

📁 Rich File Format Support

OpenRag supports a comprehensive range of file formats for seamless document ingestion:

Text Files: txt, md
Document Files: pdf, docx, doc, pptx - Advanced PDF parsing with OCR support and Office document processing
Audio Files: wav, mp3, mp4, ogg, flv, wma, aac - Audio transcription and content extraction
Images: png, jpeg, jpg, svg - Vision Language Model (VLM) powered image captioning and analysis

All files are intelligently converted to Markdown format with images replaced by AI-generated captions, ensuring consistent processing across all document types.

🎵 Audio File Processing

Audio files are processed using Whisper deployed with vLLM for reliable transcription. To enable audio file indexing, configure the transcriber service:

TRANSCRIBER_COMPOSE=extern/transcriber.yaml

🎛️ Native Web-Based Indexer UI

Experience intuitive document management through our built-in web interface.

Indexer UI Features

Drag-and-drop file upload with batch processing capabilities
Real-time indexing progress monitoring and status updates
Admin Dashboard to monitor RAG components (Indexer, VectorDB, TaskStateManager, etc)
Partition management - organize documents into logical collections
Visual document preview and metadata inspection
Search and filtering capabilities for indexed content

🗂️ Partition-Based Architecture

Organize your knowledge base with flexible partition management:

Multi-tenant support - isolate different document collections

💬 Interactive Chat UI with Source Attribution

Engage with your documents through our sophisticated chat interface:

Chat UI Features

Chainlit-powered UI - modern, responsive chat experience
Source transparency - every response includes relevant document references

🔌 OpenAI API Compatibility

OpenRag API is tailored to be compatible with the OpenAI format (see the openai-compatibility section for more details), enabling seamless integration of your deployed RAG into popular frontends and workflows such as OpenWebUI, LangChain, N8N, and more. This ensures flexibility and ease of adoption without requiring custom adapters.

Summary of features

Drop-in replacement for OpenAI API endpoints
Compatible with popular frontends like OpenWebUI, LangChain, N8N, and more
Authentication support - secure your API with token-based auth

⚡ Distributed Ray Deployment

Scale your RAG pipeline across multiple machines and GPUs.

Distributed Ray Deployment

Horizontal scaling - distribute processing across worker nodes
GPU acceleration - optimize inference across available hardware
Resource management - intelligent allocation of compute resources
Monitoring dashboard - real-time cluster health and performance metrics

See the section on distributed deployment in a ray cluster for more details

🔍 Advanced Retrieval & Reranking

OpenRag Leverages state-of-the-art retrieval techniques for superior accuracy.

Implemented advanced retrieval techniques

Hybrid search - combines semantic similarity with BM25 keyword matching
Contextual retrieval - Anthropic’s technique for enhanced chunk relevance
Multilingual reranking - using Alibaba-NLP/gte-multilingual-reranker-base