Skip to content

✨ Key Features

OpenRag supports a comprehensive range of file formats for seamless document ingestion:

  • Text Files: txt, md
  • Document Files: pdf, docx, doc, pptx - Advanced PDF parsing with OCR support and Office document processing
  • Audio Files: wav, mp3, mp4, ogg, flv, wma, aac - Audio transcription and content extraction
  • Images: png, jpeg, jpg, svg - Vision Language Model (VLM) powered image captioning and analysis

All files are intelligently converted to Markdown format with images replaced by AI-generated captions, ensuring consistent processing across all document types.

Experience intuitive document management through our built-in web interface.

Indexer UI Features
  • Drag-and-drop file upload with batch processing capabilities
  • Real-time indexing progress monitoring and status updates
  • Admin Dashboard to monitor RAG components (Indexer, VectorDB, TaskStateManager, etc)
  • Partition management - organize documents into logical collections
  • Visual document preview and metadata inspection
  • Search and filtering capabilities for indexed content

Organize your knowledge base with flexible partition management:

  • Multi-tenant support - isolate different document collections

💬 Interactive Chat UI with Source Attribution

Section titled “💬 Interactive Chat UI with Source Attribution”

Engage with your documents through our sophisticated chat interface:

Chat UI Features
  • Chainlit-powered UI - modern, responsive chat experience
  • Source transparency - every response includes relevant document references

OpenRag API is tailored to be compatible with the OpenAI format (see the openai-compatibility section for more details), enabling seamless integration of your deployed RAG into popular frontends and workflows such as OpenWebUI, LangChain, N8N, and more. This ensures flexibility and ease of adoption without requiring custom adapters.

Summary of features
  • Drop-in replacement for OpenAI API endpoints
  • Compatible with popular frontends like OpenWebUI, LangChain, N8N, and more
  • Authentication support - secure your API with token-based auth

Scale your RAG pipeline across multiple machines and GPUs.

Distributed Ray Deployment
  • Horizontal scaling - distribute processing across worker nodes
  • GPU acceleration - optimize inference across available hardware
  • Resource management - intelligent allocation of compute resources
  • Monitoring dashboard - real-time cluster health and performance metrics

See the section on distributed deployment in a ray cluster for more details

OpenRag Leverages state-of-the-art retrieval techniques for superior accuracy.

Implemented advanced retrieval techniques
  • Hybrid search - combines semantic similarity with BM25 keyword matching
  • Contextual retrieval - Anthropic’s technique for enhanced chunk relevance
  • Multilingual reranking - using Alibaba-NLP/gte-multilingual-reranker-base