Business RAG Assistant is an AI-powered application designed for companies to provide accurate internal information based on user queries. The system uses Retrieval-Augmented Generation (RAG) to retrieve relevant company data from a vector database and generate intelligent responses using an LLM.
This project is useful for answering questions like:
- Company details
- Management information (CEO, Manager, etc.)
- Salary structure
- Internal policies and general company knowledge
- Company data is stored in Qdrant vector store.
- When a user asks a question, relevant data is retrieved from Qdrant.
- The retrieved data is analyzed using DeepSeek LLM API.
- A clear and accurate response is generated for the user.
To maintain privacy and security:
- API keys and URLs are NOT exposed in the codebase.
- Sensitive information is managed using environment variables and configuration files.
Create a .env file in the root directory and add your LLM API keys there.
DEEPSEEK_API_KEY=your_api_key_here
The config.json file is used to store:
- Qdrant API key
- Qdrant URL
This file allows easy configuration without modifying core code.
- The notebook
vector_store_setup.ipynbis provided. - Use it to:
- Upload your data to Qdrant
- Create embeddings
- Store and retrieve vectors efficiently
business-rag-assistant/ │ ├── Images/ │ ├── Screenshot 2026-01-03 211644.png │ ├── Screenshot 2026-01-03 214912.png │ └── Screenshot 2026-01-03 215256.png │ ├── Code/ │ └── vector_store_setup.ipynb ├── README.md ├── app.py ├── config.json ├── index.html ├── requirements.txt
git clone https://github.com/AzeemAIDev/business-rag-assistant.git cd business-rag-assistant
pip install -r requirements.txt
uvicorn app:app --port 8000
- Right-click on
index.html - Select Open with Live Server
- Python
- FastAPI
- Qdrant Vector Database
- DeepSeek LLM API
- Retrieval-Augmented Generation (RAG)
Azeem
ML Engineer & AI Learner
https://github.com/AzeemAIDev