Contextual RAG Agent Project

Overview

This project implements a Contextual Retrieval-Augmented Generation (RAG) system using Flask for the backend, Streamlit for the frontend, and various AI services. The system allows users to upload PDF documents, process them, and then query the processed information using natural language.

Repository Structure

CONTEXTUAL_RAG_AGENT/
├── src/
│   ├── app.py            # Streamlit frontend application
│   └── main.py           # Flask backend application
├── .env                  # Environment variables (not in repo)
├── dockerfile            # Docker configuration for backend
├── requirements.txt      # Python dependencies
└── CONTEXTUAL_RETRIVAL+RLFH(PPO).ipynb  # Google Colab notebook with step-by-step implementation

Components

Flask Backend (src/main.py): Handles document processing, querying, and interfacing with AI services.
Streamlit Frontend (src/app.py): Provides a user-friendly interface for interacting with the system.
Docker: Optional containerization for the backend application.
Google Colab Notebook (CONTEXTUAL_RETRIVAL+RLFH(PPO).ipynb): Contains the step-by-step implementation and explanation of the RAG system.
AI Services: Utilizes Pinecone, Cohere, Google AI, and Groq for various AI tasks.

Prerequisites

Python 3.11
API keys for Pinecone, Cohere, Google AI, and Groq
Streamlit

Setup

Clone the Repository

git clone https://github.com/Pandurangmopgar/Contextual_Retrival.git
cd CONTEXTUAL_RAG_AGENT

Environment Variables Create a .env file in the root directory with the following content:

PINECONE_API_KEY=your_pinecone_api_key
COHERE_API_KEY=your_cohere_api_key
GOOGLE_API_KEY=your_google_api_key
GROQ_API_KEY=your_groq_api_key

Install Requirements
```
pip install -r requirements.txt
```

Running the Application

Option 1: Running without Docker

Start the Flask Backend
```
cd src
python main.py
```
The backend will start running on http://localhost:5000.
Run the Streamlit Frontend In a new terminal window:
```
cd src
streamlit run app.py
```
The Streamlit frontend will open in your default web browser.

Option 2: Using Docker (Optional)

If you prefer to use Docker, you can create your own Docker image:

Build the Docker Image
```
docker build -t _name_of_your_image .
```

Run the Docker Container

docker run -p 5000:5000 _name_of_your_image

Run the Streamlit Frontend
```
cd src
streamlit run app.py
```

Google Colab Notebook

The CONTEXTUAL_RETRIVAL+RLFH(PPO).ipynb notebook in the repository provides a detailed, step-by-step implementation of the RAG system. To use it:

Open the notebook in Google Colab.
Follow the instructions to set up your environment and API keys.
Run through the cells to understand the implementation details and experiment with the system.

Usage

Process a Document
- Use the Streamlit interface to upload a PDF file.
- The system will process and store the document information.
Query the System
- Enter your query in the Streamlit interface.
- The system will retrieve relevant information from processed documents.

System Architecture

The Contextual RAG system works as follows:

Document Processing:
- PDFs are uploaded and text is extracted.
- Text is split into chunks.
- Each chunk is contextualized using AI.
- Contextualized chunks are stored in Pinecone and a local BM25 index.
Querying:
- User sends a query through the Streamlit interface.
- Backend performs hybrid search (vector + BM25).
- Results are re-ranked.
- Final answer is generated using retrieved context.
Caching:
- Redis is used to cache various results to improve performance.

Troubleshooting

Ensure all API keys are correctly set in the .env file.
For backend issues, check the console output where you started the Flask application.
For frontend issues, check the Streamlit console output.
Verify that all required packages are installed and listed in requirements.txt.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
.gitignore		.gitignore
CONTEXTUAL_RETRIVAL+RLFH(PPO).ipynb		CONTEXTUAL_RETRIVAL+RLFH(PPO).ipynb
README.md		README.md
dockerfile		dockerfile
requirements.txt		requirements.txt
sample_.env		sample_.env

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contextual RAG Agent Project

Overview

Repository Structure

Components

Prerequisites

Setup

Running the Application

Option 1: Running without Docker

Option 2: Using Docker (Optional)

Google Colab Notebook

Usage

System Architecture

Troubleshooting

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Contextual RAG Agent Project

Overview

Repository Structure

Components

Prerequisites

Setup

Running the Application

Option 1: Running without Docker

Option 2: Using Docker (Optional)

Google Colab Notebook

Usage

System Architecture

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages