ScholarQA is a privacy-first, 100% local Retrieval-Augmented Generation (RAG) application for researchers and academics. It lets users ingest complex PDF documents (such as research papers) and rigorously verify claims against the text without sending sensitive, unreleased data to cloud APIs.

Key features:
- Utilizes Ollama to run both embedding models and Large Language Models locally, guaranteeing data privacy.
- Converts PDFs into vector embeddings for highly accurate context retrieval.
- The LLM is strictly prompted to return "Not Found" if a claim cannot be explicitly verified against the uploaded text (see the sketch after this list).
- A UI that exposes the exact raw text chunks the AI used to verify a claim, ensuring algorithmic transparency.
- Group related papers using unique Project IDs and instantly wipe local databases when finished.
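To make the grounding behaviour concrete, here is a minimal sketch of posing such a strictly grounded question to a local `llama3` through the `ollama` Python client. The prompt wording and helper name are illustrative assumptions, not the project's actual code:

```python
# Minimal sketch of strict, grounded claim verification with a local LLM.
# The prompt wording and function name are illustrative, not the app's code.
# Requires: pip install ollama (with the Ollama daemon running)
import ollama

def verify_claim(claim: str, context_chunks: list[str]) -> str:
    """Ask llama3 to verify a claim using ONLY the supplied context."""
    context = "\n\n".join(context_chunks)
    prompt = (
        "Answer using ONLY the context below. If the context does not "
        "explicitly support the claim, reply exactly 'Not Found'.\n\n"
        f"Context:\n{context}\n\nClaim: {claim}"
    )
    response = ollama.chat(
        model="llama3",
        messages=[{"role": "user", "content": prompt}],
    )
    return response["message"]["content"]

print(verify_claim("The study used 50 participants.",
                   ["Methods: We recruited 50 adult participants."]))
```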
Tech stack:

- Frontend: Streamlit
- Backend: FastAPI, Uvicorn
- Database: ChromaDB (Local Vector Store)
- AI Models (via Ollama):
  - `llama3` for reasoning and claim verification
  - `nomic-embed-text` for fast local document embeddings (see the ingestion sketch after this list)
- PDF Processing: Unstructured / PyPDF
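As a rough sketch of how the ingestion side of this stack fits together, the snippet below embeds PDF text with `nomic-embed-text` via Ollama and stores the chunks in a local ChromaDB collection keyed by a Project ID. The use of `pypdf`, the file paths, chunk size, and collection layout are illustrative assumptions, not the app's actual code:

```python
# Hypothetical ingestion sketch: PDF text -> local embeddings -> ChromaDB.
# Requires: pip install ollama chromadb pypdf
import chromadb
import ollama
from pypdf import PdfReader

PROJECT_ID = "thesis_v1"  # assumption: one ChromaDB collection per project
client = chromadb.PersistentClient(path="./chroma_db")  # assumed storage path
collection = client.get_or_create_collection(name=PROJECT_ID)

# Extract raw text from every page of the PDF.
reader = PdfReader("paper.pdf")
text = "\n".join(page.extract_text() or "" for page in reader.pages)

# Naive fixed-size chunking; the real app may use smarter splitting.
chunks = [text[i:i + 1000] for i in range(0, len(text), 1000)]

for idx, chunk in enumerate(chunks):
    # Embed each chunk locally -- nothing leaves the machine.
    emb = ollama.embeddings(model="nomic-embed-text", prompt=chunk)["embedding"]
    collection.add(ids=[f"paper-{idx}"], embeddings=[emb], documents=[chunk])

# Wiping a finished project is a single call:
# client.delete_collection(name=PROJECT_ID)
```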
Ensure you have Python 3.10+ installed. You must also install Ollama and download the required local models:

```bash
ollama pull llama3
ollama pull nomic-embed-text
```

Clone the repository:

```bash
git clone https://github.com/UmarNasib/scholarQa.git
cd scholarQa
```

Create and activate a virtual environment, then install the dependencies:

```bash
python3 -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate
pip install -r requirements.txt
```
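Before launching the app, you can optionally confirm that the Ollama daemon is reachable and both models are downloaded. This check is a convenience sketch, not a script shipped with the repository:

```python
# Sanity check: verify Ollama is running and the required models are pulled.
# Requires: pip install ollama
import ollama

for model in ("llama3", "nomic-embed-text"):
    try:
        ollama.show(model)  # raises if the daemon is down or the model is missing
        print(f"{model}: OK")
    except Exception as exc:
        print(f"{model}: unavailable ({exc}) -- try `ollama pull {model}`")
```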
Because this app separates the frontend and backend for better performance, you need to run two terminal windows.

Terminal 1 (backend):

```bash
source venv/bin/activate
python3 -m app.main
```

Terminal 2 (frontend):

```bash
source venv/bin/activate
streamlit run app/ui.py
```

Usage:

- Open the UI in your browser (usually http://localhost:8501).
- Enter a Project ID in the sidebar (e.g., `thesis_v1`).
- Go to the Upload Documents tab, upload a PDF, and click Process Document.
- Switch to the Verify Claims tab and enter a hypothesis or claim.
- Review the AI's verdict and click "View Raw Extracted Context" to see the exact paragraphs used for the audit (a sketch of this retrieval step follows below).
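The raw context shown in that audit view corresponds to the nearest-neighbour chunks retrieved for the claim. Here is a minimal sketch of that retrieval step, assuming the same local store and embedding model as in the ingestion sketch above (the collection name and parameters are illustrative):

```python
# Sketch of the retrieval step behind "View Raw Extracted Context".
# Requires: pip install ollama chromadb
import chromadb
import ollama

claim = "The study used 50 participants."
client = chromadb.PersistentClient(path="./chroma_db")  # assumed path
collection = client.get_or_create_collection(name="thesis_v1")

# Embed the claim with the same local model used at ingestion time.
query_emb = ollama.embeddings(model="nomic-embed-text", prompt=claim)["embedding"]
results = collection.query(query_embeddings=[query_emb], n_results=4)

# These are the exact chunks a UI could surface for transparency.
for chunk in results["documents"][0]:
    print("---\n", chunk)
```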