Vectorless PDF Chatbot

A revolutionary PDF chatbot that uses no vector embeddings or traditional RAG. Instead, it leverages Large Language Models for intelligent document selection and page relevance detection, providing a completely stateless and privacy-first experience.

🚀 What Makes This "Vectorless"?

Traditional PDF chatbots convert documents into vector embeddings for semantic search. This approach:

❌ Requires expensive vector databases
❌ Needs pre-processing and indexing
❌ Stores document data on servers
❌ Loses context and nuance in embeddings

Our Vectorless approach:

✅ Uses LLM reasoning instead of vectors
✅ Processes documents in real-time
✅ Completely stateless - no server storage
✅ Preserves full document context
✅ Privacy-first - documents stay in your browser

🧠 How the Vectorless Process Works

3-Step Intelligent Document Analysis

graph TD
    A[📄 User uploads PDFs] --> B[🧠 LLM Document Selection]
    B --> C[🎯 LLM Page Relevance Detection]
    C --> D[💬 Contextual Answer Generation]
    
    B --> B1[Analyzes collection description<br/>+ document filenames<br/>+ user question]
    C --> C1[Examines actual page content<br/>from selected documents<br/>in parallel processing]
    D --> D1[Generates comprehensive answer<br/>with proper citations]

Step 1: 🧠 Smart Document Selection

LLM reads your collection description and document filenames
Intelligently selects which documents are likely to contain relevant information
No embeddings needed - uses reasoning and context understanding

Step 2: 🎯 Page Relevance Detection

LLM examines actual page content from selected documents
Processes multiple documents in parallel for speed
Identifies the most relevant pages based on question context

Step 3: 💬 Contextual Answer Generation

Uses only the relevant pages to generate accurate answers
Maintains full document context and nuance
Provides proper citations and references

✨ Key Features

🔒 Privacy-First & Stateless

Zero Server Storage: Documents processed and stored entirely in your browser
LocalStorage Persistence: Your documents persist across browser sessions
No Data Leakage: Document content never persists on servers
Serverless-Friendly: Perfect for Vercel/Netlify deployments

📁 Advanced File Handling

Up to 100 PDF documents per session
Chunked Upload System: Automatically handles large file sets (>4.5MB)
4.5MB per file limit - processes substantial documents
Real-time Processing: No pre-indexing required

💡 Intelligent Processing

Multi-Model Support: GPT-4, GPT-5-mini, and more
Parallel Processing: Multiple documents analyzed simultaneously
Context Preservation: Full document context maintained throughout
Dynamic Descriptions: Edit collection descriptions anytime

🎨 Modern Interface

Responsive Design: Works on desktop and mobile
Real-time Progress: Visual feedback during uploads and processing
GitHub Integration: Easy access to source code
Error Handling: Comprehensive error messages and recovery

🛠 Technology Stack

Frontend

Next.js 15: React framework with App Router
TypeScript: Type safety and better development experience
Tailwind CSS: Modern utility-first styling
Lucide React: Beautiful, consistent icons

Backend (Vercel Functions)

Python Functions: Serverless API endpoints
PyPDF2: Reliable PDF text extraction
OpenAI GPT: Advanced language models for reasoning
Chunked Processing: Handle large uploads efficiently

Infrastructure

Vercel Deployment: Seamless serverless hosting
No Databases: Completely stateless architecture
Automatic Scaling: Handle traffic spikes effortlessly

🚀 Quick Start

Prerequisites

Node.js 18+ and npm
OpenAI API key

1. Clone and Install

git clone https://github.com/roe-ai/vectorless-chatbot.git
cd vectorless-chatbot
npm install

2. Environment Setup

Create .env.local:

# Only needed for local development
OPENAI_API_KEY=your_openai_api_key_here

3. Run Locally

npm run dev

Visit http://localhost:3000

4. Deploy to Vercel

Push to GitHub
Connect to Vercel
Set environment variable: OPENAI_API_KEY=your_key
Deploy! ✅

📖 How to Use

1. Upload Your Documents

Click "Add Your First Document" or "Add Files"
Select up to 100 PDF files (4.5MB each max)
Add a description of your document collection
Large uploads are automatically chunked for reliability

2. Start Chatting

Ask questions about your documents in natural language
Watch the 3-step process: Document Selection → Page Detection → Answer Generation
Get detailed answers with timing and cost breakdowns

3. Manage Your Collection

Add more documents anytime
Edit collection descriptions
Start new sessions as needed
All data stays in your browser

🏗 Architecture Deep Dive

Stateless Design

┌─────────────────┐    ┌──────────────────┐    ┌─────────────────┐
│   Browser       │    │  Vercel Functions │    │   OpenAI API    │
│                 │    │                  │    │                 │
│ • LocalStorage  │◄──►│ • /api/upload    │◄──►│ • GPT Models    │
│ • Document Data │    │ • /api/chat/stream│    │ • Real-time     │
│ • Chat History  │    │ • No Storage     │    │   Processing    │
│ • Session State │    │ • Stateless      │    │                 │
└─────────────────┘    └──────────────────┘    └─────────────────┘

Chunked Upload System

When uploading large document sets:

Size Detection: Frontend calculates total upload size
Automatic Chunking: Splits into 3.5MB chunks if needed
Parallel Processing: Each chunk processed independently
Progressive Results: Documents become available as chunks complete
Error Recovery: Failed chunks can be retried individually

🔧 API Endpoints

`POST /api/upload`

Upload and process PDF documents

Input: FormData with files and description
Output: Processed documents with extracted text
Features: Automatic chunking, progress tracking

`POST /api/chat/stream`

Stream chat responses in real-time

Input: Question, documents, chat history
Output: Server-sent events with processing steps
Features: Real-time progress, cost tracking, citations

`GET /api/health`

Service health check

Output: System status and mode information

🎯 Advantages Over Traditional RAG

Traditional RAG	Vectorless Approach
🗄️ Requires vector database	🚫 No database needed
📊 Pre-processes to embeddings	🔄 Real-time processing
💰 Expensive infrastructure	💸 Serverless & cost-effective
🔒 Stores data on servers	🛡️ Browser-only storage
📏 Limited by embedding dimensions	🧠 Full context understanding
⚡ Fast retrieval, lossy context	🎯 Accurate reasoning, full context

🌟 Example Workflow

Upload: Marketing team uploads 50 company PDFs
Describe: "Company policies, procedures, and guidelines"
Ask: "What is our remote work policy?"
Process:
- 🧠 LLM selects "HR Handbook" and "Remote Work Guidelines"
- 🎯 Identifies relevant pages about remote work
- 💬 Generates comprehensive answer with citations
Result: Accurate answer in ~15 seconds with cost breakdown

🔮 Future Enhancements

Multi-format Support: Word docs, PowerPoint, Excel
Advanced Citations: Highlight exact text passages
Collaboration Features: Share sessions with team members
Analytics Dashboard: Usage patterns and insights
Custom Models: Support for local and custom LLMs
Batch Operations: Process multiple questions simultaneously

🤝 Contributing

We welcome contributions! This project showcases how modern LLMs can replace traditional vector-based approaches while providing better accuracy and user experience.

📄 License

MIT License - see LICENSE file for details.

⭐ Star us on GitHub if you find this vectorless approach interesting!

Built with ❤️ by ROE AI Inc.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
api		api
app		app
backend		backend
public		public
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.ts		config.ts
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
requirements.txt		requirements.txt
tsconfig.json		tsconfig.json
vercel.json		vercel.json

License

roe-ai/vectorless

Folders and files

Latest commit

History

Repository files navigation