FREE AI-powered bioinformatics platform for gene expression analysis and literature summarization
BioIntel.AI is a comprehensive, completely FREE platform that combines advanced bioinformatics analysis with AI-powered literature mining. Built with free alternatives to expensive AI APIs, it provides researchers with powerful tools for gene expression analysis, biomarker discovery, and scientific literature summarization at zero cost.
- Data Upload & Validation: Support for CSV, Excel, and other common formats
- Quality Assessment: Comprehensive data quality metrics and validation
- Exploratory Data Analysis: Statistical summaries and visualizations
- Principal Component Analysis (PCA): Dimensionality reduction and visualization
- Clustering Analysis: K-means and hierarchical clustering (both illustrated in the sketch after this feature list)
- Interactive Visualizations: PCA plots, heatmaps, and correlation matrices
- Abstract Processing: AI-powered summarization using Hugging Face Transformers
- PDF Processing: Full-text extraction and analysis from research papers
- Entity Extraction: Rule-based identification of genes, biomarkers, diseases, and methods
- Chat with Papers: Interactive Q&A with research papers using free AI
- Literature Search: Comprehensive PubMed integration (FREE API)
- PubMed Integration: Literature search and analysis
- UniProt API: Protein information and annotations
- Ensembl API: Gene annotations and genomic data
- STRING API: Protein-protein interaction networks
- KEGG API: Pathway and metabolic information
- HTML Reports: Interactive web-based reports
- PDF Export: Professional PDF reports for sharing
- Custom Templates: 7 pre-configured analysis templates
- Automated Workflows: 5 complete research pipelines
- Team Collaboration: Workspaces and shared analyses
- API Access: Rate-limited API keys for programmatic access
- Usage Analytics: Comprehensive usage monitoring
- Role-based Access: User permissions and access control
- JWT Authentication: Secure user authentication
- Data Encryption: Secure data storage and transmission
- Rate Limiting: API protection and usage limits
- Input Validation: Comprehensive security measures
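For a rough picture of what the PCA and clustering features compute, here is a minimal standalone sketch using pandas and scikit-learn. The file name, data layout, and cluster count are assumptions for illustration only; the platform's own endpoints handle all of this for you.

```python
# Minimal sketch of a PCA + k-means workflow (assumes a CSV with samples as
# rows and genes as columns; adapt to your own data layout).
import pandas as pd
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans

# Load an expression matrix (hypothetical file name)
expr = pd.read_csv("expression_matrix.csv", index_col=0)

# Standardize so highly expressed genes do not dominate the components
scaled = StandardScaler().fit_transform(expr)

# Reduce to two principal components, as in the /api/bio/pca example further below
pca = PCA(n_components=2)
components = pca.fit_transform(scaled)
print("Explained variance ratio:", pca.explained_variance_ratio_)

# Group samples into clusters on the reduced representation
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(components)
print(pd.Series(labels, index=expr.index, name="cluster").head())
```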
- Python 3.8 or higher
- Database (PostgreSQL recommended - free tier available)
- Redis (for caching - free tier available)
- Git
- Vercel (Recommended) - Zero cost, one-click deployment
- Railway - Free tier with 500 hours/month
- Heroku - Simple buildpack-based deployment (free tier no longer offered)
- Docker - Local deployment
- Clone the repository:
  git clone https://github.com/biointel-ai/biointel.git
  cd biointel
- Configure environment variables:
  cp .env.example .env
  # Edit .env with your FREE database and Redis URLs
- Deploy with Docker (Recommended):
  docker-compose up -d
- Or deploy to Vercel:
  npm i -g vercel
  vercel --prod
- API Documentation: http://localhost:8000/docs
- Frontend: http://localhost:8501 (if using Streamlit)
- Health Check: http://localhost:8000/health
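To confirm the deployment came up, you can hit the health check above from a short script. A minimal sketch with the requests library follows; the exact response payload depends on your deployment.

```python
# Quick post-deployment smoke test against the health endpoint.
import requests

resp = requests.get("http://localhost:8000/health", timeout=5)
resp.raise_for_status()
print("API is up:", resp.json())  # exact payload depends on the deployment
```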
- Create a virtual environment:
  python3 -m venv venv
  source venv/bin/activate
- Install dependencies:
  pip install -r requirements.txt
- Set up the environment:
  cp .env.example .env
  # Edit .env with your configuration (no API keys required)
- Run database migrations:
  alembic upgrade head
- Start the application:
  uvicorn api.main:app --reload --port 8000
# Database (FREE tier options: Supabase, PlanetScale, Neon)
DATABASE_URL=postgresql://user:password@localhost/biointel
# Security
SECRET_KEY=your-secret-key-change-in-production
JWT_SECRET_KEY=your-jwt-secret-key-change-in-production
# Free AI Configuration (NO API KEYS NEEDED)
USE_FREE_AI=true
HUGGINGFACE_CACHE_DIR=/tmp/huggingface
TORCH_CACHE_DIR=/tmp/torch
# Free APIs (NO API KEYS NEEDED)
PUBMED_BASE_URL=https://eutils.ncbi.nlm.nih.gov/entrez/eutils
UNIPROT_BASE_URL=https://rest.uniprot.org
ENSEMBL_BASE_URL=https://rest.ensembl.org
STRING_BASE_URL=https://string-db.org/api
KEGG_BASE_URL=https://rest.kegg.jp
# Redis (FREE tier options: Upstash, Redis Labs)
REDIS_URL=redis://localhost:6379

No API keys required! See .env.example for complete configuration options.
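Since the literature APIs above are public and keyless, they can be sanity-checked directly. The sketch below reads PUBMED_BASE_URL from the environment and runs a PubMed E-utilities search; the query term and result handling are illustrative only.

```python
# Sanity-check the keyless PubMed E-utilities endpoint configured above.
import os
import requests

base = os.environ.get("PUBMED_BASE_URL", "https://eutils.ncbi.nlm.nih.gov/entrez/eutils")

resp = requests.get(
    f"{base}/esearch.fcgi",
    params={"db": "pubmed", "term": "breast cancer biomarkers", "retmode": "json", "retmax": 5},
    timeout=10,
)
resp.raise_for_status()
ids = resp.json()["esearchresult"]["idlist"]
print("Top PubMed IDs:", ids)
```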
Once the application is running, you can access:
- Interactive API Documentation: http://localhost:8000/docs
- ReDoc Documentation: http://localhost:8000/redoc
- OpenAPI Specification: http://localhost:8000/openapi.json
- Upload Dataset:
  curl -X POST "http://localhost:8000/api/bio/upload" \
    -H "Authorization: Bearer YOUR_TOKEN" \
    -F "file=@your_dataset.csv" \
    -F "metadata={\"name\":\"My Dataset\",\"description\":\"Test dataset\"}"
- Perform PCA Analysis:
  curl -X POST "http://localhost:8000/api/bio/pca" \
    -H "Authorization: Bearer YOUR_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{"dataset_id": 1, "n_components": 2}'
- Generate Report:
  curl -X POST "http://localhost:8000/api/reports/generate" \
    -H "Authorization: Bearer YOUR_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{"report_type": "analysis", "title": "My Analysis Report", "dataset_ids": [1]}'
- Process Abstract:
  curl -X POST "http://localhost:8000/api/literature/abstract" \
    -H "Authorization: Bearer YOUR_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{"abstract": "Your research abstract here...", "title": "Paper Title"}'
- Chat with Paper:
  curl -X POST "http://localhost:8000/api/literature/chat/1" \
    -H "Authorization: Bearer YOUR_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{"question": "What biomarkers were identified?"}'
BioIntel.AI/
├── api/ # FastAPI routes
│ ├── auth.py # Authentication endpoints
│ ├── bioinformatics.py # Data analysis endpoints
│ ├── literature.py # Literature processing endpoints
│ ├── reports.py # Report generation endpoints
│ └── main.py # Main application
├── models/ # SQLAlchemy models
├── services/ # Business logic
├── utils/ # Utilities and helpers
├── templates/ # Report templates
└── tests/ # Test suite
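To give a feel for how the api/ layer is laid out, here is a purely hypothetical, stripped-down route module in the style of bioinformatics.py; the names and schema are illustrative only and will differ from the real code.

```python
# Hypothetical, simplified sketch of a route module under api/ -- illustrative only.
from fastapi import APIRouter
from pydantic import BaseModel

router = APIRouter(prefix="/api/bio", tags=["bioinformatics"])

class PCARequest(BaseModel):
    dataset_id: int
    n_components: int = 2

@router.post("/pca")
async def run_pca(request: PCARequest) -> dict:
    # In the real service layer this would load the dataset, standardize it,
    # and fit a PCA model; here we only echo the request to show the shape.
    return {"dataset_id": request.dataset_id, "n_components": request.n_components}
```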
# Run all tests
./run_tests.sh
# Run specific test types
./run_tests.sh -t unit # Unit tests
./run_tests.sh -t integration # Integration tests
./run_tests.sh -t api # API tests
# Run with coverage
./run_tests.sh -c
# Run specific test file
./run_tests.sh -f test_bioinformatics.py
# Verbose output
./run_tests.sh -v

# Deploy with Docker Compose
./docker-deploy.sh
# Deploy in production mode
./docker-deploy.sh -e production --rebuild
# Scale services
./docker-deploy.sh --scale-api 3 --scale-frontend 2

# Deploy to Vercel
./deploy.sh -p vercel -e production
# Set environment variables
vercel env add DATABASE_URL
vercel env add SECRET_KEY
# Optional: only needed if you enable paid AI providers instead of the free Hugging Face models
vercel env add ANTHROPIC_API_KEY
vercel env add OPENAI_API_KEY

# Deploy to Heroku
./deploy.sh -p heroku -e production

# Build image
docker build -t biointel-ai .
# Run container
docker run -p 8000:8000 biointel-ai
# Using Docker Compose
docker-compose up -d
# View logs
docker-compose logs -f

- Set up production environment variables
- Install dependencies:
  pip install -r requirements.txt
- Run migrations:
  alembic upgrade head
- Start with the production server:
  gunicorn api.main:app -k uvicorn.workers.UvicornWorker
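If you prefer to keep worker settings out of the command line, gunicorn can also read them from a config file. A minimal sketch follows; the worker count and timeout are starting points, not project defaults.

```python
# gunicorn.conf.py -- minimal production settings (values are starting points).
# Start with: gunicorn api.main:app -c gunicorn.conf.py
import multiprocessing

bind = "0.0.0.0:8000"
workers = multiprocessing.cpu_count() * 2 + 1
worker_class = "uvicorn.workers.UvicornWorker"
timeout = 60
accesslog = "-"   # log requests to stdout
errorlog = "-"    # log errors to stderr
```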
We welcome contributions! Please see our Contributing Guide for details.
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests
- Submit a pull request
This project is licensed under the MIT License - see the LICENSE file for details.
- 📖 Documentation: https://docs.biointel.ai
- 🐛 Issues: https://github.com/biointel-ai/biointel/issues
- 💬 Discussions: https://github.com/biointel-ai/biointel/discussions
- 📧 Email: [email protected]
- Built with FastAPI
- AI powered by Hugging Face Transformers (with optional Anthropic Claude and OpenAI integrations)
- Bioinformatics analysis with scikit-learn and pandas
- Visualizations with Plotly and Matplotlib
Made with ❤️ by the BioIntel.AI Team