A Streamlit-based application that enriches data from Google Sheets or CSV files using AI and web search capabilities.
- Google Sheets Integration
- CSV File Support
- AI-powered data enrichment using Groq
- Web search integration using Google Serper API
- Rate limiting for API calls
- Export results to CSV
- Python 3.8+
- Google Cloud Project with Sheets API enabled
- Groq API key
- Serper API key
- Clone the repository:
git clone https://github.com/pantShrey/AI-Data-Enrichment-Agent.git
cd ai-data-enrichment
- Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
- Copy
.env.example
to.env
and fill in your API keys:
cp .env.example .env
- Set up Google OAuth credentials:
- Go to Google Cloud Console
- Create a new project or select existing one
- Enable Google Sheets and Drive APIs
- Create OAuth 2.0 credentials
- Download client secret file and place it in the project root
- Start the application:
streamlit run app/main.py
- Select your data source (Google Sheets or CSV)
- Authenticate with Google (if using Google Sheets)
- Select the column to process
- Enter your query template
- Process the data and download results
The following environment variables are required:
GROQ_API_KEY
: Your Groq API keySERPER_API_KEY
: Your Serper API keyCLIENT_SECRET_FILE
: Path to your Google OAuth client secret file
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
https://www.loom.com/share/68f008635ae04c7ea52917147d1c5080
Don't know how to blur the code on loom sorry for that