This repository provides utilities for working with PDF documents using LlamaExtract.
The main workflow is:
- Split large PDFs into smaller ones by chapters.
- Run the LlamaExtract scripts to extract structured information.
pip install python-dotenv PyPDF2
Create a .env file in the root of the project with the following content:
LLAMA_CLOUD_API_KEY=your_api_key_here