npm-pdfreader is a Node.js library for reading text and parsing tables from PDF files. It supports tabular data with automatic column detection and rule-based parsing, making it useful for extracting structured data from PDFs.
Features
- Reads text content from PDF files
- Parses tables with automatic column detection
- Supports rule-based data extraction
- Handles various PDF layouts and formats
- Integrates with Node.js applications
- Open-source under the MIT license
Categories
Data AnalyticsLicense
MIT LicenseFollow npm-pdfreader
Other Useful Business Software
Gen AI apps are built with MongoDB Atlas
MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of npm-pdfreader!