MarkPDFDown

English | 中文

A powerful tool that leverages multimodal large language models to transcribe PDF files into Markdown format.

Overview

MarkPDFDown is designed to simplify the process of converting PDF documents into clean, editable Markdown text. By utilizing advanced multimodal AI models, it can accurately extract text, preserve formatting, and handle complex document structures including tables, formulas, and diagrams.

Features

PDF to Markdown Conversion: Transform any PDF document into well-formatted Markdown
Image to Markdown Conversion: Transform image into well-formatted Markdown
Multimodal Understanding: Leverages AI to comprehend document structure and content
Format Preservation: Maintains headings, lists, tables, and other formatting elements
Customizable Model: Configure the model to suit your needs

Demo

Installation

Using uv (Recommended)

# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh

# Clone the repository
git clone https://github.com/MarkPDFdown/markpdfdown.git
cd markpdfdown

# Install dependencies and create virtual environment
uv sync

Using conda

conda create -n markpdfdown python=3.9
conda activate markpdfdown

# Clone the repository
git clone https://github.com/MarkPDFdown/markpdfdown.git
cd markpdfdown

# Install dependencies
pip install -e .

Usage

# Set up your OpenAI API key
export OPENAI_API_KEY="your-api-key"
# Optionally, set up your OpenAI API base
export OPENAI_API_BASE="your-api-base"
# Optionally, set up your OpenAI API model
export OPENAI_DEFAULT_MODEL="your-model"

# pdf to markdown
python main.py < tests/input.pdf > output.md

# image to markdown
python main.py < input_image.png > output.md

Advanced Usage

python main.py page_start page_end < tests/input.pdf > output.md

Docker Usage

docker run -i -e OPENAI_API_KEY=your-api-key -e OPENAI_API_BASE=your-api-base -e OPENAI_DEFAULT_MODEL=your-model jorbenzhu/markpdfdown < input.pdf > output.md

Development Setup

Code Quality Tools

This project uses ruff for linting and formatting, and pre-commit for automated code quality checks.

Install development dependencies

# If using uv
uv sync --group dev

# If using pip
pip install -e ".[dev]"

Set up pre-commit hooks

# Install pre-commit hooks
pre-commit install

# Run pre-commit on all files (optional)
pre-commit run --all-files

Code formatting and linting

# Format code with ruff
ruff format

# Run linting checks
ruff check

# Fix auto-fixable issues
ruff check --fix

Requirements

Python 3.9+
uv (recommended for package management) or conda/pip
Dependencies specified in pyproject.toml
Access to the specified multimodal AI model

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch ( git checkout -b feature/amazing-feature )
Set up the development environment:
```
uv sync --group dev
pre-commit install
```

Make your changes and ensure code quality:

ruff format
ruff check --fix
pre-commit run --all-files

Commit your changes ( git commit -m 'feat: Add some amazing feature' )
Push to the branch ( git push origin feature/amazing-feature )
Open a Pull Request

Please ensure your code follows the project's coding standards by running the linting and formatting tools before submitting.

License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

Acknowledgments

Thanks to the developers of the multimodal AI models that power this tool
Inspired by the need for better PDF to Markdown conversion tools

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
.github/workflows		.github/workflows
core		core
tests		tests
.env.sample		.env.sample
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
README_zh.md		README_zh.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MarkPDFDown

Overview

Features

Demo

Installation

Using uv (Recommended)

Using conda

Usage

Advanced Usage

Docker Usage

Development Setup

Code Quality Tools

Install development dependencies

Set up pre-commit hooks

Code formatting and linting

Requirements

Contributing

License

Acknowledgments

About

Uh oh!

Releases 8

Packages

Uh oh!

Contributors 3

Languages

License

MarkPDFdown/markpdfdown

Folders and files

Latest commit

History

Repository files navigation

MarkPDFDown

Overview

Features

Demo

Installation

Using uv (Recommended)

Using conda

Usage

Advanced Usage

Docker Usage

Development Setup

Code Quality Tools

Install development dependencies

Set up pre-commit hooks

Code formatting and linting

Requirements

Contributing

License

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Uh oh!

Contributors 3

Languages

Packages