Browse free open source Text Processing software and projects below. Use the toggles on the left to filter open source Text Processing software by OS, license, language, programming language, and project status.

  • 1
    BibDesk

    Bibliography manager for Mac OS X

    BibDesk is a graphical BibTeX bibliography manager for Mac OS X.
    Downloads: 6,688 This Week
  • 2
    XML Copy Editor
    XML Copy Editor is a fast, free, validating XML editor.
    Downloads: 790 This Week
  • 3
    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms, particularly GNU/Linux and Windows, and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 101 This Week
  • 4
    jq

    Lightweight and flexible command-line JSON processor

    jq is like sed for JSON data - you can use it to slice, filter, map and transform structured data with the same ease that sed, awk, grep and friends let you play with text. jq is written in portable C, and it has zero runtime dependencies. You can download a single binary, scp it to a far away machine of the same type, and expect it to work. jq can mangle the data format that you have into the one that you want with very little effort, and the program to do so is often shorter and simpler than you'd expect. Data in jq is represented as streams of JSON values - every jq expression runs for each value in its input stream, and can produce any number of values to its output stream. jq filters run on a stream of JSON data. The input to jq is parsed as a sequence of whitespace-separated JSON values which are passed through the provided filter one at a time. The output(s) of the filter are written to standard out, again as a sequence of whitespace-separated JSON data.
    Downloads: 55 This Week
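    The stream model in the jq description can be sketched in Python. This is only an illustration of the idea (whitespace-separated JSON values, each passed through a filter that may emit any number of outputs), not jq's implementation. The helper names `iter_json_values`, `run_filter`, and `names` are hypothetical; `names` roughly mirrors what the jq program `.users[].name` would produce.

```python
import json
from typing import Any, Callable, Iterable, Iterator


def iter_json_values(text: str) -> Iterator[Any]:
    """Yield each whitespace-separated JSON value found in text."""
    decoder = json.JSONDecoder()
    pos = 0
    while True:
        # raw_decode does not skip leading whitespace, so do it here.
        while pos < len(text) and text[pos].isspace():
            pos += 1
        if pos >= len(text):
            return
        value, pos = decoder.raw_decode(text, pos)
        yield value


def run_filter(text: str, flt: Callable[[Any], Iterable[Any]]) -> Iterator[str]:
    """Apply flt to every input value; emit each result as a JSON string."""
    for value in iter_json_values(text):
        for out in flt(value):  # a filter may yield zero, one, or many values
            yield json.dumps(out)


def names(value: Any) -> Iterator[Any]:
    """Hypothetical filter: emit the name of every entry under 'users'."""
    for user in value["users"]:
        yield user["name"]


if __name__ == "__main__":
    stream = '{"users": [{"name": "ada"}, {"name": "bob"}]}\n{"users": []}'
    for line in run_filter(stream, names):
        print(line)
```

    The second input value has no users, so the filter emits nothing for it; one input does not have to map to exactly one output.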
  • 5
    biber
    Biber is a sophisticated bibliography processing backend for the LaTeX biblatex package. It supports an unsurpassed feature set for automated conformance to complex bibliography style requirements such as labelling, sorting and name handling. It has comprehensive Unicode support.
    Downloads: 222 This Week
  • 6
    A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.
    Downloads: 87 This Week
  • 7
    biblatex
    Biblatex is a LaTeX package that provides full-featured bibliographic facilities.
    Downloads: 91 This Week
  • 8

    abnTeX

    abnTeX has moved to a new address: http://www.abntex.net.br

    NOTE: abnTeX has moved to a new address: http://www.abntex.net.br abnTeX is a set of LaTeX macros that follow the rules of ABNT (Associação Brasileira de Normas Técnicas, the Brazilian Association of Technical Standards). The project was completely rebuilt based on the new ABNT rules, using a new technique for producing the class. More information: Project portal: http://www.abntex.net.br Developer group: http://groups.google.com/group/abntex2 This SourceForge page holds the assets of the original project, originally hosted on the CodigoLivre.org portal.
    Downloads: 135 This Week
  • 9
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions of iText build on the success of previous versions and feature an improved document engine, high and low-level programming capabilities, and a more efficient modular structure. iText represents the next level for developers looking to leverage PDF in document workflows. The main project page for iText is now on GitHub, and all the latest releases, code samples, open source add-ons and tools, etc. can be found at https://github.com/itext/.
    Downloads: 322 This Week
  • 10
    XSLT syntax highlighting

    Java based XSLT Processor extension for syntax highlighting

    Please note that the project has moved to GitHub: https://github.com/xmlark/xslthl This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon). If you have, for example, an article about programming written in DocBook, code examples can be automatically syntax highlighted during the XSLT processing phase.
    Downloads: 105 This Week
  • 11
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Downloads: 46 This Week
  • 12

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is a tool for concatenating PDF files. It can concatenate, extract, encrypt, decrypt, and configure PDF files, and convert image files to PDF. Both GUI and CUI versions are available. iText.NET is a port of iText to the .NET Framework using J#. This library lets you generate PDF, (X)HTML, XML, and RTF files on the Microsoft .NET Framework, including ASP.NET.
    Downloads: 39 This Week
  • 13
    The DITA Open Toolkit is an implementation of the OASIS DITA XML Specification. The Toolkit transforms DITA content into many deliverable formats. See https://www.dita-ot.org/ for documentation and links to downloads. The source code and issue trackers have been moved to https://github.com/dita-ot/dita-ot
    Downloads: 24 This Week
  • 14
    Jaxe
    Jaxe is a free Java XML editor with a configurable GUI, using XML schemas for validation and XSL for exports in HTML or XML.
    Downloads: 11 This Week
  • 15
    The XSD editor is a cross-platform XML editor. Although it can be used to edit any type of XML file, the editor is specifically designed to allow easy creation, editing, and validation of XML Schema (XSD) files.
    Downloads: 28 This Week
  • 16
    RTextDoc

    An editor for structured documents

    RTextDoc is an editor for structured text documents such as LaTeX, AsciiDoc, and DocBook. RTextDoc has proofreading capabilities: on-the-fly spelling, instant grammar checking, and built-in free dictionaries. It also has syntax highlighting, bracket matching, folding, a document structure browser for sections and labels, bookmarks, a manager for LaTeX symbols, an editor for mathematical equations, an integrated BibTeX database manager, and several tools to convert LaTeX to HTML and back. AsciiDoc files can be converted to DocBook, HTML, and PDF files.
    Downloads: 14 This Week
  • 17
    Create beautiful song books for your church or fellowship using this LaTeX package and related tools.
    Downloads: 13 This Week
  • 18
    Unicode Converter

    All in one Text, Decimal, and Unicode Converter as you type.

    Unicode Converter is open source software for converting text to Unicode and decimal, and vice versa, as you type. No clicks or convert button! Latest source code: https://github.com/ahmed-fathy/unicode-converter Conversion options: Text -> Unicode and Decimal; Unicode -> Text and Decimal; Decimal -> Text and Unicode. Smart settings: settings take effect as soon as you change them and are saved and applied on subsequent runs. Formatting options: Unicode and decimal prefix and postfix to get your desired output, such as \u0001  0001 etc.
    Downloads: 35 This Week
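    The conversions listed above (text to Unicode code points or decimal values, with a configurable prefix and postfix) can be illustrated with a minimal Python sketch. This is not the tool's own code; the function names and the default prefix and postfix values are assumptions chosen for illustration.

```python
from typing import Iterable


def to_unicode(text: str, prefix: str = "\\u", postfix: str = "") -> str:
    """Render each character as a hex code point of at least four digits."""
    # Characters outside the Basic Multilingual Plane produce five or more
    # hex digits, so the output is not always a valid \uXXXX escape.
    return "".join(f"{prefix}{ord(ch):04X}{postfix}" for ch in text)


def to_decimal(text: str, prefix: str = "&#", postfix: str = ";") -> str:
    """Render each character as its decimal code point."""
    return "".join(f"{prefix}{ord(ch)}{postfix}" for ch in text)


def from_code_points(codes: Iterable[int]) -> str:
    """Turn a sequence of integer code points back into text."""
    return "".join(chr(code) for code in codes)


if __name__ == "__main__":
    print(to_unicode("Hi"))             # \u0048\u0069
    print(to_decimal("Hi"))             # &#72;&#105;
    print(from_code_points([72, 105]))  # Hi
```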
  • 19
    CONVERTCP

    Text File Codepage Converter for the Windows command line

    This command line utility is a codepage converter for changing the character encoding of text. It fully supports charsets such as ANSI code pages, UTF-8, UTF-16 LE/BE, UTF-32 LE/BE, and EBCDIC. It is designed to handle big text files, too. It runs on Windows XP onwards (tested on XP, Windows 7, Windows 8.1, Windows 10, and Windows 11). The "readme.txt" file and the Wiki give you more information. You'll find the compiled tool for 32-bit (x86) and 64-bit (x64) Windows in the "bin" directory. The C source code is available in the "src" directory. Just click on the "Files" tab. Whether or not you have a SourceForge account, you are welcome to post questions or feedback about CONVERTCP in the forum. Click on the "Discussion" tab.
    Downloads: 9 This Week
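    For readers unfamiliar with codepage conversion, the general technique can be sketched in Python. This is not CONVERTCP's implementation or command-line interface; `convert_codepage` and its parameters are hypothetical. It re-encodes a file in fixed-size chunks so that large files need not fit in memory.

```python
def convert_codepage(src_path: str, dst_path: str,
                     src_encoding: str, dst_encoding: str,
                     chunk_chars: int = 1 << 20) -> None:
    """Re-encode a text file from src_encoding to dst_encoding."""
    # newline="" disables newline translation, so CR LF and LF line endings
    # pass through unchanged.
    with open(src_path, "r", encoding=src_encoding, newline="") as src, \
         open(dst_path, "w", encoding=dst_encoding, newline="") as dst:
        while True:
            chunk = src.read(chunk_chars)  # read at most chunk_chars characters
            if not chunk:
                break
            dst.write(chunk)


if __name__ == "__main__":
    import os
    import tempfile

    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "in.txt")
        dst = os.path.join(tmp, "out.txt")
        with open(src, "wb") as f:
            f.write("café\r\n".encode("cp1252"))
        convert_codepage(src, dst, "cp1252", "utf-8")
        with open(dst, "rb") as f:
            print(f.read())
```

    Python's default error handling applies here: a character that the target encoding cannot represent raises `UnicodeEncodeError` rather than being silently replaced.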
  • 20
    Flat file extractor (ffe) can be used for reading and parsing different flat file structures and printing them in different formats. ffe is a command line tool developed in a GNU/Linux environment and distributed under the GPL. The project has moved to https://github.com/igitur/ffe
    Downloads: 10 This Week
  • 21
    Jericho HTML Parser is a Java library for analysing and manipulating parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 3 This Week
  • 22
    A C++ library to read and write PDF files, plus a GUI editor.
    Downloads: 10 This Week
  • 23
    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Downloads: 5 This Week
  • 24
    RTF to HTML converter for use both with your applications and as a standalone tool. Small and fast. Processes tables better than any other tool I've seen.
    Downloads: 9 This Week
  • 25
    DITA2wiki is a toolkit that enables you to publish DITA content (maps and topics) to a wiki.
    Downloads: 2 This Week