Open Source Python Text Processing Software

Python Text Processing Software

View 91 business solutions

Browse free open source Python Text Processing Software and projects below. Use the toggles on the left to filter open source Python Text Processing Software by OS, license, language, programming language, and project status.

  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Notepad++ Python Script

    Notepad++ Python Script

    A Python Scripting plugin for Notepad++

    A Python Scripting plugin for Notepad++. Complete easy script access to all of the editor's features (including absolutely everything in Scintilla). Configurable menus and toolbar options, assign shortcuts to scripts.
    Leader badge
    Downloads: 506 This Week
    Last Update:
    See Project
  • 2
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Leader badge
    Downloads: 223 This Week
    Last Update:
    See Project
  • 3
    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
    Leader badge
    Downloads: 139 This Week
    Last Update:
    See Project
  • 4
    EpiDoc: Epigraphic Documents in TEI XML

    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    PyWord is a powerful and flexible text editor written in Python. It aims to be similar to other, existing editors (including emacs), but has several unique features as well.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    PyRTF is a pure python module for the efficient creation of RTF documents.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Latex editor with document structure tree view and project handling. Latex output allows direct jump to warnings/errors. Projects folders gives support for figures/graphs. Editor component includes usual features like search/replace and syntax highlight.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Word segmentation utility for Thai language written in C
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    WikiPDF is a mediawiki extension based on Wiki2PDF that adds PDF/LaTeX features to mediawiki. Wiki2PDF is a python script to convert multiple articles of a mediawiki based wiki (pre-configured to use with www.wikipedia.org) to a single LaTeX or PDF file.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Tools for extracting and transforming XML-like mark-up, embedded in source code comments, into proper external entities or well-formed XML files. Can be used for JavaDoc-like "literate programming", or embedding other build-related or CM metadata.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    A simple Gedit Plugin which enables a markup syntax to produce extended characters. 59 defaults but extra character support can be added easily making this plugin highly configurable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A python module that provides algorithms for advanced search - basically all you need to build a search engine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    AmandaVCD splits large VCF files into smaller files with only one vCard in each file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Contains a LaTeX style file and an associated GUI that allow for the annotation of LaTeX documents. Tracks changes made by multiple editors. This package provides a way for multiple authors to collaboratively edit a latex document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Have you ever encountered dynamic web page (php, asp.net...) problems as like being browsed full of question marks? This (currently) tiny utility is for you. It tries to correct the problem by changing the file encoding to unicode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Chinese-Korean Bilingual Corpus

    Construction of Chinese-Korean Bilingual Corpus and Search Technology

    Construction of Chinese-Korean Bilingual Corpus and Search Technology. Some auto alignment programs and a search engine based on jung seong are provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The Contraction Converter is meant for people who write papers a lot and get deducted for having contractions. Run your file through this program and can't, etc. will be changed to the proper equivalent (ex. can not).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    DocScript is an approach to document preparation. It presents tools and utilities to edit and publish documents. The philosophy behind the DocScript project is to utilize the programming tools you're working with anyway in your daily work.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    GeditChangelog

    A Gedit plugin for exact, character based change logging

    This is a plugin for the famous text editor "Gedit" (Available for Linux, WIndows, ...?) When writing a text, GeditChangelog records every character typed. For instance, capture a video of your text creation process. It adds another dimension of text writing, the time dimension. It is very exciting to experiment with this type of writing. Since this is a plugin for gedit, any type of text can be tracked: Programming, Writing short or long text, etc. For a nice visualization, have a look at this youtube video: http://www.youtube.com/watch?v=iFhd7rpbM_M&hd=1 Consider donating to this project: https://sourceforge.net/projects/geditchangelog/donate
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    JReferences is a tool to store and retrieve bibliographic references from a file or MySQL database. It reads BibTeXML, DocBook XML and RIS type references, and can output these and BibTex. A bibtex like alternative is also provided for DocBook XML docu
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Turn a PC keyboard into a Musical Instrument! Keyano has the ability to turn your PC into a Musical Keyboard, or select Alphabet mode and it becomes fun and educational for the Kids. Type "A B C" and it says them out loud while it shows letters on screen
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A text editor which converts wiki-like markup to XHTML and supports the FTP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Notepad++ Python Script Automation
    Provides a set of Python scripts that allow client apps to automate Notepad++ using the excellent Python Script plugin and the Python SimpleXMLRPC modules. Demo apps in IronPython, C#, and now a "pure"-Python2.7 console app. No .Net required!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A graphical, hierarchical note-taking application, using reStructuredText, python and PyQt.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.