-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
unicode-width
Determine displayed width of
charandstrtypes according to Unicode Standard Annex #11 rules -
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
similar
A diff library for Rust
-
ropey
A fast and robust text rope for Rust
-
convert_case
Convert strings into any case
-
const_format
Compile-time string formatting
-
tabled
An easy to use library for pretty print tables of Rust
structs andenums -
lopdf
PDF document manipulation
-
heck
case conversion library
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
pulldown-cmark
A pull parser for CommonMark
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
lazy-regex
lazy static regular expressions checked at compile time
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
sqlformat
Formats whitespace in a SQL string to make it easier to read
-
mime_guess
detection of a file's MIME type by its extension
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16andu32string types are provided, including support for UTF-16 and UTF-32… -
termimad
Markdown Renderer for the Terminal
-
regress
A regular expression engine targeting EcmaScript syntax
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
fuzzy-matcher
Fuzzy Matching Library
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
unicase
A case-insensitive wrapper around strings
-
const-str
compile-time string operations
-
html2text
Render HTML as plain text
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
prometheus-client
Open Metrics client library allowing users to natively instrument applications
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
grok
popular Java & Ruby grok library which allows easy text and log file processing with composable patterns
-
diff
An LCS based slice and string diffing implementation
-
printpdf
reading and writing PDF files
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
garde
Validation library
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
charabia
detect the language, tokenize the text and normalize the tokens
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
lindera
A morphological analysis library
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
protobuf
Protocol Buffers - Google's data interchange format
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
chardetng
A character encoding detector for legacy Web content
-
marmite
easiest static site generator
-
unicode-script
exposes the Unicode
ScriptandScript_Extensionproperties from UAX #24 -
text-size
Newtypes for text offsets
-
synoptic
low-level, syntax highlighting library with unicode support
-
line-index
Maps flat
TextSizeoffsets to/from(line, column)representation -
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
os_display
Display strings in a safe platform-appropriate way
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
cow-utils
Copy-on-write string utilities for Rust
-
typstyle
The CLI for Typstyle
-
unicode-reverse
Unicode-aware in-place string reversal
-
roff
ROFF (man page format) generation library
-
heatseeker
A fast, robust, and portable fuzzy finder
-
pdf-extract
extract content from pdfs
-
prettydiff
Side-by-side diff for two files
-
regex-cursor
regex fork that can search discontiguous haystacks
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
regex-syntax
A regular expression parser
-
rxing
port of the zxing barcode library
-
harfrust
A complete HarfBuzz shaping algorithm port to Rust
-
html-to-markdown-rs
High-performance HTML to Markdown converter using the astral-tl parser. Part of the Kreuzberg ecosystem.
-
olpc-cjson
serde_json Formatter to serialize as OLPC-style canonical JSON
-
any_ascii
Unicode to ASCII transliteration
-
stfu8
Sorta Text Format in UTF-8
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
stringzilla
Search, hash, sort, fingerprint, and fuzzy-match strings faster via SWAR, SIMD, and GPGPU
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
html2md
binary to convert simple html documents into markdown
-
dwrote
Lightweight binding to DirectWrite
-
igrepper
The interactive grepper
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
crop
A pretty fast text rope
-
textdistance
Lots of algorithms to compare how similar two sequences are
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
icu_pattern
ICU pattern utilities
-
epub-builder
generating EPUB files
-
xan
The CSV magician
-
textsurf
Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without everything into memory
-
minimizer
Minimize files to find minimal test case
-
stop-words
Common stop words in many languages
-
markdown_timesheet
processing markdown files to extract and format timesheet data
-
hypher
separates words into syllables
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
deno_doc
doc generation for deno
-
async-memcached
An Tokio-based memcached client for Rust
-
diffy
Tools for finding and manipulating differences between files
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
mdbook-yapp
mdBook preprocessor for simple text replacements
-
rumdl
A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
mdbook-epub
An EPUB renderer for mdbook
-
simdnbt
an unnecessarily fast nbt decoder
-
str_indices
Count and convert between indexing schemes on string slices
-
diff-match-patch-rs
The fastest implementation of Myer's diff algorithm to perform the operations required for synchronizing plain text
-
sdml-cli
Rust CLI for Simple Domain Modeling Language (SDML)
-
entities
raw data needed to convert to and from HTML entities
-
kak-lsp
Kakoune Language Server Protocol Client
-
hgrep
grep tool with human-friendly search output. This is similar to
-Coption ofgrepcommand, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
hck
A sharp cut(1) clone
-
font-types
Scalar types used in fonts
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
fontcull
Pure Rust font subsetting library
-
edit
Open a file in the default text editor
-
emojic
Emoji constants
-
htmd
A turndown.js inspired HTML to Markdown converter
-
htop
HTML to PDF converter
-
chewing
(酷音) intelligent Zhuyin input method
-
usage-lib
working with usage specs
-
languagetool-rust
LanguageTool API bindings in Rust
-
sapling-streampager
streampager is a pager for command output or large files
-
askalono
detect the contents of license files
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
lsp-textdocument
A LSP text documents manager that map of text document
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
focaccia
no_std implementation of Unicode case folding comparisons
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
uwc
Counts things in unicode text files
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
tiefdownconverter
A CLI tool to manage and convert Markdown-based projects
-
difflib
Port of Python's difflib library to Rust
-
serpl
terminal UI for search and replace, ala VS Code
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
mime-infer
detection of a file's MIME type by its extension
-
glob-match
An extremely fast glob matcher
-
repgrep
An interactive command line replacer for
ripgrep -
readable-name-generator
Generate a readable name for throwaway infrastructure
-
bulletty
a pretty TUI feed reader (RSS+ATOM) that stores articles locally as Markdown files
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
savvy
R extension interface
-
norad
Read and write Unified Font Object files
-
matchers
Regex matching on character and byte streams
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
tendril
Compact buffer/string type for zero-copy parsing
-
nuch
A CLI to manage Markdown content and images for Nuxt Content sites
-
hyperlink
Very fast link checker for CI
-
mdbook-quiz
Interactive quizzes for your mdBook
-
icy_sauce
handling SAUCE – Standard Architecture for Universal Comment Extensions
-
arborium-cli
Command-line syntax highlighter powered by arborium
-
lgtmeow
🐾 —— 「本喵觉得很不错~」
-
line-ending
Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
wit_owo
interacting with the Wit.ai API
-
rschess
chess library with the aim to be as feature-rich as possible
-
treegrep
regex pattern matcher that displays results in a tree structure with an interface to jump to matched text
-
wordcut-engine
Word segmentation/breaking library
-
ferris-says
flavored replacement for the classic cowsay
-
markdown-tool
A CLI utility for converting Markdown into AST and vice versa
-
esed
Easy sed
-
qpdf
Rust bindings to QPDF C++ library
-
autumnus
Syntax highlighter powered by Tree-sitter and Neovim themes
-
unicode_categories
Query Unicode category membership for chars
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
strin terms of displayed width -
arrow-string
String kernels for arrow arrays
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
chat-gpt-lib-rs
interacting with OpenAI's ChatGPT API, providing a simple interface to make API requests and handle responses
-
treelog
A highly customizable, optimized, and modular tree rendering library
-
codebook
A code-aware spell checker library (dependency for codebook-lsp)
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
decasify
A CLI utility and library to cast strings to title-case according to locale specific style guides including Turkish support
-
presenterm
A terminal slideshow presentation tool
-
yggdrasil-cli
Yggdrasil is a project flattener and diff engine that turns any subset of your codebase into a single AI-ready codex (index + contents), or compares snapshots with annotated diffs
-
rst_parser
a reStructuredText parser
-
buup
Core transformation library with zero dependencies
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
inlyne
Introducing Inlyne, a GPU powered yet browserless tool to help you quickly view markdown files in the blink of an eye
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
mdbook-indexing
mdbook preprocessor for index generation
-
unindent
Remove a column of leading whitespace from a string
-
todo_lib
Collection of utilities for todo.txt format
-
zhconv
Traditional, Simplified and regional Chinese variants converter powered by MediaWiki & OpenCC rulesets and the Aho-Corasick algorithm 中文简繁及地區詞轉換
-
srgn
A grep-like tool which understands source code syntax and allows for manipulation in addition to search
-
dptran
run DeepL translations on command line written by Rust
-
unicode-casing
Titlecase helper function on characters
-
awabi
A morphological analyzer using mecab dictionary
-
mad-useful
A multi-tool utility for file operations and text processing
-
termdiff
Write a diff with color codes to a string
-
oranda
🎁 generate beautiful landing pages for your projects
-
near-facsimile
Find similar or identical text files in a directory
-
bloodtree
A hierarchical note-taking system with focus on relationships between nodes
-
at-commands
AT Commands builder and parser for Rust #![no_std]
-
mdbook
Creates a book from markdown files
-
aki-gsub
substitude text command, replace via regex
-
base-d
Universal base encoder: Encode binary data to 33+ dictionaries including RFC standards, hieroglyphs, emoji, and more
-
stylin
Convert markdown to pandoc markdown with custom styles
-
ndg-commonmark
Flavored CommonMark processor for Nix-related projects, with support for CommonMark, GFM, and Nixpkgs extensions
-
boxen
creating styled terminal boxes around text
-
quixote
Quizzes and tests in Markdown
-
arf-strings
Encoding and decoding for ARF strings
-
alfrusco
building Alfred workflows with Rust
-
uncomment
A CLI tool to remove comments from code using tree-sitter for accurate parsing
-
pdflens-mcp
An MCP server for reading PDFs, coded by human, designed for AI
-
turbovault-parser
Obsidian Flavored Markdown (OFM) parser
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
type-safe-id
A type-safe, K-sortable, globally unique identifier
-
jx
An interactive JSON explorer for the command line
-
ncount
A word count tool intended to derive useful stats from markdown
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
mime_guess2
detection of a file's MIME type by its extension
-
patchkit
parsing and manipulating patch files
-
vaporetto
pointwise prediction based tokenizer
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
rapidfuzz
rapid fuzzy string matching library
-
mintyml
A minialist alternative syntax to HTML
-
rdfless
A colorful pretty printer for RDF (Turtle/TriG/N-Triples/N-Quads) with ANSI colors
-
apisnip
A terminal user interface (TUI) tool for trimming OpenAPI specifications down to size ✂️
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
harper-core
The language checker for developers
-
ascii
ASCII-only equivalents to
char,strandString -
ctj
A command-line tool to convert CSV to JSON written in Rust
-
llmvm-core
The core application for llmvm
-
mq-lang
Core language implementation for mq query language
-
madato
command line tool for reading and writing tabular data (XLS, ODS, CSV, YAML), and Markdown
-
ul/kak-lsp
Kakoune Language Server Protocol Client
-
datafusion-functions
Function packages for the DataFusion query engine
-
iepub
epub、mobi电子书读写
-
stringdex
A suffixtree search system for static sites
-
quickmark-cli
Lightning-fast Markdown/CommonMark linter CLI tool with tree-sitter based parsing
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
inflection-rs
Inflection is a string transformation library. It singularizes and pluralizes English words, and transforms strings from CamelCase to underscored string.
-
koji
An interactive CLI for creating conventional commits
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
rake
Rapid Automatic Keyword Extraction (RAKE) algorithm
-
commitbot
A CLI assistant that generates commit and PR messages from your diffs using LLMs
-
typst-kit
Common utilities for Typst tooling
-
rds2rust
A pure Rust library for reading and writing R's RDS (R Data Serialization) files without requiring an R runtime
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
tokstream-cli
CLI token stream simulator using Hugging Face tokenizers
-
giallo
A code highlighter giving the same output as VSCode
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
seeyou-cub
reading and writing the SeeYou CUB binary file format, which stores airspace data for flight navigation software
-
rsxxd
hex-dump-type utility rsxxd
-
whitespace-sifter
Sift duplicate whitespaces away!
-
ristretto_classfile
reading, writing and verifying Java classfiles
-
wildcard
matching
-
litho-book
Litho Book is a modern web documentation reader specifically designed for the Litho (deepwiki-rs) documentation generation engine. It provides…
-
reword
some utility functions for human-readable formatting of words
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
simsearch
lightweight fuzzy search engine that works in memory, searching for similar strings (a pun here)
-
textra
type less, write more
-
etradeTaxReturnHelper
Parses etrade and revolut financial documents for transaction details (income, tax paid, cost basis) and compute total income and total tax paid according to chosen tax residency (currency)
-
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
cirru_parser
Parser for Cirru text syntax
-
awful_dataset_builder
Build LLM-ready Q/A datasets from reference text-to-question mappings produced by Awful Knowledge Synthesizer
-
tantivy-common
common traits and utility functions used by multiple tantivy subcrates
-
text_io
really simple to use panicking input functions
-
makepad-widgets
Makepad widgets
-
trueno-rag
Pure-Rust Retrieval-Augmented Generation pipeline built on Trueno
-
html-to-markdown-cli
Command-line interface for html-to-markdown - high-performance HTML to Markdown converter
-
spec-oxide
Spec-driven development for humans and AI
-
str-utils
some traits to extend
[u8],strandCow<str> -
regex-literal
delimited regular expression literals
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
obsidian-logging
A journaling/logging CLI that stores logs in Obsidian markdown files
-
rhai-autodocs
Custom documentation generator for the Rhai scripting language
-
mdcat
cat for markdown: Show markdown documents in terminals
-
ttypr
terminal typing practice
-
skyspell
Fast and handy spell checker for the command line
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
phs
Runtime for Phlow Script, PHS
-
terraphim-cli
CLI tool for semantic knowledge graph search with JSON output for automation
-
txt_to_md
Command converting from a txt file to a markdown file
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
pomsky
A new regular expression language
-
eloran
Comics and Ebook web library written in rust, with reading, search, reading status, bookmarks
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
deno_media_type
Media type used in Deno
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
slugomatic
🐌 A simple CLI tool to slugify and unslugify text, perfect for branch names and URLs
-
sentencex
Sentence segmentation library with wide language support optimized for speed and utility
-
swappy
An anagram generator
-
wayland-clipboard-listener
impl wlr-data-control-unstable-v1, listen for clipboard
-
okh-tool
A CLI tool to deal with Open Know-How (OKH) data files. Its main functionalities are: validation of and conversion between the different formats
-
word-tally
Output a tally of the number of times unique words appear in source input
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
zettel-cli
cli app for Luhmann-style Zettelkasten management
-
uroman
A blazingly fast, self-contained Rust reimplementation of the uroman universal romanizer
-
raylib_interactive
An interactive library for Raylib
-
unreal-doc
generating documentation from Unreal C++ sources
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
semtools
Semantic search and document parsing tools for the command line
-
measured
A better way to measure your application statistics
-
pad
padding strings at runtime
-
bareun_rs
an unofficial Rust library for Bareun, a Korean morphological analyzer
-
galm
pattern matching library
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
basalt-tui
Basalt TUI application for Obsidian notes
-
braillify
Rust 기반 크로스플랫폼 한국어 점역 라이브러리
-
md-tui
A terminal markdown viewer
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
opentalk-types-common-identifiers
Common identifier types for OpenTalk crates
-
rustdoc-stripper
manipulate rustdoc comments
-
topiary-queries
tree-sitter query files compatible with Topiary
-
cmx
Rust Spectral Color Management Library
-
regexr
A high-performance regex engine built from scratch with JIT compilation and SIMD acceleration
-
dug
An exhaustive name resolution aggregator
-
xrusty
Parse documents and transform using χrust
-
sile
Simon’s Improved Layout Engine
-
claudius
SDK for the Anthropic API
-
cicero-sophia
High-performance NLU (natural language understanding) engine built in Rust for speed, accuracy, and privacy
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
yangon
A high-performance, stack-allocated string type for Rust with fixed capacity and zero heap allocations
-
sbnf
A BNF-style language for writing sublime-syntax files
-
codeprism-core
Core incremental polyglot parser and graph builder for code analysis
-
shaperglot
Test font files for OpenType language support
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
howmany
A blazingly fast, intelligent code analysis tool with parallel processing, caching, and beautiful visualizations
-
asimov-imap-module
ASIMOV module for IMAP email import
-
vi
An input method library for vietnamese IME
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
cmark-writer
A CommonMark writer implementation in Rust for serializing AST nodes to CommonMark format
-
kas-text
Text layout and font management
-
dom-content-extraction
Content extraction via text density paper
-
zspell
Native Rust library for spellchecking
-
derivre
A derivative-based regular expression engine
-
markdown-org-extract
CLI utility for extracting tasks from markdown files with Emacs Org-mode support
-
anda_db_hnsw
A high-performance vector search library in Rust
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
regect
A cli tool to quickly test regular expressions
-
deeprl
DeepL client library with all the things (blocking)
-
unbom
Remove UTF-8 BOM from files
-
sd
An intuitive find & replace CLI
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
instant-segment
Fast English word segmentation
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
unidown
Convert Markdown to Unicode
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
latexmk-diff-head
LaTeX compilation tool that generates diff PDFs against Git commits
-
codex
Human-friendly notation for Unicode symbols
-
mdbook-preprocessor
assist implementing an mdBook preprocessor
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
panfix
parsing: linear time parsing of multifix operators
-
pure-tui
A modern terminal-based word processor for Markdown and other structured text documents
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
collclean
Clean up collaboration commands in LaTeX files
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
flickzeug
A fork of diffy: diff, patch, and merge library featuring Myers' algorithm, unified diff format parsing, fuzzy patch application, and three-way merge with conflict detection
-
comically
fast manga & comic optimizer for e-readers
-
trpl
A support crate for The Rust Programming Language book
-
ripgrep_all
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc
-
string_wizard
manipulate string like a wizard
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
fuzzy-aho-corasick
Aho–Corasick automaton with fuzzy matching
-
ib-matcher
A multilingual, flexible and fast string, glob and regex matcher. Support 拼音匹配 (Chinese pinyin match) and ローマ字検索 (Japanese romaji match).
-
gulagcleaner_rs
Ad removal tool for PDFs
-
esri_ascii_grid
reading ESRI Ascii Grid .asc files
-
mandown
Markdown to groff (man page) converter
-
asciisavers
A small collection of ascii screensavers
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
koicore
core KoiLang module
-
unicode-properties
Query character Unicode properties according to UAX #44 and UTR #51
-
lindera-python
Python binding for Lindera
-
asposepdf
Aspose.PDF for Rust via C++ is a powerful toolkit that allows developers to manipulate PDF files directly and helps do various tasks for PDF. Contains unique features for converting PDF to other formats.
-
pulumi_gestalt_core
Core Pulumi Gestalt implementation
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
ADA_Standards
help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
whatwg_streams
whatwg_streams for rust
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
svgbob
Transform your ascii diagrams into happy little SVG
-
document_tree
reStructuredText’s DocumentTree representation
-
ultra-nlp
A NLP library
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
arborium-c
C grammar for arborium (tree-sitter bindings)
-
minspan
a package for determining the minimum span of one vector within another
-
mdpdf
A fast CLI tool to convert Markdown files to PDF
-
mdless
A terminal-based markdown file viewer
-
codefmt
a markdown code block formatter
-
aki-xtee
copy standard input to each files and standard output
-
mdbook-linkcheck2
A backend for
mdbookwhich will check your links for you -
jsongrep
A JSONPath-inspired query language for JSON documents
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
furigana
Map furigana to a word given its reading
-
reconcile-text
Intelligent 3-way text merging with automated conflict resolution
-
llm-utl
Convert code repositories into LLM-friendly prompts with smart chunking and filtering
-
quickmd
Quickly preview a markdown file
-
mono
Mono repository automation toolkit
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
datafusion-spark
DataFusion expressions that emulate Apache Spark's behavior
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
hanja_hangul
that converts Chinese characters to Korean characters. That is, convert hanja to hangul
-
string_pipeline
A flexible, template-driven string transformation pipeline for Rust
-
roe
Unicode case conversion
-
shibuichi
zsh prompt preprocessor to add git integration
-
annatto
Converts linguistic data formats based on the graphANNIS data model as intermediate representation and can apply consistency tests
-
latkerlo-jvotci
Tools for creating and decomposing Lojban lujvo
-
linkup
Automatically add links to Markdown files
-
rust_metrics
Incremental evaluation metrics for various machine learning pipelines
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
nu_plugin_regex
nu plugin to search text with regex
-
semantic-edit-mcp
MCP server for semantic code editing with tree-sitter
-
moonwave
generating documentation from comments in Lua source code
-
bibleref
Structures and functions for managing Bible references
-
go22dos
go to todos
-
mdbook-angular
mdbook renderer to run angular code samples
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
marqant
Quantum-compressed markdown format for AI consumption with 90% token reduction
-
vlazba
Lojban words generator and analyzer
-
bangumi-api
An api implementation for Bangumi website
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
noil
file explorer using text buffers
-
tu
CLI tool to convert a natural language date/time string to UTC
-
libdav
CalDAV and CardDAV client implementations
-
ttlint
Small, fast utility to lint text
-
quickctx
A bidirectional file content aggregator and extractor that converts between files and markdown-formatted representations
-
bear-query
A read-only Rust library for querying the Bear note-taking app's SQLite database with minimal interference
-
dirgrab
CLI tool to concatenate file contents from directories, respecting Git context
-
blocklet
A cross-platform CLI tool that generates ASCII art using Unicode block characters, similar to figlet but with beautiful solid Unicode blocks instead of outlines or hash symbols
-
unicode-charname
functions for retrieving Unicode character name properties as described in Unicode Standard Annex #44
-
objectstore-client
Client SDK for Objectstore, the Sentry object storage platform
-
bmfont_rs
Load/ save/ manipulate BMFont files
-
kmdbook
Creates a book from markdown files
-
rust-port-kill
A cross-platform command-line tool to manage listening ports and processes
-
smart-patcher
Patcher based on rules
-
erg_common
A common components library of Erg
-
kiru
Fast text chunking for Rust
-
ragit
git-like rag pipeline
-
syllabize-es
Syllabize Spanish text, and much more
-
lo_
A modern Rust utility library delivering modularity, performance & extras ported from JavaScript Lodash
-
csvpretty
A command-line tool that formats CSV input into tables with Unicode box-drawing characters
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
gazenot
Gaze Not Into The Abyss, Lest You Become A Release Engineer
-
no-crlf
A CLI tool to convert CRLF line endings to LF in text files
-
omry-search
Search-related abstractions for the Omry project
-
fonts
High-performance font parsing and analysis library for Grida Canvas
-
Inscribe
A markdown preprocessor that executes code fences and embeds their output
-
four-char-code
A string of 4 ascii chars represented by an u32
-
zp
Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program
-
unidoc
Unite all Markdown
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
swc_ecma_transformer
Compatibility layer for the ECMAScript standard
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
dmos
Djot HTML renderer with advanced features
-
ocr-rs
A lightweight and efficient OCR library based on PaddleOCR models, using the MNN inference framework for high-performance text detection and recognition
-
escaping
configurable string escaping and unescaping
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
rustdoc-md
Convert Rust documentation JSON into clean, organized Markdown files
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
mdbook-wordcount
Word count for mdbook, inspired by the mdbook tutorial
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
mini-calc
A Fully-Featured Configurable (mini) Rust Calculator
-
gibberish-or-not
Figure out if text is gibberish or not
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
markex
Fast, non-validating markup element extractor (Tag Element, MdRef, MdCodeBlock, MDSection)
-
fdn
uniformly change file or directory names and support rollback these operations
-
strs_tools
Tools to manipulate strings
-
cai
The fastest CLI tool for prompting LLMs
-
mdbook-bib
mdbook plugin allowing to load and present a bibliography in BibLaTex format in your books and cite its references
-
async-utf8-decoder
Convert AsyncRead to incremental UTF8 string stream
-
hawkeye-fmt
The formatter library for hawkeye cli
-
pdf_tables
Scrape text from tables in PDF files
-
bbd
Binary Braille Dump
-
search-text
A fast and flexible command-line tool to recursively search for text or regex patterns in files under a directory
-
my-notes
note taking app for taking hierarchical notes in markdown
-
md-formatter
A fast, opinionated Markdown formatter
-
leptos-sync-components
Leptos components for synchronization UI
-
pretty-console
A fluent, zero-cost API for styling terminal text with colors and attributes
-
ferrous-opencc
A pure Rust implementation of Open Chinese Convert (OpenCC), for fast and reliable conversion between Traditional and Simplified Chinese
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
wdl-doc
Documentation generator for Workflow Description Language (WDL) documents
-
substring
method for string types
-
worf-launcher
Wayland application launcher inspired by wofi, rofi, and walker. Written in Rust with GTK4, supporting multiple modes (math, drun, file, ssh, run, emoji, search, auto), modern theming, and high performance.
-
yaml-include
A lib and a CLI for recursively parsing "!include" data in yaml files
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
chatpack
Prepare chat data for RAG / LLM ingestion. Supports Telegram, WhatsApp, Instagram, Discord.
-
atog
ascii to greek - prints greek letters given latin alphabets as input
-
dspy-rs
A DSPy rewrite(not port) to Rust
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
claude-kb-cli
CLI for generating, validating, and searching Markdown knowledge bases under a .claude/ hierarchy
-
airs-memspec
Streamline AI-assisted development with Multi-Project Memory Bank management and GitHub Copilot integration
-
memory-indexer
An in-memory full-text fuzzy search indexer
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
richrs
port of the Rich Python library for beautiful terminal output
-
tsuki
Lua 5.4 ported to Rust
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactorsub-command. Compile BibTeX files to legacythebibliographyTeX code using thecompilesub-command… -
swc-plugin-inferno
SWC plugin for InfernoJS
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
forbidden-bands
8-bit string handling library
-
xim-ctext
compound text en/decoder
-
engish
A language utility for sampling and building words
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
ggemtext
Glib-oriented Gemtext API
-
picodiff
Tiny GUI app to compare text easily
-
rwkv-tokenizer
A fast RWKV Tokenizer
-
srt2txt
Convert SRT subtitle files into clean plain text (strip timestamps, tags, merge lines)
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
threeway_merge
Git-style 3-way string merging using proven algorithms from libgit2/xdiff. Statically links xdiff (LGPL-2.1+).
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
blockwatch
Language agnostic linter that keeps your code and documentation in sync and valid
-
vibequest
A vibe-coded scripting language focused on developer happiness with a REPL implementation in Rust
-
two_percent
Fuzzy Finder in rust!
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
oxford_join
Join string slices with Oxford Commas!
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
wikipedia_prosesize
Count Wikipedia prose size
-
rfgrep
Advanced recursive file grep utility with comprehensive file type classification - search, list, and analyze 153+ file formats with intelligent filtering and safety policies
-
plsfix
Text cleaner upper
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
artificial
Typed, provider-agnostic prompt-engineering SDK for Rust
-
fast_symspell
Spelling correction & Fuzzy search
-
fm
Non-backtracking fuzzy text matcher
-
kataru
YAML-based dialogue engine for game development
-
glyphana
Quickly find, inspect & collect unicode glyps
-
mdbook-replace
mdBook preprocessor that simply replaces text
-
indent
Functions for indenting multiline strings
-
ruma-events
Serializable types for the events in the Matrix specification
-
rlvgl-ui
High-level UI components and theming for rlvgl
-
autotex
Continuously compile TeX and LaTeX
-
gllm
Pure Rust library for local text embeddings and reranking with 26 supported models
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
booky
analyze English text
-
easymark
Lightweight Markdown rendering utility that just works
-
yosina
Japanese text transliteration library
-
clima
A minimal Markdown reader in the terminal
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
sff
SemanticFileFinder (sff): Fast semantic file finder using sentence embeddings. Searches .txt, .md, .mdx files.
-
md-scatter
split up and reassemble markdown files
-
mds
A skim-based
*.mdexplore and surf note-taking tool -
skimtoken
Fast token count estimation library
-
csep
Cosine Similarity Embeddings Print
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
shifter
A CLI tool to rotate lines from stdin by n positions
-
blogr-cli
A CLI static site generator for blogs
-
kfst-rs
Fast and portable HFST-compatible finite-state transducers
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
autosurgeon
working with data in automerge documents
-
hexout
A compact and dependency-free, flexible and customizable hex dump library for Rust that provides beautiful, configurable binary data visualization
-
spanned
string processing with file/line/col information and the regular rust
strAPI -
rsxiv
Tools for working with arXiv and the arXiv API
-
claudiofsr_lib
General-purpose library used by my programs
-
dash-em
Enterprise-Grade Em-Dash Removal Library — SIMD-Accelerated String Processing
-
model2vec-rs
Official Rust Implementation of Model2Vec
-
tarzi
Rust-native lite search for AI applications
-
mdv
Terminal Markdown Viewer
-
flatpage
file system based markdown flat page
-
quant1x
Cross-language standard library for quantitative trading
-
fkys-rs
F*cking Kill Yourself lang interpreter written in Rust
-
wrap-ansi
A high-performance, Unicode-aware Rust library for intelligently wrapping text while preserving ANSI escape sequences, colors, styles, and hyperlinks
-
mdka
HTML to Markdown converter
-
mq-markdown
Markdown parsing and manipulation utilities for mq
-
css_lexer
A spec-compliant CSS tokenizer with zero-copy cursors and optional feature gates
-
tphrase
A translatable phrase generator
-
zipcodes
Query US zipcodes without SQLite
-
liblevenshtein
Levenshtein/Universal Automata for approximate string matching using various dictionary backends
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
xml-3dm-cli
3DM XML Tree Differencing and Merging Tool CLI
-
awful_news_vibes
Daily news meta-analysis pipeline with AI-powered clustering and D3 visualizations
-
moobi
A random password generation CLI tool
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
gxter
A parsing library for creating and reading GTA 3/VC/SA GXT (text string) files
-
streplace
A tiny library for matching and replacing in strings and slices with user-defined functions
-
quillmark
engine API
-
mdbook-git
Insert git commit files and diffs into mdbook
-
affinidi-messaging-text-client
Affinidi Messaging SDK
-
xhtml_parser
Non-validating XHTML Tree-based parser
-
stam-tools
Command-line tools for working with stand-off annotations on text (STAM)
-
difference-rs
text diffing and assertion library
-
mdbook-footnote
mdbook preprocessor for footnotes
-
ntcip
National Transportation Communications for ITS Protocol
-
ansic
does ansi parsing in a dynamic DSL and at compile time for efficient and zero cost ansi styling
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
text-to-ascii-art
program to convert text to ASCII art
-
indent_write
Write adapters to add line indentation
-
rob_test_sagebox_integration_001
Internal test crate for validating Sagebox packaging and README rendering. Not intended for public use.
-
repoyank
Interactively traverse your repository, select files/directories, and quickly prepare structured snippets for LLM interactions
-
url_encor
A lightweight library to encode and decode special characters in urls
-
mantra-miner
your software recite mantras while it runs
-
runmunch
hunspell's unmunch tool for expanding dictionary words using affix files
-
iregex
Intermediate representation for Regular Expressions
-
mdbook-numbering
A mdBook preprocessor that adds numbers to headings and code block lines (for mdbook 0.5.0 and above)
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
nu_plugin_emoji
a nushell plugin called emoji
-
lethe-core-rust
High-performance hybrid retrieval engine combining BM25 lexical search with vector similarity using z-score fusion. Features hero configuration for optimal parity with splade baseline…
-
tre-regex
Rust safe bindings to the TRE regex module
-
dodo-zh
operate on pinyin and cedict.u8 file
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen, giscus and many other apps
-
nu-command
Nushell's built-in commands
-
bogrep
Full-text search for bookmarks from multiple browsers
-
COXave
Instruments for codings
-
luciferous-case-converter
A CLI tool to convert text between different cases
-
izihawa-tantivy-bitpacker
Tantivy-sub crate: bitpacking
-
cosmic-text-tessera-fork
Pure Rust multi-line text handling
-
steelkilt
Draft 0.4 RPG rule set
-
emoji-remover
A fast command-line tool to remove emojis from source code files
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
cleanpy
Python 3.12+ 最佳实践知识库 - MCP 服务器
-
folderwalk
Folder walking tool
-
smt-str
working with SMT-LIB strings in Rust
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
asciidork-eval
Asciidork eval
-
unicode-ccc
Unicode Canonical Combining Class detection
-
good-pc
Just a silly little tool to pet your pc and display some ASCII art
-
madoru
markdown task runner
-
twincat
ADS tools
-
markon
A lightweight Markdown renderer with GitHub styling and Medium-like annotations
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath. -
kreuzberg-cli
Command-line interface for Kreuzberg document intelligence
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
r2md
Entire codebase to single markdown or pdf file
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
fasttext
binding
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
armnod
random string generator
-
nobom
remove UTF-8 BOM (Byte Order Mark) from stdin and write to stdout
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
newline_normalizer
Zero-copy newline normalization to \n or \r\n with SIMD acceleration
-
sk-skimmer
Fuzzy Finder in rust!
-
lumin
searching and displaying local files
-
rst
a reStructuredText parser and renderer for the command line
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
neofile
linux file command for windows. PATCH 4.
-
vidyut-prakriya
A Sanskrit word generator
-
goose-eggs
in writing Goose load tests
-
phd
an esoteric gopher server
-
slugrs
A fast, locale-aware slugify library for Rust
-
retrofont
Retro terminal font toolkit: TDF/FIGlet parsing, rendering, conversion
-
large-text-core
Core library for handling large text files search and replace efficiently
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
markdown-translator
A translation library with DeepLX API integration, rate limiting, and smart text chunking
-
gazetta-render-ext
A static site generator framework. Extra render code.
-
secular
No Diacr!
-
runefix-core
Unicode character display width engine supporting CJK, emoji, and grapheme clusters
-
genedex
A small and fast FM-Index implementation
-
genpdf
User-friendly PDF generator written in pure Rust
-
htmd-cli
The command line tool for htmd
-
dnd-character
A Dungeons and Dragons character generator
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
ripdoc
Query Rust docs and crate API from the command line
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
teip
Masking tape to help commands "do one thing well"
-
case
A set of letter case string helpers
-
lister-cli
Lister: Navigate Markdown Lists
-
utf16_lit
macro_rules to make utf-16 literals
-
mdsql
SQL queries for markdown tables
-
orgflow
managing documents with support for tasks and notes
-
anthropic-text-editor
A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool
-
gremlh
A CLI tool to find and fix invisible 'gremlin' characters (homoglyphs, zero-width spaces, Bidi overrides) in source code
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
doxygen-bindgen
Converts Doxygen comments into Rustdoc markdown
-
anaso_site_api_models
API models for Ana.so
-
hanconv
Convert between Chinese characters variants
-
zh_num
Convert ASCII numbers and zh words
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
opusmeta
reading and writing metadata to opus files
-
wcount
CLI word counting tool
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}in text with their respective value. It can also be used as a rust library -
vibrato-rkyv
Vibrato: viterbi-based accelerated tokenizer with rkyv support for fast dictionary loading
-
langextract-rust
extracting structured and grounded information from text using LLMs
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
zen-rs
generating non-interactive content like cards or files
-
ruchydbg
ML-powered debugger for Ruchy with SBFL fault localization
-
unicode-matching
match Unicode open/close brackets
-
searcher_txt
A copy of grep that I made to show that im bad at rust
-
mdbook-environment
A preprocessor for MdBook for working with environment variables
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
kreuzberg-tesseract
Rust bindings for Tesseract OCR with cross-compilation, C++17, and caching improvements
-
simplematch
Fast wildcard pattern matching for strings and bytes with a simple api
-
lil-tabby
A macro-based library for creating visually appealing tables with automatic column spanning
-
rascii_art
Advanced ASCII Art Generator
-
distrs
PDF, CDF, and percent-point/quantile functions for the normal and Student’s t distributions
-
deencode
Reverse engineer encoding errors
-
pdf2pwg
Single purpose A4 page renderer rendering PDF using pdfium to PWG/URF
-
koto_test_utils
Testing utilities for the Koto programming language
-
text-similarity-metrics
A high-performance Rust library for computing text similarity using multiple algorithms
-
tibco_ems
A high level API for the Tibco EMS
-
tortilla
Somewhat syntax-aware text wrapping for source code and plain text documents
-
hlight
dedicated to delivering exceptional syntax highlighting capabilities
-
wdl-lint
Lint rules for Workflow Description Language (WDL) documents
-
ucfirst
Uppercase the first letter of a string
-
kaff_sso
Small-buffer-optimized generic buffer and UTF-8 string type
-
fontheight
Find out the vertical extents your font reaches on shaped words
-
recursive-file-loader
recursively load files via references in the files
-
acroform
High-level PDF form manipulation library using lopdf
-
twilight-mention
working with mentions in the Twilight ecosystem
-
memchr-rs
Fast memchr and memchr2 implementations in Rust
-
xid
Globally unique sortable id generator. A Rust port of https://github.com/rs/xid.
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ,ŋandê. -
neo4j_cypher
A flexible and intuitive query builder for Neo4j and Cypher
-
patiencediff
algorithm
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
fontcull-font-types
Scalar types used in fonts. (Vendored fork for fontcull)
-
sigrs
Interactive grep (for streaming)
-
waterui-str
String utilities for WaterUI
-
ncp-matcher
plug and play high performance fuzzy matcher
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
mdsh
Markdown shell pre-processor
-
mdtrans
Markdown parser and transformer using
pest.rs, focused on flexibility to a project’s needs -
grapheme_machine
Grapheme cluster text segmentation (UAX #29) state machine for streaming input
-
json_to_table
pretty print JSON as a table
-
bullet_stream
Bulletproof printing for bullet point text
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
CompactPrefixRadix
a minimalistic but efficient radix tree implementation with extra prefix support
-
agentai
designed to simplify the creation of AI agents
-
doxx
Terminal document viewer for .docx files
-
fencecat
Walkdir cat with markdown fenced code output
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
nerdfetch-rs
A minimal, fast system fetch tool - rewritten in Rust for speed, clarity, and configurability
-
dedoc
Terminal-based viewer for DevDocs documentation
-
rustpress
增量编译倒分页无后端 Rust 纯静态博客程序
-
textcon
Template text files with file/directory references for AI/LLM consumption
-
basic-text
Basic Text strings and I/O streams
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
datadriven
Rewritable table-driven testing
-
codebase-to-markdown
convert codebase to markdown format
-
saytify
greeting and farewell messages
-
langram
Natural language detection library
-
token_processor
fast, streaming‑first Rust library for processing LLM outputs by attaching callbacks to XML‑style tags—supporting both streaming and buffered handlers—and using aho‑corasick for ultra‑efficient…
-
rsticle
Treat source files as articles / narrative documentation
-
rsrpp
project for research paper pdf
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
byteutils
that provides a collection of frequently used utility functions for working with bytes, strings, and vectors. It includes common tasks such as converting between strings and byte arrays…
-
tars-bin
A small, fast, static site generator
-
tbll
tbll outputs data in tabular format
-
omnix-common
Common functionality for omnix frontends
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
unicode-writing-script
The package provides detection of writing scripts from Unicode codepoints
-
codesearch
A fast CLI tool for searching codebases
-
mktoc
Generate Table of Contents from Markdown files
-
ere
A compile-time alternative for POSIX extended regular expressions
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
unicode-language
detect language coverage given a list of codepoints
-
asimov-brightdata-module
ASIMOV module for data import powered by the Bright Data web data platform
-
likewise
A diff library for Rust (fork of similar)
-
regexml
XPath compatible regex engine
-
postcode_extractor
extract and identify postcodes
-
loc
Count lines of code (cloc) fast
-
mdvalidate
Markdown schema validation engine
-
tergo-formatter
Formatter for tergo
-
isbn
handling ISBNs
-
yara-x-parser
A parsing library for YARA rules
-
autofoam
related tools
-
glu
Unpacks a Markdown document full of code snippets into a temp directory
-
mdtablefix
mdtablefixunb0rks and reflows Markdown tables so that each column has a uniform width. When the--wrapoption is used, it also wraps paragraphs and list items to 80 columns. -
codebase-to-prompt
bundling text files like code to single file
-
cli-boxes
Unicode box drawing characters for creating beautiful CLI interfaces
-
brk_string_wizard
manipulate string like a wizard
-
normalized-line-endings
Returns an iterator over characters with all line endings converted to \n (U+000A)
-
typed-oid
Typed Object IDs
-
illuminate-string
A comprehensive Rust library for advanced string manipulation and processing
-
syntaxfmt
A derive macro-based library for flexible syntax tree formatting with pretty printing support
-
scrunch
full-text-searching compression
-
terminal_tools
Power-Terminal TUI - fuzzy file and text finder, process manager, git browser, and more
-
rust-regex-dsl
Regular expression DSL
-
litsea
extreamely compact word segmentation and model training tool implemented in Rust
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
caco3
common lib
-
readability
Port of arc90's readability project to rust
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
stringzz
strings and opcodes extraction from various file formats
-
erebus
A CLI message generation library
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
ttf_word_wrap
Wraps text based on character width
-
dwg-core
Deterministic Writing Guard core analysis engine for spotting AI-styled prose
-
uresamp
HIFI uresamp delivers ultrasonic-fidelity text resampling via adaptive 64-bit floating-point spectral mapping, preserving Unicode 32-bit codepoint integrity with zero-phase distortion
-
text_trees
textual output for tree-like structures
-
anycase
a case conversion library for Rust
-
text_utils_s
edit array. Example delete duplicate in array. Clear string
-
parserst
A recursive-descent reST parser and renderer
-
opentalk-types-signaling-legal-vote
Signaling types for the OpenTalk legal vote module
-
encoding-next
Character encoding support for Rust
-
transportations_library
A comprehensive Rust-based library implementing transportation engineering methodologies (e.g. the Highway Capacity Manual (HCM)) with Python bindings.
-
latentdirichletallocation
Latent Dirichlet Allocation (LDA) using collapsed Gibbs sampling for topic modeling
-
human_regex
A regex library for humans
-
diagnostic
Pretty diagnostic report
-
vectradb-chunkers
Chunking utilities for VectraDB in Rust
-
mdbook-qr
An mdBook preprocessor that generates a QR code using fast_qr
-
crate2bib
Create BibLaTeX entries for crates hosted on crates.io
-
himmelblau_red_asn1
A little library to encode/decode ASN1 DER
-
pukram2html
converting Pukram-formatted text to HTML
-
mdref
Markdown Reference finding and migration tool
-
iati-types
Core data types for IATI Activity v2.03: Activity, Transaction, Money, TxType, etc
-
pspp
Statistical analysis software
-
asimov-prompt
ASIMOV Software Development Kit (SDK) for Rust
-
conveyor-etl-cli
CLI tool for managing Conveyor ETL resources
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
xifetch
neofetch alternative written in Rust
-
pgf2json
Application Programming Interface to load and interpret grammars compiled in Portable Grammar Format (PGF). The PGF format is produced as a final output from the GF compiler. The library…
-
typing_engine
A typing game engine for Japanese and English
-
syslog_fmt
A formatter for the 5425 syslog protocol
-
linkcheck2
extracting and validating links
-
ansi-align
Text alignment library with ANSI escape sequence and Unicode support
-
pdfcat
Concatenate PDF files into a single document
-
markdown-readtime
estimate reading time for Markdown content
-
caseify
A CLI tool to convert strings between different cases
-
wordcutw
A C-interface wrapper for Wordcut - a Lao/Thai word segmentation/breaking library
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
rascii_art_img
Advanced ASCII Art Generator. Fork for imgii.
-
trailfix
Trim trailing whitespace and ensure single newline at EOF
-
string-offsets
Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines
-
monochora
gif to ascii art converter written in rust
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
markov_strings
A simplistic Markov chain text generator
-
n_gram
training n-gram language models
-
codebook-lsp
A code-aware spell checker with language server implementation, installable via cargo install
-
mudssky_utils
A comprehensive Rust utility library providing common functionality for everyday programming tasks
-
ticker-sniffer
extracting multiple stock ticker symbols from a text document
-
pdfgen
PDF rendering library
-
ungoliant
The pipeline for the OSCAR corpus
-
alphabet_detector
Natural language alphabet detection library
-
quot
A fast and flexible command-line tool that converts text input into escaped string literals
-
hangeul_jamo_rs
A high-performance Korean Hangul syllable and jamo manipulation library. included Python bindings.
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
compression-prompt
Fast statistical compression for LLM prompts - 50% token reduction with 91% quality retention
-
maybe-regex
Wrapper for strings that may be either a regex or a plain-text string
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
jsrmx
command-line tool to manipulate JSON files. It can split large single-object JSON files into many files; merge multiple JSON files into one large JSON file; bundle multiple JSON files into one NDJSON file…
-
e9571_str_lib
string processing, sorting, counting, and hashing
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
stenotype
Machine stenography primitives
-
inline_flexstr
copy/clone-efficient inline string type for Rust
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
drova_sdk
Sdk for absolute converter of formats for dalet
-
ystd
An opinionated and batteries included
stdmirror for convenient, correct code and pleasant error messages -
ebg
Eric's Blog Generator, a simple static site generator
-
adc-lang
Array-oriented reimagining of dc, a terse RPN esolang
-
gemini-map
A command-line tool to run files in parallel through Google Gemini
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
lindera-wasm
A morphological analysis library for WebAssembly
-
attuned-infer
Fast, transparent inference of human state axes from natural language
-
zeroten-denote
Handle denote name scheme
-
filenamify
Convert a string to a valid filename
-
terraphim-repl
Offline-capable REPL for semantic knowledge graph search
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
spellchk
A blazingly fast spellchecker CLI for any text file
-
numeronym-cli
A CLI tool that generates numeronym abbreviations from text
-
did-toolkit
spec-compliant implementation of did-core, W3C's 'Decentralized Identity Documents'
-
colonnade
format tabular data for display
-
fast-slice-utils
Highly optimized slice utilities using SIMD instructions when available
-
seam
Symbolic Expressions As Markup
-
constr
Constant string generics
-
husk-lexer
Lexer for the Husk programming language
-
caseless
Unicode caseless matching
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
polished_scancodes
handling and mapping keyboard scancodes in Rust
-
pandoc_types
Rust port of pandoc-types
-
chord3
Create pdf songbooks from chopro source
-
sedregex
Sed-like regex library
-
uv-requirements-txt
internal component crate of uv
-
memkb
A local AI in memory RAG for markdown files in a folder exposed as MCP
-
dictutils
Dictionary utilities for Mdict and other formats
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
cin
that simplifies command-line input in Rust, especially mimicking C++-style input
-
thediff
Difference between 2 files in percentages
-
domrs
Document builder and serializer
-
soon
Predict your next shell command based on history — like shell autocomplete, but MORE stupid
-
doryen-rs
Pure rust OpenGL accelerated roguelike console API with native/wasm support
-
mailidator
A lightweight Rust library for checking email address misspellings
-
ucf
A universal code formatter
-
duca
Search and read Dante's Divine Comedy from your terminal
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
colourss
parsing CSS color strings into RGB values
-
ponsic-winsafe
The dependency of the
ponsiccrate -
mdbook-tiny
Use mdbook to generate tiny and fast static sites
-
bm25-vectorizer
A minimal Rust library for creating sparse vector representations (embeddings) using the BM25 algorithm for information retrieval
-
libchai
汉字编码优化算法
-
grapheme-cluster-utils
Unicode-safe String manipulation utilities by extended grapheme cluster (UAX #29)
-
metatron
core library
-
flxy
Full-text searching and scoring of strings
-
ascii_table_rs
Elegant ASCII table renderer for Rust CLI and terminal apps
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
rustpython-parser-vendored
RustPython parser vendored third-party crates
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
okh-scraper
A scraper of Open Source Hardware (OSH) projects. based on the Open Know-How (OKH) standard
-
sakurs-core
High-performance sentence boundary detection using Delta-Stack Monoid algorithm
-
invisible-characters
A list of invisible characters
-
onig-regset
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
beautiful-md
A CLI tool to format and beautify Markdown files
-
paperless-api-client
Paperless-ngx API client
-
ident_case
applying case rules to Rust identifiers
-
trustformers-tokenizers
Tokenizers for TrustformeRS
-
repose-text
Text handling (wrappers around cosmic-text apis)
-
fuse-rust
Fuse is a super lightweight library which provides a simple way to do fuzzy searching. Fuse-Rust is a port of Fuse-Swift, written purely in rust
-
xerg
Ultra-fast grep implementation in Rust - built for maximum speed with direct output and parallel processing
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
mupdf-basic-text-extractor
Basic structured text extraction using mupdf-rs
-
hyphertool
Hypertool is a command-line tool for syllabification and hyphenisation
-
img2epub
Convert images to EPUB
-
fast-cat
An ergonomic macro for efficient string concatenation with a single memory allocation
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
devek
CLI for copying HTML to clipboard
-
utf-64
The next-generation text encoding standard using 64 bits per character
-
dossiers
home for your specs, policies, and process docs
-
tetratto-shared
Shared stuff for Tetratto
-
csvpp
Compile csv++ source code to a target spreadsheet format
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
recase
Changes the convention case of input text
-
vds
Visibly distinguishable string types for identifiers and codes
-
mdbook-jupyter
enables using jupyter notebooks in mdbook
-
gitbook2text
A CLI tool to download GitBook pages and convert them to markdown and text
-
input_py
you can use input like python3
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
diffy-imara
Tools for finding and manipulating differences between files
-
udpipe-rs
Rust bindings for UDPipe - a trainable pipeline for tokenization, tagging, lemmatization and dependency parsing of CoNLL-U files
-
docki
cli for building and publishing documentation using asciidoctor
-
gh-emoji
Convert
:emoji:to Unicode using GitHub’s emoji names -
tmpltr
Template-based document generation CLI
-
mat-o-viewer
A modern terminal file viewer combining cat, less, and grep with syntax highlighting and markdown rendering
-
ansimake
Quickly convert pixel images of ANSI art created with AI to actual ANSI art
-
blame-rs
Line-by-line authorship tracking for revisioned text
-
include-preprocessor
Tooling for C preprocessor style include directives
-
create_broken_files
Create broken files from other ones
-
dprint-plugin-markdown
Markdown formatter for dprint
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
sortuniq
Find or count unique values in an input stream
-
mdbook-driver
High-level library for running mdBook
-
charx
A replacement for char::is_ascii*
-
egui_text_animation
text animation using egui
-
ya
YAML parser in pure Rust
-
sourcefile
Retain mapping information when concatenating source files, to make error messages more useful
-
univiz
A command-line tool for analyzing Unicode strings, providing detailed information about graphemes, code points, and UTF-8 byte sequences
-
wkhtmlapp
Convert html to pdf or image
-
hemoglobin
Bloodless
-
array_tool
Helper methods for processing collections
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
spel-right
A fast and lightweight spell checker and suggester
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
bump-bin
Increments version with semver specification
-
ps-str
String transcoding library
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
adobe-cmap-parser
parse Adobe CMap files
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
md_check
Check markdown link validity
-
pager2
pipe your output through an external pager
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
unsafe-tools-mimic
Size and alignment matched opaque types
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
mdbook-selfpath
A preprocessor for mdbook that allows self-referencing paths
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
scraps_libs
Scraps is a portable CLI knowledge hub for managing interconnected Markdown documentation with Wiki-link notation
-
fmd
Find Markdown files by metadata - Search by tags, frontmatter, and custom fields
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
colored_text
adding colors and styles to terminal text
-
kd-rust
A crystal clear command-line dictionary
-
inlet_manifold
A general purpose highlighting library
-
trump
Spawn a background thread that prints a Donald Trump quote every 10 seconds
-
opencc-fmmseg
High-performance OpenCC-based Chinese conversion using FMM (Forward Maximum Matching) segmentation
-
vibrato
viterbi-based accelerated tokenizer
-
regextrie
High-performance Rust library designed for matching a large number of regular expressions against a given input string
-
snips
Keep code snippets in markdown files in sync
-
unified-diff
GNU unified diff format
-
custard
A frontmatter-querying server
-
edit-without-waiting
Open a file in the default text editor
-
deucalion
High-performance Windows library for capturing decoded FFXIV packets
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
shift_or_euc
Detects among the Japanese legacy encodings
-
yy1
Tiny utility to convert KiCad centroid files into Neoden YY1 pick and place machine format
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
convert-to-spaces
Convert tabs to spaces in a string
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
like
A SQL like style pattern matching
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
mdbook-blox
A preprocessor for mdbook to add figures and admonishment-like blox
-
skyspell_kak
skyspell - kakoune integration
-
rlex
A cursor-based, utf-8 Vec<char> lexer
-
case_clause
case clause macro for rust
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
facet-singularize
Fast, no-regex English singularization for the facet ecosystem
-
udataframe_rs
A pure Rust library for data frame operations, particularly useful for processing data extracted from PDF files or OCR recognize
-
enma
serving anime and manga information 📦
-
luau-lexer
A lexer for the luau language
-
text-editing
string with utilities for editing
-
plot3d
reading, writing, and manipulating NASA PLOT3D structured grids
-
herring-automata
Automata construction for Herring
-
europa
A lightweight AI utilities library for Rust
-
strval
Parse strings into values
-
gspell
Rust bindings for gspell
-
flowmark
Fast, modern Markdown formatter with smart typography and paragraph wrapping
-
laser-pdf
programmatic PDF generation with precise, predictable layout control
-
lformat
Clone of Lua
string.formatin Rust based on Cs(n)printf -
regex-charclass
Manipulate and convert regex character classes
-
rrag
High-performance Rust framework for Retrieval-Augmented Generation with pluggable components, async-first design, and comprehensive observability
-
streampager
pager for command output or large files
-
japanese-codepoints
A high-performance Rust library for Japanese character validation and code point handling based on JIS standards
-
pullup
Convert between markup formats
-
gtk4cmark
A GTK4 widget for rendering CommonMark content
-
dictator-frontmatter
Markdown frontmatter decree for Dictator structural linter
-
catfx
Apply effects to a text stream
-
maddi-recipe
parsing and scaling markdown recipes
-
prettyless
Wadler-style pretty-printing combinators in Rust with more features
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
zz-data
Data structures for Zanzarah apis
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
kathoey
text feminization using open corpus linguistics data
-
filecheck
writing tests for utilities that read text files and produce text output
-
ansi-width
Calculate the width of a string when printed to the terminal
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
semantic-query
AI-powered schema validation with automatic JSON generation for type-safe responses
-
deno_node
Node compatibility for Deno
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
slugi
slugify text (e.g., 'Café au lait' → 'cafe_au_lait')
-
learner
learning stuff
-
cron_clock
A cron expression parser and schedule explorer. Rich documentation and case studies and related upper-level libraries are available.
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
yara-x-fmt
A code-formatting library for YARA rules
-
feedr
feature-rich terminal-based RSS/Atom feed reader written in Rust
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
niho
A command-line tool for converting romanized Japanese text to Japanese characters
-
ezemoji
Catigoryized Emoji's
-
anda_nexus
Anda -- an AI Nexus powered by the Knowledge Interaction Protocol (KIP)
-
nonsense
Lorem ipsum placeholder text generator with clipboard integration
-
enum-ts
TypeScript Enum pattern matcher codegen
-
yamake
yet another make tool
-
quranize
Encoding transliterations into Quran forms
-
artifacts-rs
Rust client for Artifacts
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
expression_parser
minimal Rust parser and evaluator for arithmetic expressions with variables, built using the pest parsing library. It converts a text formula into an abstract syntax tree (AST) and…
-
xarray
version of the XArray with copy-on-write capabilities
-
companies-house-api
consuming various APIs from Companies House
-
hangeul
Korean alphabet manipulation library
-
flo_rope
An attributed and streaming implementation of the rope data structure
-
mdbook-langtabs
An mdbook preprocessor that adds language tabs for code blocks
-
tracery
Text-expansion library
-
ised
An interactive tool for find-and-replace across many files
-
tectonic
A modernized, complete, embeddable TeX/LaTeX engine. Tectonic is forked from the XeTeX extension to the classic “Web2C” implementation of TeX and uses the TeXLive distribution of support files.
-
clipboard-stream
Async stream of clipboard change events
-
mdbook-glossary
mdBook preprocessor to generate glossary tables
-
ik-rs
chinese segment, ik-analyzer for rust
-
vidyut-lipi
A Sanskrit transliterator
-
json-carver
Digital forensics tool that reads (carves) JSON strings from a dump. Think of it as a more accurate and faster replacement for the strings(1) utility.
-
libappindicator-zbus
zbus implement for libappindicator
-
kodegen_native_notify
KODEGEN.ᴀɪ: Memory-efficient, Blazing-Fast, MCP tools for code generation agents
-
mdbook-godbolt
A preprocessor for mdbook to add runnable code snippets via Godbolt
-
opentalk-types-signaling-meeting-notes
Signaling types the OpenTalk meeting-notes module
-
nib
static site generator
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
reggy
friendly, resumable regular expressions for text analytics
-
jawk
JSON AWK
-
pathmut
Command line utility for manipulating path strings
-
journey-cli
A CLI-based journal application with automatic timestamping, vault management, and Obsidian integration
-
unicount
Alphabetic counter supporting unicode
-
pinyin2ch
converting Chinese Pinyin to Chinese characters with various levels of detail
-
frawk
an efficient Awk-like language
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
speedreader
A command-line speed reading tool
-
ngram_rs
Facilitate creating ngrams in Rust to be used in the polars plugin
-
reason-shell
Reason: A Shell for Research Papers
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
logappend
Execute child process, read from stdin and stderr, emit into files, truncate at given total content sizes
-
yuuang-test-napi
N-API bindings
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
nsys-curses-utils
Rust *curses utilities
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
matchr
A fast fuzzy matcher library written in Rust for use in CLI tools and TUI apps
-
skills-ref-rs
agentskills library for validating, parsing, and managing Agent Skills
-
sansaccent
Convertit les chaînes françaises en slugs URL-friendly en supprimant accents et caractères spéciaux
-
qpprint
console printing/formatting
-
fast-str
A flexible, easy-to-use, immutable, efficient
Stringreplacement for Rust -
jira-mcp-server
AI-friendly JIRA MCP server with semantic search capabilities
-
typoglycemia
A function to convert text to typoglycemic format
-
retrogress
Progress bars with a thin API around complex features
-
matrix-ui-serializable
Opinionated abstraction of the matrix-sdk crate with serializable structs
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
hexdump
Easy hexdump to stdout or as an iterator
-
twitter_text_config
Configuration for twitter-text in Rust
-
seams
High-throughput sentence extractor for Project Gutenberg texts with dialog-aware detection
-
tdoc
assorted CLI tools for working with FTML (Formatted Text Markup Language) documents
-
dirdocs
Generate terse, one-sentence documentation for files; plus Nushell-style directory viewers (dls, dtree) that surface .dirdocs.nu descriptions
-
rzozowski
A regex crate using Brzozowski derivatives
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
tergo-tokenizer
R language tokenizer
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
harfshapedfa
Glue functionality between skrifa & harfrust
-
heiwa
A minimalist flat file CMS
-
files-to-text-cli
Files to text cli
-
cn-font-split
划时代的字体切割工具,CJK与任何字符!支持 otf、ttf、woff2 字体多线程切割,完美地细颗粒度地进行包大小控制。A revolutionary font subetter that supports CJK and any characters…
-
askalono-cli
detect the contents of license files
-
strf
Format strings with arbitrary expressions using highlighting friendly syntax
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
markdown-extract
Extract sections of a markdown file
-
timeblok
A language for event scheduling in plain text
-
penmanship
A Unicode character lookup library for converting text patterns to Unicode characters
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
prettychars
Unicode text styling and named glyph lookup with zero runtime overhead
-
modeling
tools to analysis different languages by Ctags
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
markdowndown
acquiring markdown from URLs with smart handling
-
screenplay-doc-parser-rs
Tools to parse Screenplay-formatted documents into semantically-typed structs
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
pretty_regex
🧶 Elegant and readable way of writing regular expressions
-
linebreak
breaking a given text into lines within a specified width
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
turndown
An opionated Rust port of Turndown.js
-
libphext
A rust-native implementation of phext
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
mind-tree
Organize your thoughts in a tree-like structure
-
mongodb-gridfs
Mongo GridFS
-
rtranslate
dependency-free Rust wrapper for Google Translate public web API
-
minigrep_pack
A small clip clone of minigrep
-
bitutils2
A package of tools for bit manipulations, including bit indexing, bitfields, and a variation of regular expressions for binary data
-
vesti
A preprocessor that compiles into LaTeX
-
hexxy
all over the place!
-
colored-hexdump
Create beautifuly colored hexdumps
-
easy-regex
Make long regular expressions like pseudocodes
-
filename-refactor
Command to refactor file names
-
emoji
Every emoji, their metadata, and localized annotations
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
sniffer-rs
that simplifies fuzzy string matching in rust
-
ezstr
A String wrapper supporting negative indexing with grapheme indexing for slices and Regex::find_iter and Regex::find
-
bpe-match
A pattern matching library for BPE tokenization, intended to replace regex-based approaches
-
promptify
A plaintext directory formatting tool for interacting with LLMs on the command line
-
minigrep_wise
minigrep project from the rust book project
-
see-cat
A cute cat(1)
-
ps-hash
Generates 64-byte ascii hashes with 256 bits of security
-
tantivy-tokenizer-api
Tokenizer API of tantivy
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
wrapr
wrap your code for ai
-
furze
finite state transducers (fst) writen in rust
-
afrim-translator
Manage the predication system of the afrim input method
-
xgrammar-rs
Efficient, Flexible and Portable Structured Generation for Rust - Rust bindings for XGrammar
-
cfd16-assembler
An assembler backend for the CFD-16 ISA
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
riimut
Transform latin letters to runes & vice versa
-
src2md
Turn source code into a Markdown document with syntax highlighting, or extract it back
-
viks
vim-like key crate
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
FlashCardsForWiedner
It's a flashcard quiz taker. Takes in a .txt
-
clipcount
Counting words from the clipboard content
-
rjot
A minimalist, command-line jotting utility that's fast, private, and git-friendly
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
avila-regex
Regex nativo - substitui regex
-
mq-hir
High-level Internal Representation (HIR) for mq query language
-
tessera-embeddings
Multi-paradigm embedding library: ColBERT, dense, sparse, vision-language, and time series models
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
json_keyquotes_convert
convert JSON from and to JSON without key-quotes
-
ohos-ime
Bindings to the
inputmethodAPI of OpenHarmony -
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
rexify
trait based matcher alternative to Regex
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
text-scatters
A cut-up technique generator from text and ebook files in the terminal
-
cosmic-text
Pure Rust multi-line text handling
-
bmo-to-md
Dump a bugzilla bug to markdown, with attachments unziped
-
aho-corasick
Fast multiple substring searching
-
mdbook-plugin-utils
mdBook plugins
-
bash-builtins
implement loadable builtins for bash
-
aki-mcolor
mark up text with color
-
genpdfi
User-friendly PDF generator written in pure Rust
-
hunspell-rs
Rust bindings to the Hunspell library
-
ik-mini-epub
Minimal async IK to EPUB downloader | Extremely minimal
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
mdbook-fs-summary
Summary generator for mdbook
-
term_grid
formatting strings into a grid layout
-
basen
Convert binary data to ASCII with a variety of supported bases
-
biometrics
provide the vitals of a process in the form of counters, gauges, moments, and T-digests
-
cwc
A word counter utility that properly handles CJK and Unicode text
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
chaf
A CLI filter tool that removes lines based on logical conditions using AND, OR, NOT, and parentheses
-
subslay
Text → emoji 💅🏻 Powered by Rust
-
yekdast
normalizing and cleaning up Persian (Farsi) text
-
css_recess_order
Recess-based sort order for CSS properties
-
alass-util
convenience API for subtitle synchronization with alass-core
-
yozuk
Chatbot for Programmers
-
minigrep_xyz1
For finding strings in text
-
icondata_zi
Zi icon set for the icondata family of crates
-
marisa-rs
Safe Rust wrapper for the marisa-trie C++ library
-
markov
A generic markov chain implementation in Rust
-
string-replace-all
String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns
-
mdi
markdown include
-
rustruut
Text-to-IPA converter and phonetic translator for Rust, powered by the Goruut phonemization engine
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
mdloc
command-line tool for processing image links in Markdown files. Download remote images and convert them to Base64 embedded format or local file references.
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
purlu
A full-text search engine
-
thesaurus
An offline thesaurus library for Rust
-
fortune-rs
classic BSD fortune program
-
miku_best_friend_printer
Contains a MikuPrinter that can produce an ASCII art of Miku - your best friend - from the music video 'Monitoring (Best Friend Remix)' by DECO*27
-
upid
Universally Unique Prefixed Lexicographically Sortable Identifier
-
galaxy_save_core
Defining common types and traits used in the save data of Super Mario Galaxy and Super Mario Galaxy 2
-
waterui-text
Text and typography components for WaterUI
-
eco
reasoning about breaking changes in Rust ecosystems
-
ojcmp
online judge comparer
-
mdbook-typst-pdf
mdbook typst pdf backend
-
duvet
A requirements traceability tool
-
tuicam
Terminal-based camera with switchable modes
-
text-tokenizer
Custom text tokenizer
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
vn-settings
Various settings intended to simulate visual novels
-
leptos_forge_ui_components
Components you can use in your leptos_forge application
-
unicode-rs
A comprehensive Unicode character library for Rust applications with theme support
-
viddy
A modern watch command
-
opentalk-report-generation
OpenTalk report generation functionality
-
sesters
💱 Fast, offline currency converter 💴 💷 💶 💵
-
dala
Dalia is a light weight formula language
-
ilyvion-util
Collection of utility functions and types for use in my personal projects
-
ricat
A Rust-Based implemenation of classic UNIX
catcommand -
scanlex
lexical scanner for parsing text into tokens
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
basalt-core
core functionality for Basalt TUI application
-
toolstr
formatting data into String's
-
cloc
Count, or compute differences of, lines of source code and comments
-
markdown-toc
Markdown Table of Contents generator
-
emoji-search
Fast fuzzy emoji searcher and picker for the terminal
-
tiktokenx
A high-performance Rust implementation of OpenAI's tiktoken library
-
asimov-module-cli
ASIMOV Module Command-Line Interface (CLI)
-
text-parsing
Hierarchical text processing preserving char position info
-
typos-cli
Source Code Spelling Correction
-
scan-rules
some macros for quickly parsing values out of text. Roughly speaking, it does the inverse of the print!/format! macros; or, in other words, a similar job to scanf from C.
-
taboc
A table of contents generator for markdown documents
-
codump
A straightforward and flexible code/comment dump tool
-
readability-js
wrapper for Mozilla's Readability.js library
-
is_printable
Determine whether a given text-based value is printable
-
keep-a-changelog
generating and manipulating CHANGELOG.md files that use the Keep A Changelog format
-
text-document
Text document structure and management
-
simple-xlsx-writer
memory efficient XLSX writer
-
lightgrep
A fast, ergonomic grep-like tool in Rust
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
bashdoc
generating documentation/help menu for user defined bash functions
-
retrofont-cli
CLI for retrofont: Render and convert retro ASCII/ANSI art fonts
-
diamond-types
The world's fastest text CRDT
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
chonkier
🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library
-
ipa-translate
translating between IPA and ASCII text
-
eldenring
Raw structures and bindings for From Software's title Elden Ring
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
mdbook-inline-highlighting
mdBook preprocessor that enables support for inline highlighting
-
sayit
String replacements using regex
-
chatter
Translate a human-readable dialogue script into Rust
-
popgetter-py
Python bindings for popgetter
-
commit_crafter
AI powered tool for Git commit message generator
-
nstd_core
NSTD core crate
-
magic-regexp
creating regular expressions with ease
-
mdbook-org
A preprocessor plugin for mdBook that enables org-mode file support
-
skill-tree
generate graphviz files to show roadmaps
-
ncase
Enforce a case style
-
iregex-automata
Finite automata definitions for the
iregexcrate -
docbox-secrets
Docbox secret management abstraction
-
llmtext
Turns any website into a single LLM-ready markdown file
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
izihawa-tantivy-stacker
term hashmap used for indexing
-
magic_string
magic string
-
antex
Styled text and tree in terminal
-
mpdf
writing PDF files
-
human-time-cli
A command-line tool for converting time durations to human-readable formats, built using the
human-timecrate -
pretty-xmlish
Pretty print XML-ish data with unicode art
-
dingtalk
Robot Util, Send text/markdown/link messages using DingTalk robot, 钉钉机器人
-
uwurs
UwUify your strings with uwurs!
-
animated-emojis-rs
Noto Animated emojis
-
merge3
merge tool for three-way merges
-
regex_generate
Use regular expressions to generate text
-
count-md
configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents
-
fx-mistral
leverage the Mistral API for OCR and data extraction from PDFs
-
wagyan
CLI tool to convert text into extruded ASCII STL meshes (TTF/OTF supported)
-
mdplayscript
An extension of Markdown for play scripts
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
savagestr
SAVAGE string encoder/decoder. If can encode or decode by specifying the code page or the encoding name, it works, else it uses the savage way to encode/decode by using
String::from_utf8_lossy() -
ada-idna
Fast IDNA (Internationalized Domain Names in Applications) implementation in Rust
-
unicode-width-16
Determine displayed width of
charandstrtypes according to Unicode Standard Annex #11 rules -
git2prompt
command-line tool that takes a GitHub repository URL, downloads its contents, and generates a single text file optimized for use as input to AI tools
-
kotoba-formatter
Kotoba Code Formatter - Formats .kotoba files with consistent style
-
mdbook-linkcheck
A backend for
mdbookwhich will check your links for you -
libretranslate
A wrapper for the LibreTranslate web API
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
scanix
search a text or pattern in files. A fast and lightwight text tool.
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
doc_loader
A comprehensive toolkit for extracting and processing documentation from multiple file formats (PDF, TXT, JSON, CSV, DOCX) with Python bindings
-
ranked-searcher
Search inside text files using tf-idf formula, showing the most relevant search at the top
-
textgridde-rs
dealing with Praat TextGrid files. MIT licensed.
-
human-names
A random generator for human first and last names
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
copier
A bidirectional file content aggregator and extractor that converts between files and markdown-formatted representations
-
transmutation
High-performance document conversion engine for AI/LLM embeddings - 27 formats supported
-
re_view_text_log
A view that shows text entries in a table and scrolls with the active time
-
regex_ansi
ANSI escape sequence regular expression (Rust port of ansi-regex behavior)
-
directwrite
A safe abstraction for interacting with DirectWrite, intended initially to be used with direct2d for easy text rendering
-
string-overlap
A helper crate for "layering" ASCII art
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
typship
A cli for typst packages
-
ascii-img
Convert images to ASCII
-
mdbook-markdown
Markdown processing used in mdBook
-
ascii-img-cli
Command-line tool for using ascii-img
-
ftb
A fast CLI tool to format and align Markdown tables
-
cistring
A string type that preserve case, but compares insensitiveley
-
slugify-core
Fast, Unicode-aware slug generation library with multi-language bindings
-
vtashkov-bf
Brainfuck interpreter
-
mdbook-translator
A translation preprocessor plugin for mdBook that automatically translates documents using the DeepSeek API
-
cgp-dispatch
Extensible data type dispatchers for CGP handlers
-
sik
A fast and concurrent command-line tool for searching patterns in files
-
subtitles
A command line programme to generate SRT format subtitles for use in video players
-
prometheus-32bitfix
Prometheus instrumentation library for Rust applications
-
mcat
a powerfull extended cat command, to cat all the things you couldn't before
-
webgrep
grep the web: a full-browser-spec search-focused ultra-simple way to read the web without having to leave the terminal
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
veltox
A fast, themeable static site generator written in Rust — ideal for documentation and blogs
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
ascii_help
help you quickly convert ASCII codes
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
glow_glyph
A fast text renderer for glow, powered by glyph_brush
-
justcsv
CSV-file reader/writer
-
lsp-ty
type definitons for LSP
-
serenity_utils
provide additional utilies for Discord bots created with serenity
-
basic64
A straightforward implementation of base64
-
gdnative-doc
Documentation tool for gdnative
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
interslavic
in rust
-
artificial-openai
OpenAI backend adapter for the Artificial prompt-engineering SDK
-
thoth-note
note-taking app written in Rust
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
treebender
An HDPSG inspired symbolic NLP library for Rust
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
fsays
flavored replacement for the classic cowsay
-
artificial-prompt
Fluent builders and helpers for composing markdown prompt fragments
-
gigagei
random quote fetching console utility
-
bstr
A string type that is not required to be valid UTF-8
-
codetypo-dict
Source Code Spelling Correction
-
asimov-serpapi-module
ASIMOV module for data import powered by the SerpApi search data platform
-
r-matrix
Rust port of cmatrix
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
u8char
An alternative to the char type that uses UTF-8 as its in-memory representation
-
arabic_pdf_to_text
A CLI tool to convert Arabic PDFs to text using Google's Gemini API
-
streaming_serde_json
streaming alternative to serde_json for processing JSON data sources that are too large to fit in memory
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
latex
An ergonomic library for programatically generating LaTeX documents and reports
-
neo-mime
Strongly Typed Mimes
-
htmls
parsing HTML and extracting HTML elements or text
-
opstr
‘Operate on strings’ command line utility
-
overlap-chunk
splitting text into chunks of specified size with adjustable overlap percentage
-
rumage
framework for making simple markdown sites
-
bk-tree
A Rust BK-tree implementation
-
findcar
An extensible CLI tool that filters, sorts, and aggregates search results from car websites. Because of course you want to search for cars from the command line.
-
catalog-of-markdown
Generate the catalog of markdown file
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
real_time_note_taker
A terminal UI tool to take time stamped notes in real time
-
regex-chunker
Iterate over the data in a
Readtype in a regular-expression-delimited way -
utf8-fix
Fix invalid UTF-8 sequences in-place while preserving buffer size - useful for fuzzing and mutation testing
-
shellvetica
Bring your shell output to the web
-
getch-rs
getchfor Windows and Unix -
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
fusefiles
Concatenate a directory full of files into a single prompt for use with LLMs
-
ethan-rs-wc
The ethan-rs-ws(erwc) is word, line, character, and byte count. Like wc command but not just wc command, more accurate and faster. Text can also be read from standard input for statistics.
-
xi-rope
A generic rope data structure built on top of B-Trees
-
yara-x-cli
A command-line interface for YARA-X
-
ccase
Command line interface to convert strings into any case
-
tectonic_engine_bibtex
The
bibtexprogram as a reusable crate -
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
webdog
static site generator fit for a dog
-
minigrep_mm
minigrep app from rust-book with public lib methods
-
memchunk
The fastest semantic text chunking library — up to 1TB/s chunking throughput
-
steer-tui
Terminal UI library for Steer
-
tidyvcf
command-line tool to convert VCF files to tab/comma separated tables
-
quake_text
Utils for Quake strings and characters
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
strloin
copy on write slices of a string
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
onig_sys
onig_syscrate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
toonconv
CLI tool for converting JSON to TOON (Token-Oriented Object Notation) format
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
vestalia
Vestaboard API wrapper
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
stylish-style
Internal implementation details of
stylish-core -
lowcharts
draw low-resolution graphs in terminal
-
badascii
Backend rendering library for BadASCII diagrams. Block diagrams in ASCII.
-
lexical-sort
Sort Unicode strings lexically
-
etch
Not just a text formatter, don't mark it down, etch it
-
tkrar
Count frequency of words in a file or a directory
-
asciidoctor-client
A kludge to improve the performance of static site generators that use asciidoc through its cli
-
grapheme-utils
Handy utils for working with utf-8 [unicode] Extended Grapheme Clusters
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
diff-man
diff utility lib
-
grapheme
Abstractions for working with extended Unicode grapheme clusters
-
justify
plaintext while handling Unicode gracefully
-
rustyink
Blazing fast static site generator
-
suffixsort
suffix (inverse lexicographic) sorting
-
coverio
Better code coverage reporting for Rust crates
-
docstr
Ergonomic multi-line string literals
-
mutf8
Basic support for mutf8 strings
-
maytrix-value
Core value types for Maytrix
-
ib-unicode
Unicode utils
-
probe-code
AI-friendly, fully local, semantic code search tool for large codebases
-
rfc9839-rs
RFC9839 to test for problematic Unicode code points
-
choco
markup language for dialogue systems
-
html-index
Generate an HTML index
-
puppet-fmt
Automatic code formatter for puppet manifests
-
Contact_Book
contact book
-
langsan
sanitizing language model input and output
-
pulldown_typst
A pull parser for Typst markup
-
ricecoder-research
Research and analysis utilities for code understanding
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
foundation_nostd
Foundational nostd core implementation for the ewe_platform project
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
hns
Human numeric sorting program — does what
sort -his supposed to do! -
lex-analysis
Semantic analysis for the lex format
-
ogam
A markup language for story writers
-
waken_snowball
Snowball stemming algorithms for 33 languages
-
ttpkit-utils
Various utilities for ttpkit
-
typope
Pedantic source code checker for orthotypography mistakes and other typographical errors
-
hebrew_accents
finding, filtering, and displaying Hebrew accents, specifically focusing on the Tiberian accent system as documented by the Masoretes
-
md-to-incodoc
Convert markdown to incodoc
-
fast_trie
A memory efficient trie library
-
base64-lib
A base64 encoder / decoder
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
scribe-patterns
Advanced pattern matching and search algorithms for Scribe
-
asciimath-unicode
Convert asciimath to unicode
-
code-to-pdf
Generates a syntax-highlighted PDF of your source code
-
substr-iterator
Substring extractor based on characters without allocation
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
libharu
Rust binding for libharu(http://libharu.org/) PDF library
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
rslug
fast, and configurable library to create URL-friendly slugs from strings
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
ib-pinyin
一个高性能拼音查询、匹配库
-
asoiaf-api
Rust wrapper for the api of ice and fire
-
spcfile
.spc file format reading and writing
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
pdfcr
render a codebase to a pdf
-
pups_text
Pretty Understandable Parsers
-
livre
type-safe PDF parser
-
ari
ari. the extras library.
-
re2
Wrapper for the re2 C++ regex library
-
safe-regex-compiler
Regex compiler for the safe-regex crate
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
natural-sort-rs
sorting strings based on Natural sort order
-
mdast_util_to_markdown
Markdown to AST
-
unicode-vo
Unicode vertical orientation detection
-
md2adf
Markdown to Atlassian Document Format converter
-
tiefdownlib
manage and convert TiefDown projects
-
assemblage_db
Distributed Document/Graph DB for Connected and Overlapping Pages
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
ob
A Blog and RSS system written in Rust
-
artificial-types
Reusable prompt fragments and helper types for the Artificial prompt-engineering SDK
-
mdzk
Plain text Zettelkasten based on mdBook
-
synthphonia-rs
DryadSynth solver for string transformation programs
-
zine
opinionated tool to build your own magazine
-
wildcard-trie
A space-efficient router / radix-trie that supports wildcards
-
arborium-theme
Theme support for arborium syntax highlighting
-
char-ranges
Iterate chars and their start and end byte positions
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
vregex
Regular Expression engine
-
carlotk
The main library for Carlo, a simple interpreted programming language
-
markdown-live-preview
A live Markdown preview server with WebSocket and HTML rendering
-
winsfs-cli
Site frequency spectrum estimation based on window expectation-maximisation algorithm
-
codebook_downloader
Dictionary downloading utility for the Codebook spell checker
-
noctisroll
Text-based TRPG dice rolling system
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
ironstorm_lookup
Lightning fast lookup table for auto completion, type ahead, suggestion engines
-
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
character-set
High performance
set.contains(char) -
aneurysm
Brainf**k interpreter
-
lasr
Live Action Search and Replace
-
topfew
CLI to find high frequency occurrences in structured text files
-
kbremap
Custom keyboard layouts for windows
-
mq-crawler
Directory crawler for batch Markdown file processing
-
byteforge
A next-generation byte-level transformer with multi-signal patching and SIMD optimization
-
mdbook-renderer
assist implementing an mdBook renderer
-
quickmark-core
Lightning-fast Markdown/CommonMark linter core library with tree-sitter based parsing
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
man
Generate structured man pages
-
mdlib
A beautiful markdown note-taking application
-
zenpatch
A robust library for applying text-based patches, designed for AI coding agents with backtracking algorithm
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
bbcode-tagger
BBCode tree parser and tagger
-
textnonce
Text based random nonce generator
-
to_markdown_table
An easy way to format any data structure into a Markdown table
-
detone
Decompose Vietnamese tone marks
-
unix_str
Unix-compatible strings regardless of platform
-
file-editor
Clean, elegant API for creating and editing text files
-
binyl
A bitwise UTF-8 string inspection tool
-
weasel-gen
Random ascii generation animation until target string is met
-
memvid-ask-model
LLM inference module for Memvid Q&A with local and cloud model support
-
path2regex
Express style path to RegExp utility
-
holy-carpet
customizable blog creator
-
aimds-detection
Fast-path detection layer for AIMDS with pattern matching and anomaly detection
-
lex_sleuther
Sleuth out script languages by lexing
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
subject-classifier
classifying a commit by it's subject
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
truecase
Restore correct letter casings in arbitrary text using a statistical model
-
bbx
A robust, performant BBCode pull parser
-
orly
Download O'Reilly books as EPUB
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
agnostic-levenshtein
Levenshtein distance for ASCII or Unicode strings
-
deindent
A command line utility and Rust library to format overly-indented text
-
book2pdf
CLI utility to turn published code documentation into PDFs
-
east-asian-width
Determine the display width of Unicode characters in East Asian contexts
-
arabic-script
An expressive API for the characters of the Arabic script
-
rusty_regex
A minimalistic regex engine using the pipeline: Regex -> AST -> NFA -> DFA -> Match(String)
-
rfc9839
RFC 9839 specification
-
string-width
Accurate Unicode string width calculation for terminal applications, handling emoji, East Asian characters, combining marks, and ANSI escape sequences
-
ripmors
encoding and decoding international Morse code and several variants
-
gpui_util
A collection of utility structs and functions used by Zed and GPUI
-
bat
A cat(1) clone with wings
-
xee-ir
Xee intermediate representation and compilation to bytecode
-
superfold
A multilingual Rust library and CLI to process UTF-8 strings to exclude diacritics and fold non-phonetic graphemes into their phonetic ASCII representation
-
p4d-mdproof
Markdown to PDF converter
-
rust-functions
A collection of Rust utility functions (starting with format_number)
-
shimmytok
Pure Rust tokenizer for GGUF models with llama.cpp compatibility (SentencePiece + BPE)
-
utf8proc-sys
Unsafe rust bindings to the utf8proc library
-
bible-io
working with Bible text data structures
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
natord-plus-plus
Natural ordering for Rust
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
BBSGlib11
num BBSGlib11
-
simple_peg
A command-line peg parser implemented by Rust
-
rustme
Automatically generate Rusty READMEs
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
lex-core
Parser library for the lex format
-
block-id
generating opaque, unique, and short string values from (unsigned) integers
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
url-cleaner-engine
The engine behind URL Cleaner
-
aki-mcycle
mark up text with cycling color
-
kvu
The simplest command line tool to manage key-value pair lines
-
trees-rs
tree-rs is a drop-in replacement for the tree utility (ASCII tree view of folder directory), but written in Rust. The primary goal of this project is to provide a non-bloated and more…
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
lookbook
Component preview framework for Dioxus
-
slugify
Macro for flexible slug generation
-
libcommand
creating ascii command interpreters with builtin argument-splitting
-
text_lines
Information about lines of text in a string
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
hmd
Custom Markdown Engine for my personal blog
-
raw-string
RawStr/RawString are like str/String, but without any guaranteed encoding
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
enso-lazy-reader
An efficient buffered reader
-
hina
:]
-
ranting
Linguistic formatting placeholder extensions for rust
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
akiaki
A good old fashioned wiki engine with a flat-file database
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
rulet
figlet implementation
-
pgn-traits
Traits for chess variations that can be annotated with Portable Game Notation
-
squishyid
Shorten and obfuscate IDs
-
xml1
sane, non compliant xml parser without allocations
-
xj_scanf
Safe reimplementation of
scanf() -
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
highly_sendable
Containers for sending Strings and results within your process and beyond
-
orgflow-tui
A terminal user interface for orgflow - manage notes and tasks with a smooth workflow
-
parlib
A barebones parser combinator library, inspired by Haskells parsec
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
inkline
Display colorized ascii art to the terminal
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
jq-rs
Run jq programs to extract data from json strings
-
syntax-html
cli application for generating syntax-highlighted HTML of code files
-
detect-newline-style
Determine a string's preferred newline character
-
static-lang-word-lists
Runtime decompressed statically-included word lists
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
rst_renderer
a reStructuredText renderer
-
text-search
easy to use plug and play wrapper around tantivy for simple search scenarios
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
mask-text
mask text with multiple masking options
-
random-zh
generating random Chinese characters
-
html-linter
An HTML linting library for checking HTML structure and semantics
-
zettel-core
Core library for Luhmann-style Zettelkasten management
-
el_roi
simplify reading user input
-
arinamcnulty-markdown-parser
Markdown parser - university project
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
asciifolding
ascii folding library
-
unobtanium-segmenter
A text segmentation toolbox for search applications inspired by charabia and tantivy
-
wtf8-rs
WTF-8 encoding
-
kyte
Extensible mechanics for operational transformation in Rust that are generic with respect to their value (not constrained to text), wire-compatible with Quill and fully fuzzed
-
epcmanager
EPC text tool for RFID
-
meztide
SDK for lotide
-
fontheight-cli
Find out the vertical extents your font reaches on shaped words
-
quillmark-cli
Minimal CLI for testing Quillmark packages locally
-
translitrs
Transliteration utility for Serbian language
-
quickner-core
A fast and simple NER tool
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
dbxcase
Dropbox-compatible case-folding algorithm
-
mdbook-lint-rulesets
Modular rulesets for mdbook-lint - standard and mdBook-specific linting rules
-
swc_ecma_regexp_ast
AST definitions of ECMAScript regular expressions
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
char_reader
Safely read wild streams as chars or lines
-
perm-text
curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents
-
boykisser
A collection of compile time ASCII art and images, especially boykisser
-
tg_bot_models
Telegram Bot API types (for API 4.3)
-
sakurs-cli
Command-line interface for Sakurs sentence boundary detection
-
opentalk-types-signaling-meeting-report
Signaling types the OpenTalk meeting-report module
-
grammateus
facilitate working with Ancient Greek words
-
kmdbook-driver
High-level library for running mdBook
-
sonai_metrics
Text metrics for sonai
-
hayro-font
A parser for CFF and Type1 fonts
-
gregex
Regex solver utilizing NFA
-
yagenerator
Application that uses tinytemplate engine to generate text files. If you have a set of structured data, and need to generated a bunch of arbitrary types of files from it, this tool can help you to save some time.
-
mango
programming language (UNDER CONSTRUCTION)
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
rakugaki
rendering TTF/OTF font characters as ASCII art in the terminal
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
kind-tree
Syntatic trees for Kind compiler
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
usage-cli
CLI for working with usage-based CLIs
-
controlled-option
Custom Option type with explicit control over niches and memory layout
-
film_parser
application for parsing detailed film data from raw text, including information such as title, release year, director, screenwriter, cast, genre, and runtime, into a structured Film…
-
nfa_regex
NFA regex engine for text processing
-
dtools
A collection of tools for working with Commodore disk images
-
turbo-json-checker
A pushdown automaton low memory JSON bytes stream checker returning the JSON root-type followed by its start and end index in the Reader
-
two_five_six
Two-five-six provides high-entropy, human-friendly identifiers
-
escrit
learning languages by reading texts
-
anslatortray
translate from English to Pig Latin!
-
rust_file_encode_mode_convert
这是一个rust的库,用于检测文件的编码格式。 支持GBK,GBK2312 , UTF8, UTF16LE, UTF16BE, UTF8+BOM,UTF32 等多种编码格式。
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
cursive_hexview
hexviewer for cursive
-
ptero-cli
A text steganography CLI tool for Social Media
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
ascii_converter
converting between different ascii representations
-
strange
A static website generator
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
context-notation
Featherweight semantic notation for text
-
minigrep_sensitive
search for a querry case sensitive or not, in a file
-
mdxjs
Compile MDX to JavaScript in Rust
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
markov_str
Markov Chain implementation optimized for text generation
-
quillmark-typst
Typst backend for Quillmark
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
ramp-text
Pure Rust
-
nib-cli
A cli for a yet another static site generator Nib
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
prototty_common
Definition of common ui elements and views for use with prototty
-
cnpj
Brazilian CNPJ parsing, validating and formatting library
-
mdbook-blush
An mdBook small-caps preprocessor
-
inlinable_string
inlinable_stringcrate provides theInlinableStringtype – an owned, grow-able UTF-8 string that stores small strings inline and avoids heap-allocation – and theStringExttrait… -
tgrep
Toy grep that honors .gitignore
-
hitori
Generic compile-time regular expressions
-
jfmt
command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.
-
japanese-text
日本語テキスト正規化ライブラリ - 全角半角変換、ひらがなカタカナ変換
-
sixbit
Small packed strings
-
textwrap-cli
Command line interface for textwrap
-
linkcheck
extracting and validating links
-
wikidump
parsing Mediawiki XML dumps
-
markup_engine
Zero-dependency, trait-based, ultra-lightweight markup renderer (Markdown + HTML + HTML + pluggable). Born from LeadSheetML, useful anywhere.
-
bardic
A performance-focused toolkit for procedural narrative generation
-
docbox-storage
Docbox storage layer abstraction
-
anagrambot
find anagrams of words
-
docket
markdown to HTML documentation rendering
-
ascii_tree
generates ascii trees
-
text_distance
A collection of approximate string matching algorithms
-
kvarn-chute
A Markdown converter designed to use the Kvarn templating engine
-
hubble
Official Hubble plugin SDK for Rust
-
rubidium
Rb37
-
r4d
Text oriented macro processor
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
kanuni
AI-powered legal intelligence CLI - The Ottoman Edition
-
mors
converting morsecode to text and reverse
-
liwe
IWE core library
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
committed
Nitpicking commit history since beabf39
-
pillar
small tool to format lines into columns
-
diaryx_core
Core library for Diaryx - a tool to manage markdown files with YAML frontmatter
-
rustrings
Strings manipulation for Rust
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
corpus-preproc
A preprocessor for text and HTML corpora
-
record-query
doing record analysis and transformation
-
good-morning
A welcome message generator
-
minigrep_tuhin
A mini grep tool implemented in Rust
-
somedoc
A very simple document model and markup generator
-
pdf_composer_definitions
PDF Composer definitions crate
-
trueno-rag-cli
CLI for Trueno-RAG pipeline
-
yinzhe9
喵喵隐者9
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
tweet-memo
A CLI tool to record Twitter-style short memos in Markdown files
-
pg_str
Adds str functions to Postgresql via an extension
-
zenbase-llml
Lightweight Language Markup Language - converts data structures to XML-like markup
-
voirs-g2p
Grapheme-to-Phoneme conversion for VoiRS speech synthesis
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
case-conv
Faster case conversion crate
-
pinyin-sort
sort pinyin
-
crawdad-rkyv
Crawdad: ChaRActer-Wise Double-Array Dictionary with rkyv support
-
lindera-sqlite
Lindera tokenizer for SQLite FTS5 extention
-
replaxe
A command-line tool to replace text in files with easy patterns
-
rgon
A command-line tool written in Rust that searches for a query string within a file
-
moodle_api
Moodle Webservice Client in Rust
-
reddish
A Rust utility library
-
workstyle
Workspaces with style! This program will dynamically rename your workspaces to indicate which programs are running in each workspace. It uses the i3 ipc protocol, which makes it compatible with sway and i3…
-
swimos_model
SwimOS Generic Representation of the SwimOS Serialization Model
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
ean-rs
generating and validating EAN barcodes
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
awful_knowledge_synthesizer
Generate LLM-powered exam questions from YAML books, manpages, mdbooks, tealdeer pages, and code
-
sqdj
sqdj shortens delimited data
-
swc-formatjs-visitor
formatjs custom transform visitor for SWC
-
pdf_form
programatically filling out pdf forms
-
cgp-monad
Monadic constructs for CGP computation
-
minify_selectors_utils
minify-selectors
-
chardet
rust version of chardet
-
ferrissay
cowsay
-
zrbecker-minigrep
a mini version of the grep tool
-
base256emoji
A multibase base256emoji implementation
-
base16-rs
in Rust offers capabilities for encoding and decoding data in Base16 format. By utilizing the hex library, you can transform data into its hexadecimal representation and also decode…
-
rust_tokenizers
High performance tokenizers for Rust
-
line-rs
Extract lines from files without hacks!
-
slugify-rs
generate slugs from strings
-
casespaetzle
This package introduces common identifier case traits and a macro for case trait generation
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
serde_ssml
A robust Rust library for parsing, manipulating, and generating Speech Synthesis Markup Language (SSML) documents
-
hemoglobin-search
Hemoglobin search utilities
-
iso15924
ISO 15924 data
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
help_crafter
help message generator without hussle
-
fuzzy-datetime
Detects, completes and normalises fuzzy date and date-time strings for interoperability with chrono or direct output as ISO-8601-compatible strings
-
mtf
Markdown Table Formatter
-
nile-library
supporting nile
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
delay_writer
Wraps a writer and delays its output after each newline
-
zed-collections
Standard collection type re-exports used by Zed and GPUI
-
string_search
searching strings using substrings
-
encode_rs_fs
Read and Write files with encoding_rs
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
tagsearch
Filter plaintext files based on @keyword tags
-
ucd-raw
Uninterpreted access to the unicode UCD
-
johalun/module
FreeBSD kernel module in Rust
-
noise_search
Nested Object Inverted Search Engine
-
pest3_core
Early WIP prototype version of pest 3.0
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
rustwemoji
Twemoji for Rust
-
portmanteau
create portmanteaux
-
lipgloss-tree
A tree component for terminal user interfaces, styled with Lip Gloss
-
hello_lib
Demonstrate Generics Function
-
cerpton
A 'double' Caesar Cypher
-
mintyml-cli
Creates HTML from MinTyML, a minialist alternative syntax to HTML
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
stringprep
algorithm
-
timug
It has been created for personal blog creation purpose. Timus has its limits, but it fulfills the purposes for which it was created.
-
wdpe
WebDynpro Parse Engine
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
rust_stringz
string manipulation library for Rust that you should probably don't need. It's just for learning purposes.
-
xml-string
String types for XML
-
rawk-core
Core library for the AWK interpreter
-
ctf-brute
Brute-force utilities for Rust
-
rusttyper
Basic text layout, using rusttype
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
ragtime
Easy Retrieval Augmented Generation
-
vader-sentimental
A faster Rust version from the original Python VaderSentiment analysis tool
-
jp-location-relation
隣接する市区町村の一覧を取得
-
bocu1
BOCU-1 compressed unicode encoding
-
alpha-counter
Alphabetic counter
-
font-map
Macros and utilities for parsing font files
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
nmd-core
Official NMD compiler core
-
unicode_extension
Don't use this crate
-
bookrafter
This repository contains code related to bookrafter rendering
-
curtana
Simplified zero-cost wrapper over llama.cpp powered by lama-cpp-2
-
text_layout
Text layout algorithms
-
bgmtv
An API wrapper for bangumi.tv based on reqwest
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
trevordmiller
Personal CLI
-
atlas-memo
Atlas Program Library Memo
-
strip-prefix-suffix-sane
A small utility crate providing ‘sane’
strip_prefixandstrip_suffixmethods forstrthat returnselfinstead of anOptionwhen the prefix/suffix is not found -
fea-rs
Tools for working with Adobe OpenType Feature files
-
naming_utils
generating naming conventions, pluralizing words, and rest api paths in Rust
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
fetch_mediawiki_configuration
Generate
parse_wiki_textconfiguration code for a MediaWiki wiki -
playbill
ASCII art title generator with random gradient effects
-
brainfuck_interpreter
brainfuck interpreter written in Rust
-
htmlescape
HTML entity encoding and decoding
-
tiny-clean
A lightweight, high-performance string sanitizer with configurable rules
-
regex-split
split_inclusive for the regex crate
-
cliche
Dead simple static site generator
-
fmt-extra
Extra formaters not shipped in the standard library
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
roan-engine
The core engine for the Roan project
-
hunspell-sys
Bindings to the hunspell C API
-
bin-rs
Binary Reader from multi source
-
looking-glass
reflection & type-erasure library for Rust
-
rulebook_query
Rulebook discovery and querying engine for searching, filtering, and extracting coding standards
-
biodiff-wfa2-sys
Rust bindings for the WFA2 library
-
matchpick
Find and replace multi-lines using a match-case
-
lk-inside
A terminal user interface (TUI) application for interactive data analysis
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
opencc-sys
OpenCC bindings for Rust
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
deepl-openapi
The DeepL API provides programmatic access to DeepL’s machine translation technology
-
id32
32-character RFC 4648 lowercase Base32 IDs
-
roman_numerals_fn
A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.
-
createqrcode
A CLI tool to convert data into QR codes
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
wcrs
GNU wc in Rust
-
github-slugger
A slugger for GitHub headings
-
helia-strings
string storage and retrieval for Helia IPFS nodes
-
deliminator
Universal code documentation generator
-
asaru
Asana Tasks Search CLI
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
owlz
"Owlz" ascii emojis, created randomly or by design
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
notion2md
converting Notion pages to Markdown
-
tree-sitter-stack-graphs-python
Stack graphs definition for Python using tree-sitter-python
-
m_lexer
extensible regular expressions based lexer
-
baste64
A base64 codec
-
strip-codeblocks
strip markdown code blocks from text, preserving only the inner content
-
cofe
tiny string similarity crate
-
fastn-builtins
fastn: Full-stack Web Development Made Easy
-
charclass
define and modify unicode character classes
-
insim_core
Core types and traits for the insim crate
-
fmtt
A diff-friendly text formatter that breaks lines on sensible punctuations and words to fit a line width
-
libabbs
aosc-os-abbs maintenance
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
flux-tui
Fast and lightweight Terminal UI drawing library
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
cuteness
Cute static site (+ server) generator with a bunch of plugins :3
-
text_searcher
text searcher tool
-
mdbook-typstpdf
An mdBook backend that generates PDF output using Typst
-
comment-strip
Remove comments out of text files
-
mdbook-nice
A mdbook plugin to add nice css to your book
-
u8lit
Custom literal to convert strings to UTF-8 bytes
-
awful_book_sanitizer
CLI to clean up OCR-mangled book excerpts into readable text using OpenAI-compatible APIs
-
correct_word
A No brainer 'did you mean' library for Rust
-
mystem
Wrapper around Yandex Mystem for Rust
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
scrambler
command line tool to scramble letters
-
strip_markdown
remove markdown syntax from markdown files
-
prefix_writer
writer that can prefix text that contains mulptiple lines or incomplete lines
-
extract-strings
Extract ascii strings from files
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
scraper
HTML parsing and querying with CSS selectors
-
ansi-escape-sequences
High-performance Rust library for detecting, matching, and processing ANSI escape sequences in terminal text with zero-allocation static regex patterns
-
icemelter
minimize files that trigger internal compiler errors (ICEs)
-
bpack
Fast, simple and easy to use data compression library meant to work with byte slices of String and str types
-
cow-rewrite
Rewrite copy-on-write types copying only when it's neccessary
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
jtool-grep
notebook-specific grep tool for jtool
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
tabulate-rs
Pretty-print tabular data. Port of python-tabulate to Rust.
-
caser
Change text between PascalCase, camelCase, and snake_case
-
wcounter
Give the word and count the appearance
-
AsgoreCore
A small rust library to manipulate arabic text to fit in non-supporting arabic games or programes
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
lesspub
CLI tool for downloading Sequences from LessWrong and exporting them as EPUB format ebooks
-
bge
Rust interface for BGE Small English Embedding Library
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
unindenter
unindent text
-
bard
Creates PDF and HTML songbooks out of easy-to-write Markdown sources
-
gte-rs
Text embedding and re-ranking pipelines
-
rtml-rust
RTML for rust
-
zoitei
alphabet conversions
-
gstring
String with support for Unicode graphemes
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
goofy-animals
Generate a name in adjective-adjective-animal form
-
cute_strings
colorize strings in the terminal
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
CLI_Project_Scott_Coakley
CLI Project in Rust
-
s3grep
A fast, parallel grep tool for searching logs and unstructured content in AWS S3 buckets
-
latinrs
Rust package that transliterates text and characters into thier latin form
-
hoedown
bindings for the Hoedown markdown processor
-
upodesh
generating Bengali word suggestions based on a given input string
-
show-bytes
Display bytes as printable ascii with escape sequences as needed
-
utf8-bytes
bytes::Bytes, but UTF-8
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
zw
encoding and decoding text using zero-width characters
-
homoglyph-core
Domain layer for the homoglyph cli and service. Compute homoglyphs for an input sentence.
-
byte-num
converting numbers to bytes, and bytes to numbers in base 10!
-
az_minigrep
minigrep CLI program to search a query string from a file and print out the matched lines
-
notegraf
Core library for building a graph-oriented notebook
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
yhy-email-encoding
Low level email encoding RFCs implementations
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
jcalendar
Japanese Calendar for Rust
-
lucide-dioxus
Dioxus port of Lucide
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
remake
writing maintainable regex and managing symbol soup
-
text_manipulation_rs
generating random placeholder text in different languages
-
chromalog
A customizable logger with dynamic color coding and file logging
-
ctrl-z
A composable reader to treat
0x1Aas an end-of-file marker -
uwildmat
wildmat/uwildmat matching
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
num2en
For converting integer and decimal numbers into English cardinal or ordinal number words
-
flatlake
In the works
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
mdbook-morsels
Morsels plugin for Mdbook
-
ftrace
trace files and paths
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
spellabet
Convert characters into spelling alphabet code words
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
bubble-bath
Small and quick HTML sanitizer
-
libopenlipc-sys
Wrapper around liblipc to interact with Kindle dbus-based LIPC events
-
scfmt
formatting code with structured comments
-
csvsc
Build processing chains for CSV files
-
xenon-lexer
The Xenon compiler's lexer
-
blitztext
fast keyword extraction and replacement in strings
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
rehuman
Unicode-safe text cleaning & typographic normalization for Rust
-
rpdf
PDF command-line utils written in Rust
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
tantivy-object-store
A tantivy Directory implementation against object stores (S3, GCS, etc.)
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
solr_post
command line tool and library to post files in a directory to a Solr server
-
blackboxmc_java
BlackboxMC bindings for java.util
-
pprint
Flexible and lightweight pretty printing library for Rust
-
kicad-text-injector
A tiny CLI tool that replaces variables of the style
${KEY}within KiCad PCB (pcbnew) files -
crustword
Crusty Crosswords
-
igpay-atinlay
Translate text to Pig Latin
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
asciirend
no_std compatible ascii rendering core
-
snekdown
A parser for the custom snekdown markdown syntax
-
md_ol_util
Transform markdown unordered lists to ordered lists with current position expressions
-
thank
Shows info about crates used in your projects so you know who to thank for them and where to do the thanking
-
slicestring
slicing Strings
-
shift_or_euc_c
C API for shift_or_euc
-
clippers
Cross-platform clipboard management library
-
target_info
Get text strings of attributes concernign the build target
-
trie-match
Fast match macro
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
dec_from_char
Small library for converting unicode decimal into numbers
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingualcmdline tool. -
makepad-rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
gchemol-parser
Text parsing made simple
-
mention
code highlighter
-
kindle_formats
File formats used in the Amazon Kindle ecosystem
-
inslice
A command-line utility for filtering text input by columns and rows
-
md-kroki
Render kroki diagrams inside markdown
-
poly_doctest
Polyglot documentation snippet test generator - extract code snippets from docs and generate tests
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
tiny-grep
grep-like text search utility written in Rust
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
fasta-cleaner
Transform fasta files by upper-casing all sequence characters and removing non-ACGT sequence characters
-
rust_baht_text
Convert number to Thai Baht text
-
indeed
Append lines to a file with no shell bullshit
-
libphonenumber-sys
rust ffi bindings to libphonenumber
-
small-bwt
BWT construction in small space
-
chargrid_decorator
chargrid views for decorating other views
-
bloom-html
HtmlNode type for bloom-client and bloom-server
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
persian-tools-cli
cli for rust-persian-tools crate
-
tmenu
TUI fuzzy finder
-
pandoc_ast
deserializes and serializes the markdown ast for writing pandoc filters
-
codetypo
Source Code Spelling Correction
-
mupdf-sys
Rust FFI binding to MuPDF
-
str_overlap
Methods for finding the overlap between two string slices
-
jmemo
CUI tool for maintaining memos
-
integer_scaling
IntegerScaling wrapper for those who prefer snake_case
-
find_unicode
Find Unicode characters, the easy way!
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
sourceannot
render snippets of source code with annotations
-
uiua-doc-gen
Documentation generator for Uiua libraries
-
key-lang
To be the prettiest and simpliest script
-
replacer-cli
A flexible cli to replace strings in files or a directory
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
futf
Handling fragments of UTF-8
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
chinese-telegraph
unicode to chinese telegraph code conversion
-
dr
Command-line data file processing in Rust
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
syntaxdot-encoders
Encoders for linguistic features
-
hydroper_source_text
Source text containing line locations
-
minigrep_101
Minigrep implementation of the Rust Book
-
uclanr
A random word picker that gives you actually useful words
-
imperative
Check for imperative mood in text
-
yuto51942-servant
cli
-
yozuk-core-skillset
Set of default Yozuk skills
-
macro_colors
colorful printing macros
-
cl
A command line tool for recording changes to be collected for use in a Keep A Changelog formatted CHANGELOG.md
-
open-protocol
Atlas Copco/Torque Open Protocol specification in Rust
-
steve
Search Technical Evidence Very Easily
-
flashtext2
The FlashText algorithm implemented in Rust
-
text-utils
Text utils for unescaping and align
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
align
aligning text
-
gh_page_tool
A github gh-pages tool for static blog site
-
rep-cli
Replace text file in bulk
-
utf8-io
Traits and types for UTF-8 I/O
-
unicode_names2_generator
Generates the perfect-hash function used by
unicode_names2 -
pulldown-cmark-fork
A pull parser for CommonMark
-
runefix-cli
Unified CLI frontend for runefix tools: width detection, markdown formatter, and terminal testing
-
rspy
Pythonic interface for Rust
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
streamparser
Adapt simple stream parsers for different types of streams
-
assemblage_view
Linearized View Model and Bindings for AssemblageDB
-
synox
Program synthesis of string transformations from input-output examples
-
rawstring
A string type that can hold data which may or may not be valid UTF-8
-
mq-run
Command-line interface for mq Markdown processing tool
-
parser-cli
Command-line interface for extracting text from various file formats
-
invisible-sun
program for generating svg "suns"
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
charmap
one-to-(none/one/many) character mapping
-
saville_encode
Encode UTF-8 text into Peter Saville's colour code
-
textcat
detect text categories. It can be used to detect the language of a given text
-
pdf-rename
This script reads a list of PDF files from a specified directory and renames each file based on its content. The renaming logic uses the content of the PDF to generate a more descriptive and meaningful filename.
-
code_generator
A code generator (Currently only targets C)
-
lazy-string-replace
A lazy version of
String::replace, so that it can be formatted or recursively replaced without intermediate allocations -
incredimo
just another font for your terminal
-
token-dict
basic dictionary based tokenization
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
mdbook-check-missing-md
A backend for
mdbookwhich will find Markdowns you forgot on SUMMARY.md -
encoding-next-index-simpchinese
Index tables for simplified Chinese character encodings
-
dom_query
HTML querying and manipulation with CSS selectors
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
esc
Escape characters in strings
-
asimov-openai-module
ASIMOV OpenAI module
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
trie-alg
Trie implementation
-
base58check
encoding
-
pdfrust
PDF parser
-
opentalk-roomserver-report-generation
OpenTalk RoomServer report generation
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
ai-context-gen
A context generator for Rust repositories that creates structured markdown files with relevant information for LLMs and AI agents
-
vidyut-kosha
A Sanskrit key-value store
-
chunkr
A fast and quick chunking library for rust
-
english-core
English language inflector
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
runestr-pancjkv
rune-based Pan-CJKV support -
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
trans-case
Transform case
-
uwu-rs
uwuifying library
-
markdown-table
Creating markdown tables with Rust!
-
csvpeek-rs
A CLI tool to quickly peek into, list, and filter CSV data
-
pukram-formatting
A type to represent the formatting of the pukram markup language
-
latin
A standard library with a focus on common operations
-
convert_case_extras
Extra features for convert_case
-
tectonic_xetex_format
Tectonic/XeTeX engine data structures and their expression in TeX "format" files
-
twitter-text
in Rust
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the
mantraframework -
floating-ui-utils
Rust port of Floating UI. Utilities for Floating UI.
-
string-simple
containing some simple string utilities that I use in my other projects
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
arborium-highlight
Unified syntax highlighting for arborium - works with both static Rust grammars and WASM plugins
-
dcsv
Dyanmic csv reader,writer,editor
-
uwl
A management stream for bytes and characters
-
xee-xpath
XPath 3.1 library API
-
hex-utilities
working with hexadecimal numbers
-
matchable
Check if text is matching against string or regex in one API
-
mdbook-summary
Summary parser for mdBook
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
validated-slice
Helper macros to implement std traits for custom validated slice types
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
pdf_forms
programatically filling out pdf forms
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
ucd-util
A small utility library for working with the Unicode character database
-
rustfmt_emitter
Rustfmt emitter library
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
vew
Visualize lsof output
-
rjoin
joining CSV data on command line
-
august
& program for converting HTML to plain text
-
jmdict-fast
Blazing-fast Japanese dictionary engine with FST-based indexing
-
pencil_case
but effective library for writing GUI in bevy
-
reflexo-vec2svg
Export a frame of Typst document into svg element
-
renamer-rs
process and rename files or text
-
terraphim-markdown-parser
Terraphim Markdown Parser
-
syllable
counter for use with reading level calculations
-
cur
that will hunt for your regular expression
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate -
salvation-cosmic-text
Pure Rust multi-line text handling
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
indoc
Indented document literals
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
ltxcut
formats a table-like stream into a LaTeX-table
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
code-agent
AI-Native Code Assistant Library
-
mdopen
Preview markdown files in a browser
-
lang-unicodes
Effortlessly convert and validate text across various Unicode ranges, supporting multiple languages for seamless internationalization in your applications
-
computergeneration
compgen but all wrong
-
lithe
A Slim template engine by using Pest
-
hayro-write
rewriting pages of a PDF file
-
turn-uppercase
Small command to uppercase text in command line and copy to clipboard
-
console-traits
Basic console traits, such as setting char position and writing strings
-
delim
parsing values from delimited strings
-
abbreviation_extractor
extracting abbreviations from text
-
fmty
Composable
core::fmtutilities -
eytan-minigrep
minigrep from "the book"
-
indexrs
inefficient multi-language search index
-
unidecode
pure ASCII transliterations of Unicode strings
-
naromat
Convert text to narou novel format
-
afrim-memory
handle of sequential codes easier for an input method
-
pulldown_mdbook
A pull parser for mdBook
-
seven_seg
Seven-segment digital display for terminal
-
hangul
manipulate Hangul Syllables
-
ogrep
searching in indentation-structured texts
-
xpath-cli
Evaluate XPath selectors on XML or HTML documents
-
decondenser
Pretty-print any text based on brackets nesting
-
campfire
A tiny static site generator, greatly inspired by Zola
-
squidge
shortens delimited data
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
csv_to_table
pretty print CSV as a table
-
grammalecte_client
Grammalecte HTTP client
-
mdbook-najan
Preprocessor for the Najan mdBook
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
gimme
Pull useful data out of your clipboard
-
lindera-decompress
A morphological analysis library
-
highlights
Convert kindle highlights into different formats
-
minigrep_necimye
Functions required to find the lines in file that contains the query. query and file path should be entered through command line preceded by two dashes. Ex: cargo run -- body filename.txt
-
code-splitter
Split code into semantic chunks using tree-sitter
-
retest
Command-line regular expression tester
-
rk-utils
A collection of utility functions and data structures for rust
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
enc-check
inspect utf-8 and utf-16 character encodings
-
llm-tui
A Terminal User Interface (TUI) for interacting with Language Learning Models (LLM) using llm-cli
-
mdbook-tools
A collection of tools for mdbook
-
wildflower
Wildcard matching against strings
-
kanjiwrapper
Wrapper for kanjiapi.dev
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
mdja
日本語に最適化されたMarkdownパーサー - CommonMark + GFM対応、目次生成、読了時間計算
-
simple-markdown-parser
handling markdown. May include extra utilities
-
rustfits
A light-weight FITS file reader in Rust
-
vec-string-to-static-str
providing utilities for converting vectors of
Strings into vectors of&'static str -
notedown_ast
Notedown Abstract Syntax Tree
-
sauron-markdown
parsing markdown into sauron node
-
lf_lint
Linter that checks that all files in <PATH> ends with a newline
-
stringsext
find multi-byte-encoded strings in binary data
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
ohos-input-method-sys
OpenHarmony's input method binding for rust
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
typst-ts-core
Core function of Typst.ts
-
coen
A RegEx-based pseudo-programmable markdown written in Rust that acts as an abstraction layer for any other language
-
dokkoo
Mokk (Macro Output Key Kit) implementation written in Rust
-
crate2bib-cli
A CLI tool for the crate2bib crate
-
zalgo-text
A command line tool for generating zalgo text
-
fcowsay
working with cowsay
-
verba
working with Latin words
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
piki-core
Core library for Piki personal wiki application
-
badascii-mdbook
Embed badascii diagrams in your mdbook. See badascii.me for the editor.
-
cfasttext-sys
fastText ffi binding
-
story-dl
Story web scraping
-
py-regex
A small wrapper around the Python regex module via PyO3
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
html-auto-p
function like
wpautopin Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
table_to_html
interface to convert a
tabled::Tableinto a HTML table (<table>) -
dimgrep
command-line text search tool written in Rust, based on the minigrep example from The Rust Programming Language
-
chinese_detection
Classify a string as either English, Chinese, or Pinyin
-
lexikit
CLI that packs an entire project into a single LLM-ready document (Markdown or //-comment style) with UTF-8 detection, binary handling, glob include/exclude filters, and .gitignore awareness
-
kindle_epub_converter
sanitize epubs for kindles
-
nlf
A CLI to append newline characters (LF) at the end of text file
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
humnum
Human numeric sorting program — does what
sort -his supposed to do! -
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
best_utils_lib_ever
Best utility library/crate ever made!
-
arbitrator
Format text based on a set of rules and regexes
-
opencc-jieba-rs
High-performance Chinese text conversion and segmentation using Jieba and OpenCC-style dictionaries
-
opengrep
Advanced AST-aware code search tool with tree-sitter parsing and AI integration capabilities
-
ra_rustc_lexer
lexer used by rustc, zero stability gurantees
-
lindera-dictionary
A morphological dictionary library
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
asciir
Print ASCII table/values
-
litsea-cli
Litsea is an extreamely compact word segmentation and model training tool implemented in Rust
-
llm-shield-scanners
Security scanners for LLM Shield toolkit
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
kincaid
A word statistics library in Rust
-
marko
Programmtically format text with Markdown syntax
-
fileagg
The ultimate file aggregation tool! If you've ever needed to consolidate multiple files from a directory into a single file effortlessly
-
aki-txpr-macro
the more easy to use libaki-*
-
alemat
type-safe building of MathML
-
rcut
replacement for GNU cut that supports UTF-8
-
selmr
Package to create and use Simple Explainable Language Multiset Representations
-
libxdiff
Rust bindings for the libxdiff C library
-
grace-cli
CLI tool for processing files and strings
-
segtok
Sentence segmentation and word tokenization tools
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
utf8-supported
Determine the UTF-8 support of the current locale
-
mdbook-keeper
An improved testing experience for mdbook
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
token-parser
parsing texts into data structures
-
minigrep_jeck
minigrepis a grep clone that takes a query and searches for the query in the file; with added support for regex -
untex
Understand and manipulate TeX files with ease
-
asimov-chromium-module
ASIMOV module for Chromium (and Brave, Google Chrome) bookmark import
-
glcp
Small library to help you set and get text from a global clipboard
-
krilla-svg
Converting SVG files to PDF
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
uklatn
Ukrainian Cyrillic transliteration to Latin script
-
scribe-graph
Graph-based code representation and analysis for Scribe
-
excavate
A command line tool to excavate fields from multi-column texts
-
aranya-capi-core
Aranya's C API tooling
-
kitoken
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
-
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
simple-xml-builder
XML builder/writer
-
flerp
CLI tool that does XYZ
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
molten_ember
Render Markdown beautifully in the terminal 🔥
-
scribe-webservice
Web service interface for Scribe repository analysis
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
char-positions
Iterate chars and their positions, i.e. line, column, and byte ranges
-
typos-dict
Source Code Spelling Correction
-
e2m
A lightweight CLI tool that transforms English words into emojis using only Rust’s standard library
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
tree-sitter-stack-graphs-java
Stack graphs for the Java programming language
-
jlu
Command-line utilities for on-the-fly investigation of JSON Lines
-
sqlify
CLI tool for formatting SQL queries
-
briefli
Swiss letter management CLI using Typst - create properly formatted letters for Swiss envelope windows
-
aho-corasick-unsafe
Fast multiple substring searching
-
minisearch
A mini search which can handle both - case sensitive or in-sensitive both
-
benda
Use Bend language functions and capabilities in Python!
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
shaygrep
Minigrep from the Rust Book, for my practice
-
meme_generator_utils
Meme generator utils
-
minigrep_cargo_test
Just a simple grep exercise followed from the rust book and published in order to learn Cargo
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
spongemock
CLI-tool for random text capitalization
-
rsticle-cli
Command line tool to convert source files into narratives/articles
-
jiang_mini_grep
minigrep 查询文件的某个字符
-
tre-regex-sys
Rust bindgen bindings to the TRE regex module
-
const_format_proc_macros
detail of the
const_formatcrate -
kelp
A convert tool for Japanese
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
md-parser-kma
parser that converts basic Markdown syntax (headers, bold, italic, lists) into corresponding HTML tags
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
indent_tokenizer
Generate tokens based on indentation
-
pattern-3
Needle API (née Pattern API 3.0), generalization of
std::str::pattern -
dtxt-detect
dangerous text detection, optimized for high speeds
-
grep-regex
Use Rust's regex library with the 'grep' crate
-
to_fraktur
Function that converts any string to fraktur font
-
cmark-syntax
Syntax highlighting for pulldown-cmark
-
mdtranslation
prepare multi-lingual Markdown documents
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
md-inc
Include files in Markdown docs
-
localization
t! macro, the easiest way
-
m2h
Convert Markdown to HTML with syntax highlighting
-
daft-functions-utf8
UTF-8 functions for the Daft project
-
html-compare
compare html files
-
raw_str
type which contains a byte slice that is believed to contain UTF-8 string data, but which has not been validated
-
tiny-ver
tiny version parser
-
scripter
A screenplay compiler
-
text-fx
A collection of text processing utilities for Rust
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
ssort
CLI tool for suffix (inverse lexicographic) sorting
-
minigrep-cli-tool
cli tool to search upon a text file for lines containing specified string
-
zuk
Yozuk command-line interface
-
spc-core
A command-line tool for processing and analyzing data from SPC files
-
fozzie
A quick fuzzy finder for the terminal
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
typeline_ext_utils
operators for typeline
-
words-count
Count the words and characters, with or without whitespaces
-
minigrep_elijahkx
MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers
-
vndb_rs
Bare-bones VNDB api
-
limace
Slugify some strings
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
groop
Outline-aware grep with syntax highlighting
-
regex_static
Compile-time validated regex, with convenience functions for lazy and static regexes
-
shallow
long text
-
marker
finding issues in CommonMark documents
-
lindera-cli
A morphological analysis CLI
-
search-in-terminal
A terminal-based search tool
-
sparklet
small flashcards library
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
no-comment
Remove rust-style line and block comments from a char iterator
-
zed-util
A collection of utility structs and functions used by Zed and GPUI
-
aklat
create books from markdown files (like Gitbook)
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
google-fonts
Download and cache TTF fonts from Google
-
string_art
Convert images into thread patterns for creating string art. It generates detailed instructions in text format and provides graphical previews of the resulting patterns.
-
transition-table
transition table utilities for keyword parser
-
find-simdoc
Time- and memory-efficient all pairs similarity searches in documents
-
assert-text
the testing macro tools
-
sttx
belt for transforming speech-to-text data
-
node-emoji
Convert
:emoji:to Unicode using GitHub’s and EmojiDB’s emoji names -
stringedits
Edit trait and associated iterators for small edits to strings
-
regex-automata
Automata construction and matching using regular expressions
-
oxidized-json-checker
A pushdown automaton low memory JSON bytes stream checker
-
codetypo-vars
Source Code Spelling Correction
-
tui-syntax
Tree-sitter based syntax highlighting for TUI applications
-
transliterate1234
UTF-8 to ASCII transliteration
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
catatau
A TUI EPUB reader with fuzzy search and navigation
-
termwrap
Wrap Unicode text with ANSI color codes
-
uniwhat
Display the unicode characters text
-
molybdenum
Recursive search and replace CLI application
-
rust-md2html
CLI to convert Markdown to HTML with file watching
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
goodname
assist you with cool naming of your methods and software
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
rescue-blanket
Escape values while they are being formatted
-
miette-arborium
Arborium-powered syntax highlighter for miette diagnostics
-
ultron-syntaxes-themes
Syntaxes and themes dump for ultron
-
termio
styling terminal output with CSS-like syntax
-
markovish
Markov chain implementation for text generation
-
libanubhav
management system written in Rust
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
file-search
File indexing and search
-
intname
Full English name for any integer of any primitive integer type
-
textshrink
A minimalistic lib that reduces text size by replacing specific character sequences with single Unicode characters (e.g., 'mm' becomes '㎜'). On average, this compresses text by ~5%, with variation also ~5%.
-
xur
Toy langage made to be used in a REPL
-
homoglyph-service
homoglyph-cli
-
timfmt
A small utility for formatting code as Tim likes it
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
skribo
low-level text layout
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
mitex-lexer
Lexer for MiTeX
-
twitter_text_parser
Parser for twitter-text in Rust
-
opentalk-roomserver-types-legal-vote
OpenTalk RoomServer Types Legal Vote
-
eversal-esi
ESI Library for the Eversal project
-
smartcat
Putting a brain behind
cat. CLI interface to bring language models in the Unix ecosystem 🐈⬛ -
static_table
creates pretty tables at compiler time
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
framework
detector for different frameworks in one projects
-
fontcull-klippa
Subsetting a font file according to provided input. (Vendored fork for fontcull)
-
crypto-invert
Unicode Upside-Down Mapping
-
libfmt
A text formating package for rust
-
minigrep_xiaoai
一个简单的命令行工具,用于在文件中搜索字符串。
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
mark-html
efficient Markdown to HTML parser written in Rust
-
unitoken
Fast BPE tokenizer/trainer with a Rust core and Python bindings
-
oeb-rs
oeb电子书解析库
-
lindera-ko-dic
A Korean morphological dictionary for Ko-Dic
-
tree-formatter-rs
format hierarchical structures as text-based trees
-
rss4mdbook
a generator for mdBook as CLI tool, export RSS.xml into u want path
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
yar_markdown
Markdown handling for yar
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
ispell
Easily run ispell, hunspell or aspell from Rust
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
rustex
auto-generated LaTeX files in Rust
-
text-diff
text diffing and assertion library
-
dequote
Remove nested quotes around text
-
spezilinter
spezifisch's linter for different file formats, linting for weirdly specific stuff
-
enpsrlib
English Phrase Structure Rules library
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
wikitext_table_parser
implement for parse the wikitext's table
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
regex-lite
A lightweight regex engine that optimizes for binary size and compilation time
-
toresy
term rewriting system based on tokenization
-
kpathsea
Rust interface to the kpathsea TeX file management library
-
indentation
Formatter
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
toshi
A full text search engine based on Tantivy
-
xhtmlchardet
Character set detection for XML and HTML
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
zummi
fun lib that produces spoonerisms
-
wz
Count words, fast
-
wiki-tui2
easy to use Wikipedia Text User Interface
-
grep-search
search query in contents lib
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
doccy
brace based markup language
-
formatjson
Formats JSON files
-
md-localizer
Localize markdown with remote links
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
promptune
A tiny tool to auto-augment prompts programmatically
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
mark-rs
A Markdown parser and Static Site Generator
-
repub
convert markdown documents to epub
-
ryaspeller
lib for searching typos in text, files and websites
-
asciimath-text-renderer
Render asciimath in terminal
-
md-include
include any file in markdown files
-
dianac
An emulator, compiler, and interpreter for the Diana Compiled Language
-
kanpyo
Japanese Morphological Analyzer
-
mq-repl
Read-Eval-Print Loop (REPL) for mq query language
-
rins_markdown_parser
markdown parser written on Rust
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
unflow
DSL to convert design to code
-
html_to_epub
A command line converts .html file to .epub file
-
invisible_unicode
finding invisible unicode characters
-
minigrep21
A program to catch arguments pertaining to query and file path, and to display its search results
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
bpmf_py
A Bopomofo and Pinyin library
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
unicodeit-cli
The command line interface to unicodeit
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
tokenizers-enfer
today's most used tokenizers, with a focus on performances and versatility
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
markdown-gen
generating Markdown files
-
cindex
CSV indexing library
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
bazaar
formats and protocols
-
presciidoc
Preprocessing AsciiDoc for other tools
-
lucide-yew
Yew port of Lucide
-
staple
powerful static site generator
-
scie
research about how to build simple code identify engine for different languages
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
asimov-ollama-module
ASIMOV Ollama module
-
morse-nostd
A nostd version of the morse crate
-
yozuk-helper-english
English NLP utilities for Yozuk
-
bbd-lib
Binary Braille Dump
-
wg-ragsmith
Semantic chunking and RAG utilities for document processing and retrieval-augmented generation
-
unidok
A powerful, readable, easy-to-learn markup language
-
kindleclip
CLI tool to export a file per book in kindle 'My Clippings.txt'
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
iata_bcbp
IATA BCBP Parser in Rust Based on Resolution 792
-
bullshit-rs
A BullshitGenerator implementation in Rust
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
kth-lines
Command line tool for filtering stdin lines that just work
-
word_filter
A Word Filter for filtering text
-
beard
Handy macro to generate formatted text in rust
-
json2markdown
convert JSON data into well-structured Markdown format
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
hashmoji_generator
Code generation tool for hashmoji
-
grepmini
grep-like command line tool written in Rust to search for text patterns in files. Built following the Rust book's example.
-
tradukisto
Kinda useful natural language translation library and utility
-
write16
A UTF-16 analog of the Write trait
-
md-designer
A CLI tool for creating design docs in Markdown
-
ascii-pixel
Convert pixel art into ascii images
-
leven
Make your own blog!
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
grepox
Minimalist's grep written in Rust
-
supercat
A syntax highlighting alternative to cat
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
text-tables
A terminal/text table prettifier with no dependencies
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
pdf-create
low-level, strongly-typed PDF creation library
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
k-lighter
syntax highlighter for K language
-
kras
Detect, highlight and pretty print almost any structured data inside plain text
-
findtext_textfile
Search text in text file
-
moenarchbook
Creates a book from markdown files
-
mdbook-diataxis
A preprocessor for mdBook which helps apply the Diátaxis documentation framework
-
sauron-md
parsing markdown into sauron node
-
wordnik_list
Quick, local library to check valid words within the Wordnik word list
-
prefixtree
Hash-table-based prefix tree library
-
mitex-parser
Parser for MiTeX
-
pra
Print Random ASCII
-
endf_parser
parsing ENDF-6 format nuclear data
-
warrah
command-line utility and Rust library that sloppily removes code comments from a text file, supporting 60+ programming languages
-
gret
command line tool to search for patterns and show matches in a tree structure
-
pdf_composer_base
PDF Composer base functionality crate
-
mini-openai
An OpenAI API client with minimal dependencies
-
Ds_minigrep
grep implementation from The Rust Programing Book
-
ligotab
Format delimited data with lightweight markup
-
cattocol
Combine two text into one text as columns
-
rexis-rag
Rexis RAG - High-performance Retrieval-Augmented Generation framework with memory-first agents, vector search, and LLM integration
-
prettythanks
frontend to dtolnay/prettyplease library
-
varcon
Source Code Spelling Correction
-
yassgy
A static site generator
-
toml_to_table
pretty print TOML as a table
-
saurus
Markdown to LaTeX Transpiler
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
ahtml-from-markdown
Convert Markdown to ahtml HTML element trees
-
character_frequency
counting character frequencies in a string concurrently
-
json-peek
Amature JSON parser library designed for my specific need
-
addbib
An app to add linked bibliographies to markdown files
-
regexgrep
ripgrep tool that suports regular expressions
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
custard_lib
A core library for Custard
-
aki-json-pick
The json pick out command
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
chisel-lexers
Chisel backend lexers/scanners
-
clxrity_lilgrep
grep-like utility written in Rust
-
single_source
Generate code files from snippets in md tutorial files
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
mq-lsp
Language Server Protocol implementation for mq query language
-
admerge
Merge multiply sources into one, with advanced options
-
aki-stats
output the statistics of text, like a wc of linux command
-
exit-with-code
A CLI which returns the exit code of your choice
-
bukvalno
A cli tool for converting images to ascii art
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
pray
A tui tool for preparing a prompt to the llms
-
simplearrayhash
-
rigrep
grep from Rust Book
-
every-range
Iterator for interleaving included and excluded ranges
-
korrektor
work with Uzbek language text processing
-
marcus
An experimental Markdown parser written in Rust
-
aki-mline
match line, regex text filter like a grep of linux command
-
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
samvadsetu
LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
tform
format plain text into well-structured Markdown or HTML
-
p101_enc
convert Olivetti P101 program to and from different encodings
-
LatexImgGen-lib
generating Latex images of equations with an edition number to avoid cache flushing and timestamp attributes
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
chunk_norris
splitting large text into smaller batches for LLM input
-
kasedenv
Read environment variables by lower, upper case or case-insensitive keys
-
caribon
A repetition detector program and library
-
rep-grep
wgrep/write-grep CLI
-
deck
A command line tool to generate HTML presentations from Markdown documents
-
ragzilla
providing tools for RAG (Retrieval-Augmented Generation) pipelines
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
aglow
Render Markdown beautifully in the terminal 🔥
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
blockcounter
Counts the blocks in a stream
-
asimov-x-module
ASIMOV module
-
texrender
Thin wrapper around running
latexmkto render LaTeX documents. Also supports generating Tex documents. -
lindera-unidic
A Japanese morphological dictionary for UniDic
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
turbovault-export
Reporting and export utilities for TurboVault Server
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
mon-artist
Render ASCII art to SVG, with user-customizable rendering rules
-
encoding_c
C API for encoding_rs
-
grep-printer
grep crate's Sink trait that provides standard printing of search results, similar to grep itself
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
strfry
randomize a string
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
widest-line
Find the widest line in a string with proper Unicode and ANSI escape code support
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
swc_ecma_lexer
Feature-complete es2019 parser
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
betacode
conversion
-
rrename
" Opinionated tool to rename files in batch. Match regular expression, replace some characters I consider noise to kebab case
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
file-size
a function formatting file sizes in 4 chars
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
stylish-stringlike
API for string-like objects that have styles applied
-
forming
lightweight architecture as code language. 架构描述语言
-
tabwriter
Elastic tabstops
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
regex_quote_fixer
Rewrites grep regexpressions for the use in the regex crate
-
milligrep
Custom simplified implementation of grep
-
typos
Source Code Spelling Correction
-
paltoquet
rule-based general-purpose tokenizers
-
telegram-escape
Escape text for Telegram's MarkdownV2 format
-
syntaxdot-tokenizers
Subword tokenizers
-
moon-phases
Fast command-line application to show the moon phase
-
lilgrep
A minimal command-line tool that searches for lines matching a pattern in a file, similar to grep. Built for simplicity and learning purposes.
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
pdf-sign
extract signed date from pdf file
-
stringsort
Pathological sorting of string characters
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
typeline
Efficient, Type-Safe Pipeline Processor
-
encoded-words
Encoded Words for usage in MIME headers
-
minigrepx719
A grep implementation in Rust
-
findtext_sheet
Search text in SpreadSheet
-
rust-crlf
-
skyspell_core
skyspell core library
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
contractions
expand contractions in English
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
aghpb
Anime girls holding programming books API wrapper for 🦀Rust
-
textframe
query plain text documents by unicode offset without loading them all into memory
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
utf8-command
UTF-8 encoded
std::process::Commandoutput -
ftd-rt
ftd
-
repa
Peak Performance Pattern Seeker
-
actiondb
A safe and efficient unstructured text (log) parsing library
-
roxy_cli
A command-line static site generator
-
izihawa-tantivy-tokenizer-api
Tokenizer API of tantivy
-
kanji_hanzi_converter
Translate Japanese Kanji to Chinese Hanzi
-
pinot
Fast, high-fidelity OpenType parser
-
ghimlink
Generates a Github-Flavoured-Markdown formatted image link to an image in the same repository
-
demoji
Remove all emojis from a string
-
xssan
that sanitizes HTML to prevent XSS vulnerabilities
-
heyiwei
何意味
-
unicodeit
Converts LaTeX to Unicode (rust port)
-
markdown-tables
generating markdown-formatted tables
-
grep-searcher
Fast line oriented regex searching as a library
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
detect-indent
Detect the indentation of code
-
cli_app_capo
CLI application with Unix-like tools
-
markdown-to-html
Markdown parser that runs at hyper speeds!
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
font-awesome
A typed manifest of constants of Font Awesome unicode code points
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
top-english-words
retrieve top words from the English language
-
latin1str
Windows-1252 string types
-
epubparse
Parse epub and convert to text-only Book structure
-
term-rustdoc
A TUI for Rust docs
-
heart-strings
Quickly get random heart emojis to copy!
-
hexify
Format hex
-
rust_nickname_generater
that generates user/nick names based on the rust language
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
japhonex
Japanese phone number checker for Rust
-
csv_coincidence
Tool designed to efficiently search for and identify specific patterns within CSV files
-
ngrams
Generate n-grams from sequences
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
csvre
replacing data in CSV columns with regular expressions
-
varcon-core
Varcon-relevant data structures
-
xot
Full-featured XML tree library for Rust
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
lindera-filter
Character and token filters for Lindera
-
grep-matcher
A trait for regular expressions, with a focus on line oriented search
-
dekor
styling and character repository in Rust
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
mdbook-html
mdBook HTML renderer
-
arabic_text_utils
Arabic text processing and manipulation
-
clparse
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format
-
random_access_unicode
Provide random access to the unicode characters in a file
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
utf8toipv4
Convert UTF-8 to ipv4 addresses and vice versa
-
text2art
translate text to ascii art
-
besida
Language for defining branching dialogue
-
text-to-json
Convert text to json in rust
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
pcre2
High level wrapper library for PCRE2
-
rahat3062_minigrep
A light-weight & minimal implementation of the grep cli app
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
codetypo-cli
Source Code Spelling Correction
-
tex
The νTeX typesetting engine
-
flw
Process text via configurable tasks
-
flx
Full-text searching and scoring of strings
-
triangle-from-scratch
An educational series on how to do stuff yourself
-
strizer
minimal and fast library for text tokenization
-
grepster
command-line tool for searching text in files
-
bos_books_codes
that handles 3-character Bible Books Codes
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
tattoy-wezterm-char-props
Unicode and other character properties
-
screen_printer
displaying rectangular blocks of text to a terminal
-
bible-reference-rs
Extract Bible references from plain text
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
bigstr
A command-line tool to make string BIG
-
braille_pics
producing text-art pictures using Braille characters
-
rtlicious
A nom-based parser for Yosys RTLIL files
-
crossandra
A straightforward tokenization library for seamless text processing
-
lithe-cli
A cli of lithe
-
pranav_minigrep
Mini Version of grep regex command. It finds out the given string and returns the line containing it.
-
nerd_font
Enumerated contents of the JetBrainsMono Nerd Font
-
xxxxx_rust_sts
A collection of useful string and file utilities for Rust
-
ngram-search
Ngram-based indexing of strings into a binary file
-
mq-view
CLI tool for rendering Markdown documents with syntax highlighting and rich text formatting
-
jp_utils
Utils for working with Japanese text
-
perspicuity_formula
Calculate Flesh Reading Ease for a given text and language
-
parattice
Recursive paraphrase lattice generator
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
mitex-spec
Specification Library for MiTeX
-
letrs-fonts
A collection of FIGlet fonts
-
minigrep_bourne
classic grep command
-
rizzer
Fuzzy matching tool to find string similarity
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
mepple
English dictionary as a library
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
mini__grep
grep tool
-
kwindex
based "keyword index" library crate that maintaining an index of words from texts
-
asimov-dataset-cli
ASIMOV Dataset Command-Line Interface (CLI)
-
cw
Count Words, a fast wc clone
-
mdbook-sitemap-generator
generate a sitemap.xml file for an mdbook project
-
fvoid
Void file generator, supports many formats
-
moscato
Outline scaler for OpenType glyphs
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
rsnltk
Rust-based Natural Language Toolkit
-
fontations
Working with font files
-
minigrep_unlimate_test_of_rustbook
minigrep from rust book lul
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
base1112031
way to represent numbers using all Unicode characters except ASCII control characters
-
unicode_escape
decoding escape sequences in strings
-
asimov-readwise-module
ASIMOV module
-
morc
Dead simple, minimal markdown generator library written in Rust
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
strings
String utilities, including an unbalanced Rope
-
write-html
writing HTML in Rust
-
reedy
A terminal-based RSS reader with a clean TUI interface
-
encoding-index-japanese
Index tables for Japanese character encodings
-
bookgrep
Basic grep equivalent, minor mods to Chapter 12
-
lines_lossy
extension to
BufReadwith a functionlines_lossythat works likeBufRead::linesbut with lossy UTF-8 decoding -
spongedown
Converts markdown to html with svgbob support
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
castle_tokenizer
Castle Tokenizer: tokenizer
-
epub2mdbook
convert EPUB files to MDBook format
-
squ
command-line utility for converting quotation marks in plaintext files to "smart quotes"
-
promptchine
prompt refinement and caching library using LLMs
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
mdbook-core
The base support library for mdbook, intended for internal use only
-
dd
a clone of the unix coreutil dd
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
ron_to_table
pretty print RON as a table
-
goya
morphological analyzer for Rust and WebAssembly
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
mdbook-summary-generate
A mdbook preprocessor to generate SUMMARY.md from a directory structure
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
ssml-parser
parsing speech sythnesis markup language
-
pygmentize
wrapper for syntax highlighting
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
demoji-rs
demoji implementation for Rust
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
termbook
behind the
termbook-cli -
novalyn_core
Core library for novalyn changelog generation
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
is-vowel
Heuristically test whether a character is a vowel letter
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
glowpub
A glowfic to epub converter
-
czv
performing CSV-related operations for data engineering and analysis
-
lindera-compress
A morphological analysis library
-
scenic
Presentation tool that utilizes LaTex Beamer package
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
minigrep_flict
Simplest text-in-file search engine from rust book
-
borderrs
Add stylish borders around your text and datastructures
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
hulk
An ultra simple no hassle static site generator
-
sbert
Sentence Bert (SBert)
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
imgr
image to ascii converter
-
khat
A cat clone, nothing more nothing less
-
textos
Texts, strings, formatting, unicode…
-
tectonic_pdf_io
Xdvipdfmx’s PDF, XDV, and image I/O APIs in C, as a Rust crate
-
kaomji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
gmi2html
Convert text/gemini into HTML
-
rusty-grep
RustyGrep is a command-line tool written in Rust that allows users to search for patterns in files
-
randem
Print a random emoji optionally with the given string as seed
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
looneygrep
A powerful command-line search tool with web page support and replace functionality
-
quicksilverx
easy to use grep clone
-
lindera-cc-cedict
A Chinese morphological dictionary for CC-CEDICT
-
utf8_reader
A UTF-8 reader that read UTF-8 characters from object that implement Read trait
-
lingua-japanese-language-model
The Japanese language model for Lingua, an accurate natural language detection library
-
csv2qr
command line tool for generating QR codes from a CSV file
-
yamc
Markdown Converter - A fast and feature-rich markdown to HTML and PDF converter
-
boxy
Declarative builder for Unicode box-drawing characters
-
asciidork-opts
Asciidork opts
-
quewuigrep
grep-like tool written in Rust
-
vaporetto_rules
Rule-base filters for Vaporetto
-
drawille-nostd
A no_std version of the drawille crate
-
latex-to-html
Latex to html converter
-
autoruby
Easily generate furigana for various document formats
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
bqrs
apply boolean query to text
-
didaskalia
over-complicated language to store text entries and sequences across different languages
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
hsk
Return HSK Level for Simplified Chinese Characters
-
mq-dap
Debug Adapter Protocol implementation for mq
-
claude-tokenizer
tokenizing text with the Anthropic Claude models
-
cli_app_capo15
CLI application with Unix-like tools
-
strip-tags
Strip HTML and PHP tags from strings
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
uniaxe
replace Unicode letters with Ascii equivalents
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
indexable_str
creating indexable strings
-
atm_parser_helper_common_syntax
Parsing utilities for the common syntax specification
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
rustyword
An anagram finder
-
lingua-arabic-language-model
The Arabic language model for Lingua, an accurate natural language detection library
-
text-sanitizer
convert text to plain ASCII text
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
mdbook-translation
prepare multi-lingual mdBook books
-
catmark
Console printer for CommonMark
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
uscan
A universal source code scanner
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
kmdbook-html
mdBook HTML renderer
-
sentencepiece
Binding for the sentencepiece tokenizer
-
lignin-html
HTML renderer for lignin VDOM Nodes. This crate is primarily for static and server-side rendering. For client-side use, see lignin-dom.
-
kanabake
Turn Romaji strings into Hiragana or Katakana
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
chisel-parsers
Chisel parser front ends
-
common-words-all
Most common words sorted by ngram frequency
-
suffix
arrays
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
poetry-book
Create a poetry book in latex, starting from plain text
-
dictionary-1024
a mnemonic dictionary with 1024 words, 10 bits per word
-
tectonic_bridge_graphite2
Expose the graphite2 library to Rust/Cargo
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
tnil
Parsing, glossing, and generating utilites for New Ithkuil
-
notion2html
Convert Notion pages to HTML
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
tiniestsegmenter
Compact Japanese segmenter
-
supergrep
A super grepper
-
asimov-luma-module
ASIMOV module
-
refac
Transform some text given a generic natural language prompt
-
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
lix-score
Calculate LIX score for a given text and language
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
env2toml
Convert env vars to toml text
-
decline-word
Choose word form based on given number
-
html2runes
An HTML to Text converter
-
unicode-bidi
Unicode Bidirectional Algorithm
-
rbatis_xml_parser
Push based streaming parser for xml
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
lex-babel
Format conversion library for the lex format
-
mdbook-llms-txt-tools
convert mdbook to llmstxt.org format
-
inclip
terminal utility for clipboard interaction and convenience
-
xsv
A high performance CSV command line toolkit
-
unicode-line-stacker
Tiny library for stacking Unicode line-drawing characters on top of each other
-
markdownx
⇟
-
ced
Dead easy csv editor
-
grep-reader
short text for crates.io
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
ascii_utils
handle ASCII characters
-
kilo
small, fast utility crate/library for manipulating strings and generating sourcemaps with all in Magic 🪄
-
readable-readability
Really fast readability
-
htmlfixinator
A composable HTML transformation library with filters for cleaning, modifying, and standardizing HTML content
-
rustextile
Textile markup language parser for Rust
-
char_stream
Unified character reading interface to str, String, bytes, File and Stdin for Rust language
-
cutters
Rule based sentence segmentation library
-
worcher
full-text search for static websites
-
glyphrs
A text art generator written in Rust
-
bgrep
grep tailored to handle binary patterns and files
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
sgrep
grep util for those lazy to remember many command line options
-
lingua-latvian-language-model
The Latvian language model for Lingua, an accurate natural language detection library
-
findtext_pdf
Search text in PDF
-
conv-rs
CLI converter for multi extensions
-
mq-formatter
Code formatter for mq query language
-
lychee-lib
A fast, async link checker
-
schmfy
Schmfication library
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
lindera-dictionary-builder
Shared code for building Lindera dictionary files
-
truncrate
truncate strings economically without splitting your characters, symbols, emojis etc
-
pretok
A string pre-tokenizer for C-like syntaxes
-
unic-char-range
UNIC — Unicode Character Tools — Character Range and Iteration
-
mdbook-force-relative-links
An
mdbookpre-processor to transform all local links to relative ones -
minigrep_improved
A command line tool to search for a string in a file
-
llmvm-outsource-lib
outsource backend for llmvm
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
nb2fr
Convert integer number to literal French text
-
utfx
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
code-span
Add additional infomation to code character
-
rure
A C API for Rust's regular expression library
-
cowsay
program that generates ASCII art pictures of a cow with a message
-
note-mark
A markdown parser
-
lindera-py
Python binding for Lindera
-
ansi-to-tui-forked
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
jp_inflections
A small crate to conjugate japanese words
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
butterkups-minigrep
Mini grep utility; very weak application, use grep instead
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
namesake
A CLI tool and library to generate names by combining words
-
ibm1047
String conversions to and from IBM-1047
-
case_style
Converts from and to different case styles
-
umlauts
text transformation of german umlauts
-
cha-rs
Extract specific characters from an input
-
markdown2unicode
Converter from markdown notation to unicode characters
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
html_to_markdown
Convert HTML to Markdown
-
lll_toon
convert JPG images to ASCII art with colors
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
writedown
format parser
-
asimov-maildir-module
ASIMOV module for Maildir email import
-
xgrepx
xgrep is a rust implementation of grep. This is a follow up from the rust book