Skip to content

Commit 267a396

Browse files
authored
Merge pull request keon#292 from proycon/ucto
added python-ucto as separate entry
2 parents cbf61dd + 358c009 commit 267a396

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -185,6 +185,7 @@ Material can be found [here](https://github.com/aws-samples/aws-machine-learning
185185
- [Sockeye](https://github.com/awslabs/sockeye) - Neural Machine Translation (NMT) toolkit that powers Amazon Translate.
186186
- [DL Translate](https://github.com/xhlulu/dl-translate) - A deep learning-based translation library for 50 languages, built on `transformers` and Facebook's mBART Large.
187187
- [Jury](https://github.com/obss/jury) - Evaluation of NLP model outputs offering various automated metrics.
188+
- [python-ucto](https://github.com/proycon/python-ucto) - Unicode-aware regular-expression based tokenizer for various languages. Python binding to C++ library, supports [FoLiA format](https://proycon.github.io/folia).
188189

189190
- <a id="c++">**C++** - C++ Libraries</a> | [Back to Top](#contents)
190191
- [InsNet](https://github.com/chncwang/InsNet) - A neural network library for building instance-dependent NLP models with padding-free dynamic batching.

0 commit comments

Comments
 (0)