Skip to content

Tags: karimamd/cltk

Tags

untagged-b4997d46ce5d4c3d468c

Toggle untagged-b4997d46ce5d4c3d468c's commit message
Adding Marathi Alphabets (cltk#588)

* Adding marathi corpus from wikisource

* Recfctoring importer.py

* Refactoring importer.py

* Added new origin of CLTK after transfering the repository ownership.

* Adding Mrathi docs

* Adding Marathi docs in index

* cleanup about

* Adding albhates of marathi language

* Update alphabet.py

* Update license

untagged-7cf6c55ce9051e332613

Toggle untagged-7cf6c55ce9051e332613's commit message
Adding Marathi Alphabets (cltk#588)

* Adding marathi corpus from wikisource

* Recfctoring importer.py

* Refactoring importer.py

* Added new origin of CLTK after transfering the repository ownership.

* Adding Mrathi docs

* Adding Marathi docs in index

* cleanup about

* Adding albhates of marathi language

* Update alphabet.py

* Update license

untagged-5ecbb01a1f820e2b1739

Toggle untagged-5ecbb01a1f820e2b1739's commit message
Adding Marathi Alphabets (cltk#588)

* Adding marathi corpus from wikisource

* Recfctoring importer.py

* Refactoring importer.py

* Added new origin of CLTK after transfering the repository ownership.

* Adding Mrathi docs

* Adding Marathi docs in index

* cleanup about

* Adding albhates of marathi language

* Update alphabet.py

* Update license

v0.1.64

Toggle v0.1.64's commit message
updated docs, bump vers (cltk#575)

v0.1.63

Toggle v0.1.63's commit message
mk api updates, bump vers

v0.1.62

Toggle v0.1.62's commit message
bump vers

v0.1.61

Toggle v0.1.61's commit message
Regex lemmatizer update (cltk#565)

* Refactor RegexpLemmatizer

* Update Regexp Lemmatizer test

* Update Regexp Lemmatizer test

* Refactor RomanNumeralLemmatizer

* Add test for BackoffLatinLemmatizer; fix test coverage in general

v0.1.60

Toggle v0.1.60's commit message
bump vers for arabic

v0.1.57

Toggle v0.1.57's commit message
Fix some Arabic mistakes during coding (cltk#552)

* Add classical arabic corpus and all arabic letters/symbols

* transfer ownership corpuses to cltk

* transfer ownership corpuses to cltk

* update corpus/arabic/corpora.py

* added arabic word tokenizer

* mk pyarabic optional install

* replace if with elif

* fixed line 16 corpus/arabic/alphabet.py

* added arabic stop words

* ch print to log

* Added blank line

v0.1.48

Toggle v0.1.48's commit message
Add line tokenizer to CLTK (cltk#530)

* Add line tokenizer with tests

* Fix line tokenizer return

* Update docs to include line tokenizer

* Fix code block in docs