I've just released Classifier4J 0.6. This new release includes a rather nice (I think) new classifier (the VectorClassifer) based on the vector space search algorithm This particular classifier is fast, doesn't require training for non-matches and is very suitable for sorting data into various categories.
If you've looked at Classifier4J in the past and run into performance problems with the Bayesian algorithm I'd be interested in your feedback on this new algorithm.
Classifier4J 0.51 has been released. It fixes some bugs in the 0.5 release
Classifier4J is a Bayesian classification tool written in Java. Version 0.5, has just been released.
Some of the changes include:
- JDBCWordsDataSource now properly stored the connection info set in the constructor (bug)
- New DataSourceJDBCConnectionManager
- New SimpleHTMLTokenizer
- New CustomizableStopWordProvider
- JDBCWordsDataSource now truncates any words longer than 255 characters
- SimpleWordsDataSource is now Serializable
- Revmoval of dependancy on commons-lang... read more
Classifier4J version 0.4 is now available.
Classifer4J is a java library that provides an API for automatic classification of text, including Bayesian classification.
Version 0.4 has numerous enhancements over 0.3, including a text summary API
Classifier4J is available from http://classifier4j.sourceforge.net/
Classifier4J version 0.3 is now available.
Classifer4J is a java library that provides an API for automatic classification of text, including Bayesian classification.
Version 0.3 is the first version recommened for general use.
Classifier4J is available from http://classifier4j.sourceforge.net/