... a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other ... to text. MALLET includes sophisticated tools for document classification: efficient routines for converting text to "features", ... Learning applications, MALLET includes routines for transforming text documents into numerical representations that can then be processed ...
Details Download Save Freeware