Mallet

From Digital Sinology
Revision as of 13:37, 12 July 2018 by Dsturgeon (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

MALLET is a Java "Machine Learning for Language Toolkit" best known for its topic modeling capabilities.

Description

MALLET is an integrated collection of Java code useful for statistical natural language processing, document classification, cluster analysis, information extraction, topic modeling and other machine learning applications to text.

History

MALLET was developed primarily by Andrew McCallum, of the University of Massachusetts Amherst, with assistance from graduate students and faculty from both UMASS and the University of Pennsylvania.

External links