MALLET For Mac

Eclipse Public License   

Java library for machine learning applied to text.

Description

MALLET is a free Java-based package for statistical natural language processing, topic modeling, information extraction, document classification, clustering, and other machine learning applications to text.

MALLET includes sophisticated tools for document classification: efficient routines for converting text to "features", a wide variety of algorithms (including Naïve Bayes, Maximum Entropy, and Decision Trees), and code for evaluating classifier performance using several commonly used metrics.
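To make the classification pipeline described above concrete, here is a minimal sketch in plain Java of its three stages: turning text into word-count features, training a multinomial Naïve Bayes model with add-one smoothing, and measuring accuracy. It deliberately does not use MALLET's own classes; the class `TinyNaiveBayes` and all of its method names are hypothetical and chosen only for illustration.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

/** Illustrative only: a tiny multinomial Naive Bayes text classifier. Not MALLET's API. */
class TinyNaiveBayes {
    private final Map<String, Map<String, Integer>> wordCounts = new HashMap<>(); // label -> word -> count
    private final Map<String, Integer> totalWords = new HashMap<>();              // label -> token count
    private final Map<String, Integer> docCounts = new HashMap<>();               // label -> document count
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    /** Text-to-features step: lowercase the text and split it on runs of non-letters. */
    public static String[] tokenize(String text) {
        return text.toLowerCase(Locale.ROOT).split("[^a-z]+");
    }

    /** Adds one labeled document to the model's counts. */
    public void train(String label, String text) {
        totalDocs++;
        docCounts.merge(label, 1, Integer::sum);
        Map<String, Integer> counts = wordCounts.computeIfAbsent(label, k -> new HashMap<>());
        for (String tok : tokenize(text)) {
            if (tok.isEmpty()) continue; // split() can yield a leading empty token
            counts.merge(tok, 1, Integer::sum);
            totalWords.merge(label, 1, Integer::sum);
            vocabulary.add(tok);
        }
    }

    /** Returns the label with the highest log-probability, or null if nothing was trained. */
    public String classify(String text) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String label : docCounts.keySet()) {
            double score = Math.log(docCounts.get(label) / (double) totalDocs); // log prior
            Map<String, Integer> counts = wordCounts.get(label);
            int total = totalWords.getOrDefault(label, 0);
            for (String tok : tokenize(text)) {
                // Words never seen in training carry no information here, so skip them.
                if (tok.isEmpty() || !vocabulary.contains(tok)) continue;
                int c = counts.getOrDefault(tok, 0);
                score += Math.log((c + 1.0) / (total + vocabulary.size())); // add-one smoothing
            }
            if (score > bestScore) {
                bestScore = score;
                best = label;
            }
        }
        return best;
    }

    /** Evaluation step: fraction of {label, text} pairs classified correctly. */
    public double accuracy(String[][] labeledDocs) {
        int correct = 0;
        for (String[] doc : labeledDocs) {
            if (doc[0].equals(classify(doc[1]))) correct++;
        }
        return correct / (double) labeledDocs.length;
    }

    public static void main(String[] args) {
        TinyNaiveBayes nb = new TinyNaiveBayes();
        nb.train("sports", "the team won the match");
        nb.train("tech", "the new laptop has a fast processor");
        System.out.println(nb.classify("which team won"));
    }
}
```

A real toolkit adds much more on top of this (feature pipelines, multiple algorithms, richer metrics), but the flow of tokenize, train, classify, and evaluate is the same.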

MALLET provides facilities not only for document classification, but also for information extraction, part-of-speech tagging, noun phrase segmentation, and much more.

The library is quite mature; however, its front-ends and documentation are not yet as polished as those of rainbow.

What's new in MALLET 2.0.7:

  • Fixed a bug in the Generalized Expectation (GE) implementation for MaxEnt models. The old code could give low accuracy when using a small number of constraints. See the note at the top of this page for more information: http://mallet.cs.umass.edu/ge-classification.php
  • Fixed a bug in SVMLight2Vectors that could result in different Alphabets when importing multiple files at once.
  • Fixed a bug in SVMLight2Classify that allowed previously unobserved features to be added to the data Alphabet, possibly resulting in mismatching Classifier and InstanceList Alphabets.
  • Fixed bugs in the search direction computation in ConjugateGradient.
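
The Alphabet fixes listed above concern the mapping from feature names to integer indices. As a rough illustration of why unobserved features must not be added at classification time, the sketch below shows a feature-to-index map that can be frozen after training. The class `FeatureIndex` and its methods are hypothetical and are not MALLET's `Alphabet` implementation.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative only: a feature-name-to-index map that can be frozen. Not MALLET's Alphabet. */
class FeatureIndex {
    private final Map<String, Integer> indexOf = new HashMap<>();
    private final List<String> entries = new ArrayList<>();
    private boolean frozen = false;

    /** After this call, unseen features are no longer added. */
    public void freeze() {
        frozen = true;
    }

    /**
     * Returns the index of the feature. An unseen feature is assigned the next
     * index while the map is growable, or reported as -1 once the map is frozen.
     */
    public int lookup(String feature) {
        Integer idx = indexOf.get(feature);
        if (idx != null) return idx;
        if (frozen) return -1;
        entries.add(feature);
        indexOf.put(feature, entries.size() - 1);
        return entries.size() - 1;
    }

    public int size() {
        return entries.size();
    }

    public static void main(String[] args) {
        FeatureIndex index = new FeatureIndex();
        index.lookup("goal");   // seen during training
        index.lookup("team");
        index.freeze();         // training is done; the classifier's weights have 2 columns
        System.out.println(index.lookup("laptop")); // unseen at test time, not added
        System.out.println(index.size());           // still matches the classifier
    }
}
```

If the map kept growing at classification time, the data would refer to feature indices for which the trained classifier has no parameters, which is the kind of mismatch the changelog describes.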

MALLET 2.0.7 / 2.0.8 RC 3

runs on: Mac OS X
file size: 12.4 MB
filename: mallet-2.0.7.tar.gz
main category: Development