OpenNLP Changelog

New in OpenNLP 1.6.0 (Jul 17, 2015)

  • Added evaluation support to the parser and doccat components
  • Added support for the Evalita 07/09, Brat and OntoNotes corpus formats
  • L-BFGS is now stable
  • Added Snowball to the Stemmer package
  • NameFinder now supports a user-defined factory
  • Added pluggable machine learning support
  • Added a lemmatizer module
  • Added Cluster, Document Begin and Clark feature generators to the Name Finder
  • Added Liblinear as a machine learning add-on
  • Entity Linker now has a command line interface
  • Added sequence classification support

New in OpenNLP 1.5.2 (Nov 30, 2011)

  • This release contains a number of new features, improvements and bug fixes. The maxent trainer can now run in multiple threads to make use of multi-core CPUs; configurable feature generation was added to the name finder; the perceptron trainer was refactored and improved; machine learners can now be configured with many more options via a parameter file; and evaluators can print detailed evaluation information.
  • Additionally, the release contains the following noteworthy changes:
  • Improved the white space handling in the Sentence Detector and its training code
  • Added more cross validator command line tools
  • Command line handling code has been refactored
  • Fixed problems with the new build
  • Now uses fast token class feature generation code by default
  • Added support for BioNLP/NLPBA 2004 shared task data
  • Removal of old and deprecated code
  • Dictionary case sensitivity support is now done properly
  • Support for OSGi

New in OpenNLP 1.4.3 (Jan 26, 2009)

  • Fixed thread safety issue in name finder.

New in OpenNLP 1.4.0 (Sep 4, 2008)

  • A number of updates to name finding with minor changes to a number of other components.
  • Improvements in accuracy to name finding, pos-tagging, and coreference models.
  • Addition of a document classification component.
  • Extended support for multilingual processing.
  • Document categorization: allows documents to be assigned to categories.
  • Updated models with slightly better data.
  • Support for different types of feature generation.
  • Support for dictionary and regex-based name finding.
  • Improvements in the speed and accuracy of the default name finder.
  • Restructures the parsing packages in support of work in upcoming releases.
  • Minor changes to computation of semantic similarity.
  • Adds processing for Thai and German.
  • Adds support to specify the encoding for many types of processing.