What's new in OpenNLP 1.6.0
Jul 17, 2015
- Added evalutation support to the parser and doccat components
- Added support to Evalita 07/09, Brat and OntoNotes corpus formats
- Now L-BFGS is stable
- Added Snowball to the Stemmer package
- NameFinder now supports a user defined factory
- Added pluggable machine learning support
- Added a lemmatizer module
- Added Cluster, Document Begin and Clark feature generators to the Name Finder
- Added Liblinear as a Machine Learning addon
- Entity Linker now has a command line interface
- Added sequence classification support
New in OpenNLP 1.5.2 (Nov 30, 2011)
- This release contains a couple of new features, improvements and bug fixes. The maxent trainer can now run in multiple threads to utilize multi-core CPUs, configurable feature generation was added to the name finder, the perceptron trainer was refactored and improved, machine learners can now be configured with much more options via a parameter file, evaluators can print out detailed evaluation information.
- Additionally the release contains the following noteworthy changes:
- Improved the white space handling in the Sentence Detector and its training code
- Added more cross validator command line tools
- Command line handling code has been refactored
- Fixed problems with the new build
- Now uses fast token class feature generation code by default
- Added support for BioNLP/NLPBA 2004 shared task data
- Removal of old and deprecated code
- Dictionary case sensitivity support is now done properly
- Support for OSGi
New in OpenNLP 1.4.3 (Jan 26, 2009)
- Fixed thread saftey issue in name finder.
New in OpenNLP 1.4.0 (Sep 4, 2008)
- A number of updates to name finding with minor changes to a number of other components.
- Improvements in accuracy to name finding, pos-tagging, and coreference models.
- Addition of a document classification component.
- Extended support for muli-lingual processing.
- Document categorization - Allows documents to be categorized into categories.
- Updated models with slightly better data.
- Support for different types of feature generation.
- Support for dictionary and regex-based name finding.
- Improvement in speed in accuracy of default name finder.
- Restructures the parsing packages in support of work in upcoming releases.
- Minor changes to computation of semantic similarity.
- Adds processing for the Thai language and for German
- Adds support to specify the encoding for many types of processing.