Stanford CoreNLP is a free and open-source Java-based utility that provides a set of natural language analysis tools which can take raw English language text input and give the base forms of words, their parts of speech, whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, and mark up the structure of sentences in terms of phrases and word dependencies, and indicate which noun phrases refer to the same entities.
In other words, Stanford CoreNLP is an integrated suite of natural language processing tools for English in Java, including tokenization, part-of-speech tagging, named entity recognition, parsing, and coreference.
Stanford CoreNLP provides the foundational building blocks for higher level text understanding applications.
Stanford CoreNLP integrates all our NLP tools for the English language, including the part-of-speech (POS) tagger, the named entity recognizer (NER), the parser, and the coreference resolution system.
The goal of Stanford CoreNLP is to enable people to quickly and painlessly get complete linguistic annotations of natural language texts. It is designed to be highly flexible and extensible, i.e., with a single option you can change which tools should be enabled and which should be disabled.
Detailed instructions on how to install and use the Stanford CoreNLP utility on your Mac are available HERE.
Stanford CoreNLP is a cross-platform utility capable of running on any operating system that comes with Java support (e.g. Mac OS X, Windows, Linux).
What's New in This Release: [ read full changelog ]
· Fixed thread safety bugs, caseless models available.