-
Notifications
You must be signed in to change notification settings - Fork 2.7k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
915 changed files
with
153,478 additions
and
100,719 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,17 +1,17 @@ | ||
Stanford CoreNLP | ||
================ | ||
|
||
Stanford CoreNLP provides a set of natural language analysis tools written in Java. It can take raw human language text input and give the base forms of words, their parts of speech, whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, and mark up the structure of sentences in terms of phrases and word dependencies, and indicate which noun phrases refer to the same entities. It was originally developed for English, but now also provides varying levels of support for Arabic, (mainland) Chinese, French, German, and Spanish. Stanford CoreNLP is an integrated framework, which make it very easy to apply a bunch of language analysis tools to a piece of text. Starting from plain text, you can run all the tools on it with just two lines of code. Its analyses provide the foundational building blocks for higher-level and domain-specific text understanding applications. Stanford CoreNLP is a set of stable and well-tested natural language processing tools, widely used by various groups in academia, government, and industry. | ||
Stanford CoreNLP provides a set of natural language analysis tools written in Java. It can take raw human language text input and give the base forms of words, their parts of speech, whether they are names of companies, people, etc., normalize and interpret dates, times, and numeric quantities, mark up the structure of sentences in terms of phrases or word dependencies, and indicate which noun phrases refer to the same entities. It was originally developed for English, but now also provides varying levels of support for (Modern Standard) Arabic, (mainland) Chinese, French, German, and Spanish. Stanford CoreNLP is an integrated framework, which make it very easy to apply a bunch of language analysis tools to a piece of text. Starting from plain text, you can run all the tools with just two lines of code. Its analyses provide the foundational building blocks for higher-level and domain-specific text understanding applications. Stanford CoreNLP is a set of stable and well-tested natural language processing tools, widely used by various groups in academia, industry, and government. The tools variously use rule-based, probabilistic machine learning, and deep learning components. | ||
|
||
The Stanford CoreNLP code is written in Java and licensed under the GNU General Public License (v3 or later). Note that this is the full GPL, which allows many free uses, but not its use in proprietary software that you distribute. | ||
The Stanford CoreNLP code is written in Java and licensed under the GNU General Public License (v3 or later). Note that this is the full GPL, which allows many free uses, but not its use in proprietary software that you distribute to others. | ||
|
||
You can find releases of Stanford CoreNLP on [Maven Central](http:https://search.maven.org/#browse%7C11864822). | ||
|
||
You can find more explanation and documentation on [the Stanford CoreNLP homepage](http:https://nlp.stanford.edu/software/corenlp.shtml#Demo). | ||
|
||
The most recent models associated with the code in the HEAD of this repository can be found [here](http:https://nlp.stanford.edu/software/stanford-corenlp-models-current.jar). | ||
|
||
For information about making contributions to Stanford CoreNLP, see the file `CONTRIBUTING.md`. | ||
For information about making contributions to Stanford CoreNLP, see the file [CONTRIBUTING.md](CONTRIBUTING.md). | ||
|
||
Questions about CoreNLP can either be posted on StackOverflow with the tag [stanford-nlp](http:https://stackoverflow.com/questions/tagged/stanford-nlp), | ||
or on the [mailing lists](http:https://nlp.stanford.edu/software/corenlp.shtml#Mail). |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,106 @@ | ||
* NN Number=Sing | ||
* NNP Number=Sing | ||
* NNS Number=Plur | ||
* NNPS Number=Plur | ||
* VBZ Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Mood=Ind | ||
* VBD VerbForm=Fin|Mood=Ind|Tense=Past | ||
* VBN Tense=Past|VerbForm=Part | ||
* VBP VerbForm=Fin|Mood=Ind|Tense=Pres | ||
* MD VerbForm=Fin | ||
* JJ Degree=Pos | ||
* JJR Degree=Cmp | ||
* JJS Degree=Sup | ||
* CD NumType=Card | ||
am VBP VerbForm=Fin|Mood=Ind|Tense=Pres|Person=1|Number=Sing | ||
was VBD VerbForm=Fin|Mood=Ind|Tense=Past|Number=Sing | ||
i PRP Number=Sing|Person=1|PronType=Prs|Case=Nom | ||
you PRP Person=2|PronType=Prs | ||
he PRP Number=Sing|Person=3|Gender=Masc|PronType=Prs|Case=Nom | ||
she PRP Number=Sing|Person=3|Gender=Fem|PronType=Prs|Case=Nom | ||
it PRP Number=Sing|Person=3|Gender=Neut|PronType=Prs | ||
we PRP Number=Plur|Person=1|PronType=Prs|Case=Nom | ||
they PRP Number=Plur|Person=3|PronType=Prs|Case=Nom | ||
me PRP Number=Sing|Person=1|PronType=Prs|Case=Acc | ||
him PRP Number=Sing|Person=3|Gender=Masc|PronType=Prs|Case=Acc | ||
her PRP Number=Sing|Person=3|Gender=Fem|PronType=Prs|Case=Acc | ||
us PRP Number=Plur|Person=1|PronType=Prs|Case=Acc | ||
them PRP Number=Plur|Person=3|PronType=Prs|Case=Acc | ||
my PRP$ Number=Sing|Person=1|Poss=Yes|PronType=Prs | ||
mine PRP$ Number=Sing|Person=1|Poss=Yes|PronType=Prs | ||
your PRP$ Person=2|Poss=Yes|PronType=Prs | ||
yours PRP$ Person=2|Poss=Yes|PronType=Prs | ||
his PRP$ Number=Sing|Person=3|Gender=Masc|Poss=Yes|PronType=Prs | ||
her PRP$ Number=Sing|Person=3|Gender=Fem|Poss=Yes|PronType=Prs | ||
hers PRP$ Number=Sing|Person=3|Gender=Fem|Poss=Yes|PronType=Prs | ||
its PRP$ Number=Sing|Person=3|Gender=Neut|Poss=Yes|PronType=Prs | ||
our PRP$ Number=Plur|Person=1|Poss=Yes|PronType=Prs | ||
ours PRP$ Number=Plur|Person=1|Poss=Yes|PronType=Prs | ||
their PRP$ Number=Plur|Person=3|Poss=Yes|PronType=Prs | ||
theirs PRP$ Number=Plur|Person=3|Poss=Yes|PronType=Prs | ||
myself PRP Number=Sing|Person=1|PronType=Prs | ||
yourself PRP Number=Sing|Person=2|PronType=Prs | ||
himself PRP Number=Sing|Person=3|Gender=Masc|PronType=Prs | ||
herself PRP Number=Sing|Person=3|Gender=Fem|PronType=Prs | ||
itself PRP Number=Sing|Person=3|Gender=Neut|PronType=Prs | ||
ourselves PRP Number=Plur|Person=1|PronType=Prs | ||
yourselves PRP Number=Plur|Person=2|PronType=Prs | ||
themselves PRP Number=Plur|Person=3|PronType=Prs | ||
the DT Definite=Def|PronType=Art | ||
a DT Definite=Ind|PronType=Art | ||
an DT Definite=Ind|PronType=Art | ||
this DT PronType=Dem|Number=Sing | ||
that DT PronType=Dem|Number=Sing | ||
these DT PronType=Dem|Number=Plur | ||
those DT PronType=Dem|Number=Plur | ||
here RB PronType=Dem | ||
there RB PronType=Dem | ||
then RB PronType=Dem | ||
whose WP$ Poss=Yes | ||
hard RB Degree=Pos | ||
fast RB Degree=Pos | ||
late RB Degree=Pos | ||
long RB Degree=Pos | ||
high RB Degree=Pos | ||
easy RB Degree=Pos | ||
early RB Degree=Pos | ||
far RB Degree=Pos | ||
soon RB Degree=Pos | ||
low RB Degree=Pos | ||
close RB Degree=Pos | ||
well RB Degree=Pos | ||
badly RB Degree=Pos | ||
little RB Degree=Pos | ||
harder RBR Degree=Cmp | ||
faster RBR Degree=Cmp | ||
later RBR Degree=Cmp | ||
longer RBR Degree=Cmp | ||
higher RBR Degree=Cmp | ||
easier RBR Degree=Cmp | ||
quicker RBR Degree=Cmp | ||
earlier RBR Degree=Cmp | ||
further RBR Degree=Cmp | ||
farther RBR Degree=Cmp | ||
sooner RBR Degree=Cmp | ||
slower RBR Degree=Cmp | ||
lower RBR Degree=Cmp | ||
closer RBR Degree=Cmp | ||
better RBR Degree=Cmp | ||
worse RBR Degree=Cmp | ||
less RBR Degree=Cmp | ||
hardest RBS Degree=Sup | ||
fastest RBS Degree=Sup | ||
latest RBS Degree=Sup | ||
longest RBS Degree=Sup | ||
highest RBS Degree=Sup | ||
easiest RBS Degree=Sup | ||
quickest RBS Degree=Sup | ||
earliest RBS Degree=Sup | ||
furthest RBS Degree=Sup | ||
farthest RBS Degree=Sup | ||
soonest RBS Degree=Sup | ||
slowest RBS Degree=Sup | ||
lowest RBS Degree=Sup | ||
closest RBS Degree=Sup | ||
best RBS Degree=Sup | ||
worst RBS Degree=Sup | ||
least RBS Degree=Sup |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.