nlp - How to NER and POS tag a pre-tokenized text with Stanford CoreNLP? -

- July 15, 2010

i'm using stanford's corenlp named entity recognizer (ner) , part-of-speech (pos) tagger in application. problem code tokenizes text beforehand , need ner , pos tag each token. able find out how using command line options not programmatically.

can please tell me how programmatically can ner , pos tag pretokenized text using stanford's corenlp?

edit:

i'm using individual ner , pos instructions. code written instructed in tutorials given in stanford's ner , pos packages. have corenlp in classpath. have corenlp in classpath using tutorials in ner , pos packages.

edit:

i found there instructions how 1 can set properties corenlp here http://nlp.stanford.edu/software/corenlp.shtml wish if there quick way want stanford ner , pos taggers don't have recode everything!

if set property:

tokenize.whitespace = true

then corenlp pipeline tokenize on whitespace rather default ptb tokenization. may want set:

ssplit.eolonly = true

so split sentences on newline characters.

Search This Blog

Shefl

nlp - How to NER and POS tag a pre-tokenized text with Stanford CoreNLP? -

Comments

Post a Comment

Popular posts from this blog

c++ - No viable overloaded operator for references a map -

java - UML - How would you draw a try catch in a sequence diagram? -

c++ - Gamma correction doesn't look properly corrected, is this linear? -