nlp - How to NER and POS tag a pre-tokenized text with Stanford CoreNLP?


I'm using Stanford's CoreNLP named entity recognizer (NER) and part-of-speech (POS) tagger in my application. The problem is that my code tokenizes the text beforehand, and I then need to NER and POS tag each token. However, I was only able to find out how to do that using the command-line options, not programmatically.

Can someone please tell me how I can programmatically NER and POS tag pre-tokenized text using Stanford's CoreNLP?

Edit:

I'm actually using the individual NER and POS instructions, so my code was written as instructed in the tutorials given in Stanford's NER and POS packages. I do have CoreNLP in my classpath, but I am using it by following the tutorials in the NER and POS packages.

Edit:

I just found that there are instructions on how one can set the properties for CoreNLP here: http://nlp.stanford.edu/software/corenlp.shtml. Still, I wish there were a quick way to do what I want with the Stanford NER and POS taggers so that I don't have to recode everything!

If you set the property:

tokenize.whitespace = true

then the CoreNLP pipeline will tokenize on whitespace rather than using the default PTB tokenization. You may also want to set:

ssplit.eolonly = true

so that sentences are split only on newline characters.
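As a rough sketch of how this might look programmatically: the two properties above can be put into a java.util.Properties object, and the pre-tokenized input can be re-joined so that tokens are separated by single spaces and sentences by newlines. The class name PretokenizedConfig, the helper names buildProps and joinSentences, the sample sentences, and the annotator list are all assumptions made for illustration; only tokenize.whitespace and ssplit.eolonly come from the answer above. The CoreNLP calls are left in comments so the sketch compiles without the CoreNLP jar.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Properties;
import java.util.stream.Collectors;

public class PretokenizedConfig {

    // Builds pipeline properties for pre-tokenized input.
    // The annotator list is an assumption; adjust it to the annotators you need.
    static Properties buildProps() {
        Properties props = new Properties();
        props.setProperty("annotators", "tokenize, ssplit, pos, lemma, ner");
        // Split tokens on whitespace only instead of using the PTB tokenizer.
        props.setProperty("tokenize.whitespace", "true");
        // Split sentences only at newline characters.
        props.setProperty("ssplit.eolonly", "true");
        return props;
    }

    // Joins each sentence's tokens with a single space and the sentences
    // with newlines, so the settings above preserve the original tokenization.
    static String joinSentences(List<List<String>> sentences) {
        return sentences.stream()
                .map(tokens -> String.join(" ", tokens))
                .collect(Collectors.joining("\n"));
    }

    public static void main(String[] args) {
        // Hypothetical pre-tokenized input: one inner list per sentence.
        List<List<String>> sentences = Arrays.asList(
                Arrays.asList("Barack", "Obama", "was", "born", "in", "Hawaii", "."),
                Arrays.asList("He", "lives", "in", "Washington", "."));

        Properties props = buildProps();
        String text = joinSentences(sentences);
        System.out.println(text);

        // With CoreNLP on the classpath, the next step would be roughly:
        //   StanfordCoreNLP pipeline = new StanfordCoreNLP(props);
        //   Annotation document = new Annotation(text);
        //   pipeline.annotate(document);
        // after which the POS and NER tags can be read from each token.
    }
}
```

Note that this only works if no token contains whitespace itself; such tokens would be split again by the whitespace tokenizer.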

