stanford-nlp

Output results in conll format (POS-tagging, stanford pos tagger)

拟墨画扇 提交于 2019-12-06 07:20:24
问题 I am trying to use Stanford POS-tagger, I want to ask if it is possible to parse (actually only pos tag would be enough) an english text and output the results in conll format. Is there such an option? I am using the full 3.2.0 version of the Stanford pos tagger Thanks a lot 回答1: When it comes to the CONLL format, i presume you mean the CONLL2000 chunking task format as such: He PRP B-NP reckons VBZ B-VP the DT B-NP current JJ I-NP account NN I-NP deficit NN I-NP will MD B-VP narrow VB I-VP

Does Stanford Core NLP support lemmatization for German?

匆匆过客 提交于 2019-12-06 05:35:47
I found German parse and pos-tag models which are compatible with Stanford Core NLP. However I was not able to get German lemmatization working. Is there a way to do so? Since the version 3.6 is also German supported. Check it under http://stanfordnlp.github.io/CoreNLP/history.html Sorry, as far as I know no implementation of German lemmatization exists for Stanford CoreNLP. 来源: https://stackoverflow.com/questions/29861925/does-stanford-core-nlp-support-lemmatization-for-german

Cannot Initialize CoreNLP in R

自闭症网瘾萝莉.ら 提交于 2019-12-06 05:25:27
I am unable to access coreNLP in R on a Mac running High Sierra. I am uncertain what the problem is, but it seems that every time I try again to get coreNLP to work, I am faced with a different error. I have JDK 9.0.4. Please see my code below for what I am attempting to do, and the error that stops me. My previous attempt I was able to get initCoreNLP() to run and load some elements of the packages, but it would fail on others. When I then attempted to run annotateString() , it would throw the error Error Must initialize with 'int CoreNLP'! . I have downloaded and re-downloaded the coreNLP

Getting sentiment analysis result using stanford core nlp java code

吃可爱长大的小学妹 提交于 2019-12-06 05:12:09
问题 When we test it on Stanford demo page: http://nlp.stanford.edu:8080/sentiment/rntnDemo.html it gives the tree with the sentiment score of each node as below: I am trying to test it on my local system using command: H:\Drive E\Stanford\stanfor-corenlp-full-2013~>java -cp "*" edu.stanford.nlp.sen timent.Evaluate edu/stanford/nlp/models/sentiment/sentiment.ser.gz test.txt text.txt has This movie doesn't care about cleverness, wit or any other kind of intelligent humor. Those who find ugly

Stanford CoreNLP: Use partial existing annotation

怎甘沉沦 提交于 2019-12-06 04:40:31
We are trying to use existing tokenzation sentence splitting and named entity tagging while we would like to use Stanford CoreNlp to additionally provide us with part-of-speech tagging lemmatization and parsing Currently, we are trying it the following way: 1) make an annotator for "pos, lemma, parse" Properties pipelineProps = new Properties(); pipelineProps.put("annotators", "pos, lemma, parse"); pipelineProps.setProperty("parse.maxlen", "80"); pipelineProps.setProperty("pos.maxlen", "80"); StanfordCoreNLP pipeline = new StanfordCoreNLP(pipelineProps); 2) read in the sentences, with a custom

Lexicon dictionary for synonym words

萝らか妹 提交于 2019-12-06 04:28:09
问题 There are few dictionaries available for natural language processing. Like positive, negative words dictionaries etc. Is there any dictionary available which contains list of synonym for all dictionary words? Like for nice synonyms: enjoyable, pleasant, pleasurable, agreeable, delightful, satisfying, gratifying, acceptable, to one's liking, entertaining, amusing, diverting, marvellous, good; 回答1: Although WordNet is a good resource to start for finding synonym, one must note its limitations,

Does the Stanford NLP Parser have methods for semantic role labelling?

╄→尐↘猪︶ㄣ 提交于 2019-12-06 01:24:39
I'm trying to find the semantic labels of english sentences. I am using the Stanford NLP parser. Does it have methods for this? I was going through the documentation but the closest thing I could find was: CoreAnnotations.SemanticWordAnnotation CoreAnnotations.SemanticTagAnnotation No, we currently don't have a semantic role labeling (SRL) system in CoreNLP. Unless you already have a system that explicitly requires semantic role labels, I would recommend taking a look at the Universal Dependencies representation. Despite the fact that this representation is primarily a syntactic representation

Stanford NLP - OpenIE out of memory when processing list of files

余生颓废 提交于 2019-12-06 00:49:24
I'm trying to extract information from several files using the OpenIE tool from Stanford CoreNLP, it gives an out of memory error when several files are passed to the input, instead of just one. All files have been queued; awaiting termination... java.lang.OutOfMemoryError: GC overhead limit exceeded at edu.stanford.nlp.graph.DirectedMultiGraph.outgoingEdgeIterator(DirectedMultiGraph.java:508) at edu.stanford.nlp.semgraph.SemanticGraph.outgoingEdgeIterator(SemanticGraph.java:165) at edu.stanford.nlp.semgraph.semgrex.GraphRelation$GOVERNER$1.advance(GraphRelation.java:267) at edu.stanford.nlp

Using Stanford CoreNLP - Java heap space

拟墨画扇 提交于 2019-12-05 23:47:54
问题 I am trying to use the coreference module of the Stanford CoreNLP pipeline, but I end up getting an OutOfMemory error in Java. I already increased the heap size (via Run->Run Configurations->VM Arguments in Eclipse) and set them to -Xmx3g -Xms1g. I even tried -Xmx12g -Xms4g, but that didn't help either. I'm using Eclipse Juno on OS X 10.8.5 with Java 1.6 on a 64-bit machine. Does anyone have an idea what else I could try? I'm using the example code from the website (http://nlp.stanford.edu

OpenNLP: Training a custom NER Model for multiple entities

早过忘川 提交于 2019-12-05 21:11:23
I am trying training a custom NER model for multiple entities. Here is the sample training data: count all <START:item_type> operating tables <END> on the <START:location_id> third <END> <START:location_type> floor <END> count all <START:item_type> items <END> on the <START:location_id> third <END> <START:location_type> floor <END> how many <START:item_type> beds <END> are in <START:location_type> room <END> <START:location_id> 2 <END> The NameFinderME.train(.) method takes a string parameter type . What is the use of this parameter? And, how can I train a model for multiple entities (e.g.