-
-
Save saravananpsg/c8bda8fce2a1b9a5cd8f5b2a065c227f to your computer and use it in GitHub Desktop.
Revisions
-
charlesBochet revised this gist
Apr 10, 2018 . 1 changed file with 2 additions and 2 deletions.There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode charactersOriginal file line number Diff line number Diff line change @@ -9,13 +9,13 @@ os.environ['JAVA_HOME'] = java_path sentence = u"La première Falcon Heavy de l'entreprise SpaceX, " \ "la plus puissante fusée des Etats-Unis jamais " \ "lancée depuis plus de quarante ans, devrait bien " \ "emporter le roadster de l'entrepreneur américain, " \ "mais sur une orbite bien différente. Elon Musk a le sens du spectacle." jar = './stanford-ner-tagger/stanford-ner.jar' model = './stanford-ner-tagger/trained-ner-model-french.ser.gz' ner_tagger = StanfordNERTagger(model, jar, encoding='utf8') -
charlesBochet revised this gist
Dec 4, 2017 . 1 changed file with 4 additions and 7 deletions.There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode charactersOriginal file line number Diff line number Diff line change @@ -8,19 +8,16 @@ java_path = "/usr/lib/jvm/java-8-oracle" os.environ['JAVA_HOME'] = java_path sentence = u"La première Falcon Heavy de l'entreprise SpaceX, " \ "la plus puissante fusée américaine jamais " \ "lancée depuis plus de quarante ans, devrait bien " \ "emporter le roadster de l'entrepreneur américain, " \ "mais sur une orbite bien différente. Elon Musk a le sens du spectacle." jar = './stanford-ner-tagger/stanford-ner.jar' model = './stanford-ner-tagger/ner-model-french.ser' ner_tagger = StanfordNERTagger(model, jar, encoding='utf8') words = nltk.word_tokenize(sentence) print(ner_tagger.tag(words)) -
charlesBochet created this gist
Dec 4, 2017 .There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode charactersOriginal file line number Diff line number Diff line change @@ -0,0 +1,26 @@ # coding: utf-8 import nltk from nltk.tag.stanford import StanfordNERTagger # Optional import os java_path = "/usr/lib/jvm/java-8-oracle" os.environ['JAVA_HOME'] = java_path sentence = u"La première Falcon Heavy de l'entreprise SpaceX, la plus puissante fusée américaine jamais " \ "lancée depuis plus de quarante ans, devrait bien emporter le roadster de l'entrepreneur américain, " \ "mais sur une orbite bien différente. Elon Musk a le sens du spectacle." jar = './stanford-ner-tagger/stanford-ner.jar' model = './stanford-ner-tagger/ner-model-french.ser' # Load NER Tagger with english model ner_tagger = StanfordNERTagger(model, jar, encoding='utf8') # Split sentence into words words = nltk.word_tokenize(sentence) # Tag words print(ner_tagger.tag(words))