从句子中提取相关日期和位置

1条回答

网友

1楼 · 发布于 2024-06-01 20:18:15

这似乎是一个命名实体识别问题。以下是相同的步骤。有关详细了解，请参阅this文章

从here下载Stanford NER
解压缩文件夹并保存到驱动器中
从文件夹中复制“stanford ner.jar”，并将其保存在文件夹外，如下图所示。
点击下面给出的“无壳”从https://stanfordnlp.github.io/CoreNLP/history.html下载无壳模型。第一个链接中的模型也可以工作，但是，无大小写模型有助于识别命名实体，即使它们没有按照形式语法规则的要求大写。
运行以下Python代码。请注意，这段代码在Windows10、64位机器上使用Python2.7版本

注意：请确保将所有路径更新为本地计算机上的路径

#Import all the required libraries.
import os
from nltk.tag import StanfordNERTagger
import pandas as pd

#Set environmental variables programmatically.
#Set the classpath to the path where the jar file is located
os.environ['CLASSPATH'] = "<your path>/stanford-ner-2015-04-20/stanford-ner.jar"
#Set the Stanford models to the path where the models are stored
os.environ['STANFORD_MODELS'] = '<your path>/stanford-corenlp-caseless-2015-04-20-models/edu/stanford/nlp/models/ner'

#Set the java jdk path. This code worked with this particular java jdk
java_path = "C:/Program Files/Java/jdk1.8.0_191/bin/java.exe"
os.environ['JAVAHOME'] = java_path


#Set the path to the model that you would like to use
stanford_classifier  =  '<your path>/stanford-corenlp-caseless-2015-04-20-models/edu/stanford/nlp/models/ner/english.muc.7class.caseless.distsim.crf.ser.gz'

#Build NER tagger object
st = StanfordNERTagger(stanford_classifier)

#A sample text for NER tagging
text = 'The man left Amsterdam on January and reached Nepal on October 21st'

#Tag the sentence and print output
tagged = st.tag(str(text).split())
print(tagged)
#[(u'The', u'O'), 
# (u'man', u'O'), 
# (u'left', u'O'), 
# (u'Amsterdam', u'LOCATION'), 
# (u'on', u'O'), 
# (u'January', u'DATE'), 
# (u'and', u'O'), 
# (u'reached', u'O'), 
# (u'Nepal', u'LOCATION'), 
# (u'on', u'O'), 
# (u'October', u'DATE'), 
# (u'21st', u'DATE')]

这种方法适用于大多数情况

相关问题更多 >

编程相关推荐

热门问题

热门文章

从句子中提取相关日期和位置

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >