在Python中编写函数只保存最后一个字符串（Python）

import nltk import pos_tag import nltk.tokenize import numpy f = open(r'C:\Users\sample_data.txt') data = f.readlines() #Parse the text file for NER with POS Tagging for line in data: tokens = nltk.word_tokenize(line) tagged = nltk.pos_tag(tokens) #print (tagged) output = open(r"C:\Users\output3.csv", "w") output.write(str(tagged)) f.close()

[('This', 'DT'), ('is', 'VBZ'), ('a', 'DT'), ('simple', 'JJ'), ('sentence', 'NN')] [('I', 'PRP'), ('love', 'VBP'), ('this', 'DT'), ('company', 'NN'), ('.', '.'), ('This', 'DT'), ('company', 'NN'), ('is', 'VBZ'), ('so', 'RB'), ('good', 'JJ'), ('.', '.')] [('I', 'PRP'), ('am', 'VBP'), ('not', 'RB'), ('inovlved', 'VBN'), ('with', 'IN'), ('this', 'DT'), ('work', 'NN'), ('.', '.'), ('So', 'RB'), ('hard', 'JJ'), ('!', '.')] [('What', 'WP'), ('are', 'VBP'), ('you', 'PRP'), ('doing', 'VBG'), ('?', '.'), ('Are', 'NNP'), ('you', 'PRP'), ('nut', 'RB'), ('?', '.')] [('Can', 'MD'), ('I', 'PRP'), ('borrow', 'VB'), ('your', 'PRP$'), ('jar', 'NN'), ('?', '.'), ('Just', 'NNP'), ('for', 'IN'), ('today', 'NN'), ('.', '.')]

3条回答

网友

1楼 · 编辑于 2024-09-26 18:01:12

编辑：要回答原始问题，您需要在原始代码中调用循环中的output.write(str(tagged))。你知道吗

即使其他答案确实回答了这个问题，我还是想建议对您的实现进行一些更改

在处理资源时尽量使用with，因为它最终会自动关闭资源
打开文件后，只需迭代f变量即可

最终结果如下：

import nltk

# file will be closed once out of the scope
with open(r'C:\Users\sample_data.txt') as f:  
    with open(r'C:\Users\output3.csv', 'w') as output:
        for line in f:
            tokens = nltk.word_tokenize(line)
            tagged = nltk.pos_tag(tokens)
            output.write(str(tagged)+'\n')

网友

2楼 · 编辑于 2024-09-26 18:01:12

您应该将每一行保存在一个列表中，然后编写整个列表：

tagged_list = []
#Parse the text file for NER with POS Tagging
for line in data:
    tokens = nltk.word_tokenize(line)
    tagged_list.append(str(nltk.pos_tag(tokens)))

output = open(r"C:\Users\output3.csv", "w")
output.write('\n'.join(tagged_list))
output.close()

在tagged_list中，添加所有要写入的行。用'\n'.join(tagged)写它们，用'\n'分隔（即每一行）

网友

3楼 · 编辑于 2024-09-26 18:01:12

您有缩进错误。你知道吗

import nltk 
import pos_tag
import nltk.tokenize 
import numpy

f = open(r'C:\Users\sample_data.txt')
data = f.readlines()

#Parse the text file for NER with POS Tagging
for line in data:
    tokens = nltk.word_tokenize(line)
    tagged = nltk.pos_tag(tokens)
    #print (tagged)

    output = open(r"C:\Users\output3.csv", "a")
    output.write(str(tagged)+'\n')
f.close()
output.close()

相关问题更多 >

编程相关推荐

热门问题

热门文章