将循环输出写入cs

output_dir = "C:\\Users\\Lenovo\\.spyder-py3\\NER_training" DIR = 'C:\\Users\\Lenovo\\.spyder-py3\\Testing\\' print("Loading from", output_dir) nlp2 = spacy.load(output_dir) with open('eng_productnames.csv', newline='') as myFile: reader = csv.reader(myFile) for rowz in reader: try: filenamez = rowz[1] file = open(DIR+filenamez, "r", encoding ='utf-8') filecontentszz = file.read() for s in filecontentszz: filecontentszz = re.sub(r'\s+', ' ', filecontentszz) #filecontents = filecontents.encode().decode('unicode-escape') filecontentszz = ''.join([line.lower() for line in filecontentszz]) doc2 = nlp2(filecontentszz) for ent in doc2.ents: print(filenamez, ent.label_, ent.text) break except Exception as e:`

Filename PRODUCT 07-09-18 Acknowledgement P.txt ABC1 06-22-18 Letter Req to Change.txt ABC2

2条回答

网友

1楼 · 编辑于 2024-10-03 19:31:02

您可以使用csv.writer将每一行写入输出文件，使用writerow而不是打印到屏幕上。你知道吗

output_dir = "C:\\Users\\Lenovo\\.spyder-py3\\NER_training"
DIR = 'C:\\Users\\Lenovo\\.spyder-py3\\Testing\\'
print("Loading from", output_dir)
nlp2 = spacy.load(output_dir)
with open('eng_productnames.csv', newline='') as input_file, \
        open('output.csv', 'w') as output_file:
    reader = csv.reader(input_file)
    writer = csv.writer(output_file)
    writer.writerow(["Filename", "Product"])  # this is the header row
    for rowz in reader:
        try:
            filenamez = rowz[1]
            file = open(DIR+filenamez, "r", encoding ='utf-8')
            filecontentszz = file.read()
            for s in filecontentszz:
                filecontentszz = re.sub(r'\s+', ' ', filecontentszz)
                #filecontents = filecontents.encode().decode('unicode-escape')
                filecontentszz = ''.join([line.lower() for line in filecontentszz]) 
                doc2 = nlp2(filecontentszz)
                for ent in doc2.ents:
                    writer.writerow([filenamez, ent.text])

                break

我在这里假设filenamez和ent.text在每一列中都包含您想要的信息。如果不是这样的话，那么您可以在写入CSV之前操纵它们以获得所需的内容。你知道吗

网友

2楼 · 编辑于 2024-10-03 19:31:02

有很多方法可以做到这一点。我更喜欢使用Pandas，这是一个处理CSV文件的强大库。您可以创建字典：

predicted_products = {'FILENAME': [], 'PRODUCT': []}

并将文件名和产品迭代地附加到相应的列表中。你知道吗

完成后，将预测的\u产品转换为数据帧，并调用\u csv函数：

import Pandas as pd
predicted_products_df = pd.DataFrame.from_dict(predicted_products)
predicted_products_df.to_csv('your_path/file_name.csv')

我更喜欢这种方式，因为在保存文件之前可以更轻松地编辑数据。你知道吗

对于现有代码，我假设print(filenamez, ent.label_, ent.text)打印输出。如果是，那么：

import Pandas as pd
output_dir = "C:\\Users\\Lenovo\\.spyder-py3\\NER_training"
DIR = 'C:\\Users\\Lenovo\\.spyder-py3\\Testing\\'
print("Loading from", output_dir)
nlp2 = spacy.load(output_dir)
predicted_products = {'FILENAME': [], 'PRODUCT': []}
with open('eng_productnames.csv', newline='') as myFile:
    reader = csv.reader(myFile)
    for rowz in reader:
        try:
            filenamez = rowz[1]
            file = open(DIR+filenamez, "r", encoding ='utf-8')
            filecontentszz = file.read()
            for s in filecontentszz:
                filecontentszz = re.sub(r'\s+', ' ', filecontentszz)
                #filecontents = filecontents.encode().decode('unicode-escape')
                filecontentszz = ''.join([line.lower() for line in filecontentszz]) 
                doc2 = nlp2(filecontentszz)
                for ent in doc2.ents:
                    print(filenamez, ent.label_, ent.text)
                    predicted_products['FILENAME'].append(filenamez + ' ' + ent.label_)
                    predicted_products['PRODUCT'].append(ent.text)
                break

        except Exception as e:

predicted_products_df = pd.DataFrame.from_dict(predicted_products)
predicted_products_df.to_csv('your_path/file_name.csv')

相关问题更多 >

编程相关推荐

热门问题

热门文章