python中的CSV模块newlin问题

2024-10-01 00:20:29 发布

您现在位置:Python中文网/ 问答频道 /正文

我有一个csv文件,数据如下:

"field1"|"field2"|"field3"
"12ed"|"ksdk"|"sjdhs"
"1323"|"jdjsk
sjfsk"|"sk"k"sd"

我的预期产出

^{pr2}$

我的两个问题在第三行。其中数据在双引号csv文件中包含双引号,它应该在最终输出中返回该文件。以及列值中的新行/换行符。都在第三行找到了。在

因为我将数据读作“QUOTE_NONE”,所以我可以返回[1:-1]数据,但不能用空值替换新行。在

with open(fileIn, "rb") as input:
    with open(fileOut,'wb') as output:
        w = csv.writer(output, delimiter='|',quoting=csv.QUOTE_NONE,quotechar='')
        for record in csv.reader(input, delimiter='|',quoting=csv.QUOTE_NONE):
            #r = map(lambda x: x.replace("\n",""), record) --> This is not working
            print([s[1:-1] for s in record])
            w.writerow([s[1:-1] for s in record])

使用这段代码,我能够处理引号(第一个和最后一个)并在数据中保留引号。但我没法处理newline。在

已更新-

csv文件内容:

"id"|"comments"|"Date"
"B-7"|"Hi How . 


Are You."|"2017-03-15 13:53:23.727"
"8-C"|"How was "your day" today"|"2017-02-06 11:45:26.783"

错误:-

['"id"', '"comments"', '"Date"']
['"B-7"', '"Hi How . ']
[]
Traceback (most recent call last):
File "try.py", line 23, in <module>
appendRecords(record, oldRecord)
File "try.py", line 8, in appendRecords
oldRecord[-1] = oldRecord[-1] + ' ' + record[0]
IndexError: list index out of range

仅供参考-Im使用2.6.6版


Tags: 文件csv数据innoneforinputas
1条回答
网友
1楼 · 发布于 2024-10-01 00:20:29

一个选项是添加一个检查,如果一行的最后一列不是以"结尾,那么不要将其写入输出文件,而是将下一行合并到它,然后将其写入输出文件。在

Merge是一个list.extend,除了第一个列表的最后一个元素和最后一个列表的第一个元素也被连接在一起。在

此代码应适用于您:

def appendRecords(record, oldRecord):
    # Check to guard against empty lines in the input csv file
    if len(record):
        oldRecord[-1] = oldRecord[-1] + ' ' + record[0]
        record.pop(0)
        oldRecord.extend(record)



with open(fileIn, "rb") as input:
    with open(fileOut,'wb') as output:
        w = csv.writer(output, delimiter='|',quoting=csv.QUOTE_NONE,quotechar='')
        oldRecord = None
        for record in csv.reader(input, delimiter='|',quoting=csv.QUOTE_NONE):
            if oldRecord is not None:
                appendRecords(record, oldRecord)
                record = oldRecord

            if record[-1].endswith('"'):
                print([s[1:-1] for s in record])
                w.writerow([s[1:-1] for s in record])
                oldRecord = None
            else:
                oldRecord = record

相关问题 更多 >