我有一个装满csv文件的文件夹,每个文件看起来像这样:
TPN,201203,by the congress,3,0.000001559523908541200542298447130
TPN,201312,by the congress,2,0.000001975728179317089554819047995
TPN,201308,by the congress,2,0.000002130556224313481520620588417
CR,200910,by the congress,10,0.000001254229103759238181242376639
CR,200911,by the congress,5,6.974221464170843876612631794E-7
MED,200507,by the congress,2,0.000004113271264069958517659301854
我想要一个脚本,它遍历每个文件并找到该文件中的最小日期值,然后将该文件中包含该日期值的每一行打印到一个新文件中(因此,如果两行具有相同的日期值,则应该同时打印这两行)。我有这个:
import csv
import os
import codecs
import unicodecsv
folder = '/Users/xyz/Desktop/TextAnalysis/PointsOfOrigin/trigramsdated/'
c = csv.writer(open("trigrampointsoforigin.csv", "a"))
for file in os.listdir (folder):
with open(os.path.join(folder, file), mode='rU') as f:
m=min(int(line[1]) for line in unicodecsv.reader(f, encoding='utf-8', errors='replace'))
f.seek(0)
for line in unicodecsv.reader(f):
if int(line[1])==m:
print line
c.writerow(line)
print "All done."
但出于某种奇怪的原因,它只是不断地将每个csv中的最后一行打印到“t”rigramspointsoforign.csv文件“文件。你知道吗
任何帮助都非常感谢。你知道吗
line[1]
是否实际打印出日期值?在任何情况下,您都可以避免一个内部循环:你需要
import copy
。你知道吗相关问题 更多 >
编程相关推荐