在Python中求CSV行的总和

sortedList = csv.reader(open("keywordReport.csv")) editedFile = open("output.csv",'w') wr = csv.writer(editedFile, delimiter = ',') name = "" sortedList = sorted(sortedList, key=operator.itemgetter(0), reverse=True) newKeyword = ["","","","","",""] for row in sortedList: if row[0] != name: wr.writerow(newKeyword) name = row[0] else: newKeyword[0] = row[0] #Name newKeyword[1] = str(float(newKeyword[1]) + float(row[1])) newKeyword[2] = str(float(newKeyword[2]) + float(row[2])) newKeyword[3] = str(float(newKeyword[3]) + float(row[3]))

3条回答

网友

1楼 · 编辑于 2024-10-04 01:30:40

方法很简单：

import pandas as pd

aframe = pd.read_csv('thefile.csv')

Out[19]:
Name    Value   Value2  Value3  Rating
0   ddf 34  45  46  ok
1   ddf 67  23  11  ok
2   ghd 23  11  78  bad
3   ghd 56  33  78  bad

r = aframe.groupby(['Name','Rating'],as_index=False).sum()

Out[40]:
Name    Rating  Value   Value2  Value3
0   ddf ok  101 68  57
1   ghd bad 79  44  156

如果你需要做进一步的分析和统计，熊猫会带你走很长的路而不费吹灰之力。因为这里的用例就像使用锤子杀死苍蝇，但是我想提供这个替代方案。你知道吗

网友

2楼 · 编辑于 2024-10-04 01:30:40

你知道吗文件.csv你知道吗

Name,Value,Value2,Value3,Rating
ddf,34,45,46,ok
ddf,67,23,11,ok
ghd,23,11,78,bad
ghd,56,33,78,bad

代码

import csv

def map_csv_rows(f):
    c = [x for x in csv.reader(f)]
    return [dict(zip(c[0], map(lambda p: int(p) if p.isdigit() else p, x))) for x in c[1:]]

my_csv = map_csv_rows(open('file.csv', 'rb'))

output = {}
for row in my_csv:
    output.setdefault(row.get('Name'), {'Name': row.get('Name'), 'Value': 0,'Value2': 0, 'Value3': 0, 'Rating': row.get('Rating')})
    for val in ['Value', 'Value2', 'Value3']:
        output[row.get('Name')][val] = output[row.get('Name')][val] + row.get(val)

with open('out.csv', 'wb') as f:
    fieldnames = ['Name', 'Value', 'Value2', 'Value3', 'Rating']
    writer = csv.DictWriter(f, fieldnames = fieldnames)
    writer.writeheader()
    for out in output.values():
        writer.writerow(out)

网友

3楼 · 编辑于 2024-10-04 01:30:40

为了便于比较，等效的awk程序

$ awk -v OFS="\t" '
     NR==1{$1=$1;print;next} 
          {k=$1;a[k]+=$2;b[k]+=$3;c[k]+=$4;d[k]=$5} 
       END{for(i in a) print i,a[i],b[i],c[i],d[i]}' input

将打印

Name    Value   Value2  Value3  Rating
ddf     101     68      57      ok
ghd     79      44      156     bad

如果是csv输入，而您想要csv输出，则需要添加-F,参数并更改为OFS=,

相关问题更多 >

编程相关推荐

热门问题

热门文章