无法分组和合计csv fi

with open("testingtesting6a.csv") as inf: data = [] author = 'XXXXXXXX' countAuthor = 0.0 for line in inf: line = line.split(",") if line[0] == author: countAuthor = countAuthor + float(line[1]) else: countAuthor = float(line[1]) author = line[0] # print line[0], countAuthor w = (line[0],line[1],countAuthor) print w[2] data.append(w) print data[2] # print data[0] # print type(w) # print w[2]

2条回答

网友

1楼 · 编辑于 2024-05-03 07:59:58

标准库已经涵盖了这一点。你知道吗

import collections

def sum_up(input_file):
    counter = collections.defaultdict(int)
    for line in input_file:
        parts = line.split()  # splits by any whitespace.
        if len(parts) != 2:
          continue  # skip the line that does not parse; maybe a blank line.
        name, number = parts
        counter[name] += int(number)  # you can't borrow 1.25 books.
    return counter

现在您可以：

with open('...') as f:
  counts = sum_up(f)

for name, count in sorted(counts.items()):
  print name, count  # prints counts sorted by name.

print counts['Vincent']  # prints 4.

print counts['Jane']  # prints 0.

这里的诀窍是使用^{}，一种假装对任何键都有值的dict。我们要求它有一个由int()生成的默认值，即0。你知道吗

网友

2楼 · 编辑于 2024-05-03 07:59:58

使用`strip`、groupby和Pandas删除空格：

输入文件（可选空格是有意的）：

author,books
Vincent, 1
Vincent , 1
Vincent, 1
Vincent, 1
Thomas  ,  1
Thomas,  1
Thomas,  1
Jimmy,   1
Jimmy  ,   1

import csv
import pandas as pd

fin = open('author.csv', 'r')
reader = csv.DictReader(fin, delimiter=',')

# strip remove spaces
authors=[( (d['author']).strip(), int((d['books']).strip())) for d in reader]

df = pd.DataFrame(authors)
df.columns = ['author', 'books']
df2 = (df.groupby('author').sum())
print (df2)    

         books
author        
Jimmy        2
Thomas       3
Vincent      4

# For total of books:
print (df2.books.sum())
9

使用`strip`、groupby和Pandas删除空格：

相关问题更多 >

编程相关推荐

热门问题

热门文章