Cross-referencing DataFrames inside apply

import pandas as pd

def itemcounts(row):
    # ok this works?
    # return b[b['id'] == 1234]['amount'].sum()
    # each a['quantity'] gets set to 4 or whatever the sum for 1234 is.

    # and this does?
    # return row['id']
    # a['quantity'] get set to whatever row's 'id' is.

    # but this doesn't
    id = row['id']
    return b[b['id'] == id]['amount'].sum()
    # a['quantity'] is 0.

a = pd.read_csv('a.csv')
b = pd.read_csv('b.csv')
a['quantity'] = a.apply(itemcounts, axis=1)
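One thing worth checking when the hard-coded 1234 matches but row['id'] does not is whether the two id columns have the same dtype. The post does not confirm that this is the cause; the sketch below only shows, with made-up in-memory frames standing in for a.csv and b.csv, how ids parsed as strings in one frame and as integers in the other produce a sum of 0 for every row.

```python
import pandas as pd

# Hypothetical stand-ins for a.csv and b.csv (assumption: a's ids were
# parsed as strings, b's ids as integers).
a = pd.DataFrame({'id': ['1234', '1235'], 'name': ['R', 'Python']})
b = pd.DataFrame({'id': [1234, 1234, 1234], 'amount': [1, 1, 2]})

def itemcounts(row):
    item_id = row['id']  # avoid shadowing the built-in id()
    return b[b['id'] == item_id]['amount'].sum()

# Inspect the dtypes first; a mismatch here is the thing to look for.
print(a['id'].dtype, b['id'].dtype)

# A string never equals an integer, so the filter is empty and the sum is 0.
before = a.apply(itemcounts, axis=1)

# After casting both columns to the same dtype the lookup matches.
a['id'] = a['id'].astype('int64')
after = a.apply(itemcounts, axis=1)

print(before.tolist(), after.tolist())
```

If the dtypes already agree, stray whitespace in the CSV header or values would be the next thing to rule out.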

2 answers

User

#1 · Edited 2024-10-01 19:30:22

Try this:

df = pd.DataFrame({'id' : [1234, 1235, 1236], 'name' : ['R', 'Python', 'Pandas']})

     id    name
0  1234       R
1  1235  Python
2  1236  Pandas

df1 = pd.DataFrame({'id' : [1234, 1234, 1234, 1234, 1234, 1235, 1235, 1236], 'amount' : [1, 1, 2, 1, 2, 2, 1, 1]})

   amount    id
0       1  1234
1       1  1234
2       2  1234
3       1  1234
4       2  1234
5       2  1235
6       1  1235
7       1  1236

# Sum amount per id, then align the totals to df by id rather than by row position
df['quantity'] = df['id'].map(df1.groupby('id')['amount'].sum())

     id    name  quantity
0  1234       R         7
1  1235  Python         3
2  1236  Pandas         1
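The same result can also be reached with a left merge, which matches rows on the id values themselves. This is a minimal sketch, not part of the original answer; the names totals and merged are introduced here for illustration.

```python
import pandas as pd

df = pd.DataFrame({'id': [1234, 1235, 1236],
                   'name': ['R', 'Python', 'Pandas']})
df1 = pd.DataFrame({'id': [1234, 1234, 1234, 1234, 1234, 1235, 1235, 1236],
                    'amount': [1, 1, 2, 1, 2, 2, 1, 1]})

# One row per id with the summed amount, renamed to the target column.
totals = (df1.groupby('id', as_index=False)['amount'].sum()
             .rename(columns={'amount': 'quantity'}))

# how='left' keeps every row of df, even ids that never appear in df1.
merged = df.merge(totals, on='id', how='left')

# Ids missing from df1 come back as NaN; treat them as a quantity of 0.
merged['quantity'] = merged['quantity'].fillna(0).astype('int64')

print(merged)
```

Because the join is on id, the order of the rows in either frame does not matter.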

User

#2 · Edited 2024-10-01 19:30:22

This script worked for me:

import pandas as pd
a = pd.read_csv('a.csv')
b = pd.read_csv('b.csv')

a['Quantity'] = a['id'].apply(lambda x: b[b.id == x].amount.sum())

Using a lambda in apply passes each value of the 'id' column to the function as x.

Given a:

    id    name
0  1234       r        
1  1235  Python       
2  1236   Panda

and b:

     id  amount   
0  1234       1
1  1234       1
2  1234       2
3  1236       1
4  1236       1

it returns:

        id    name  Quantity
0     1234       r         4
1     1235  Python         0
2     1236   Panda         2
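The lambda filters b once for every row of a, which can get slow on large frames. A possible alternative, sketched here with the same a and b as above, is to sum b once per id and look the totals up with map. The name totals is introduced for illustration, and the fillna(0) step is what reproduces the 0 for id 1235, which does not appear in b.

```python
import pandas as pd

a = pd.DataFrame({'id': [1234, 1235, 1236],
                  'name': ['r', 'Python', 'Panda']})
b = pd.DataFrame({'id': [1234, 1234, 1234, 1236, 1236],
                  'amount': [1, 1, 2, 1, 1]})

# Sum amount once per id; the result is a Series indexed by id.
totals = b.groupby('id')['amount'].sum()

# Look each id of a up in totals; ids absent from b give NaN, so fill with 0.
a['Quantity'] = a['id'].map(totals).fillna(0).astype('int64')

# The per-row lambda from the answer, kept only for comparison.
via_apply = a['id'].apply(lambda x: b[b.id == x].amount.sum())

print(a)
```

Both approaches give the same Quantity column for this data; the map version only scans b once.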
