使用枚举在数据帧中进行For循环

2条回答

网友

1楼 · 编辑于 2024-09-29 01:18:55

我建议使用^{}来获取每个组的值
在大多数情况下，对熊猫使用for-loop表示可能没有正确或有效地使用
额外资源：
- Fast, Flexible, Easy and Intuitive: How to Speed Up Your Pandas Projects
- Stack Overflow Pandas Tag Info Page

备选案文1：

import pandas as pd
import numpy as np
import random

np.random.seed(365)
random.seed(365)
rows = 25
data = {'n': [random.choice(['A', 'B', 'C']) for _ in range(rows)],
        'v1': np.random.randint(40, size=(rows)),
        'v2': np.random.randint(40, size=(rows))}

df = pd.DataFrame(data)

# groupby n
for g, d in df.groupby('n'):
#     print(g)               # use or not, as needed
    print(d.v1.values[0])    # selects the first value of each group and prints it

[out]:  # first value of each group
5
33
18

备选案文2：

dfg = df.groupby(['n'], as_index=False).agg({'v1': list})

# display(dfg)
   n                                   v1
0  A  [5, 26, 39, 39, 10, 12, 13, 11, 28]
1  B      [33, 34, 28, 31, 27, 24, 36, 6]
2  C        [18, 27, 9, 36, 35, 30, 3, 0]

备选案文3：

如注释中所述，您的数据已经是groupby的结果，并且每个组的列中只有一个值

dfg = df.groupby('n', as_index=False).sum()

# display(dfg)

   n   v1   v2
0  A  183  163
1  B  219  188
2  C  158  189

# print the value for each group in v1
for v in dfg.v1.to_list():
    print(v)

[out]:
183
219
158

备选案文4：

打印每列的所有行

dfg = df.groupby('n', as_index=False).sum()

for col in dfg.columns[1:]:  # selects all columns after n
    for v in dfg[col].to_list():
        print(v)

[out]:
183
219
158
163
188
189

网友

2楼 · 编辑于 2024-09-29 01:18:55

我同意@Trenton的评论，即使用数据帧的全部目的是避免像这样循环通过它们。使用函数重新思考这个问题。然而，让你所写的东西发挥作用的最接近的方法是：

Segment_list = df['Name1'].unique()
for Index in Segment_list:
    print(df['Value1'][df['Name1']==Index]).iloc[0]

如果Name有两个条目（可能是因为您使用了.unique()，所以可能会发生这种情况），这将打印值的总和，具体取决于您希望发生的情况：

df.groupby('Name1').sum()['Value1']

备选案文1：

备选案文2：

备选案文3：

备选案文4：

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用枚举在数据帧中进行For循环

备选案文1：

备选案文2：

备选案文3：

备选案文4：

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >