计算一个列在Pandas中包含某个值的次数

网友

1楼 · 编辑于 2024-09-29 00:13:00

使用collections.Counter+itertools.chain：

from collections import Counter
from itertools import chain

c = Counter(chain.from_iterable(df['column_name'].str.split('|')))

res = pd.Series(c)

print(res)

book        3
campfire    1
fish        2
icecream    1
dtype: int64

网友

2楼 · 编辑于 2024-09-29 00:13:00

将^{}与^{}一起用于Series：

a = df['column_name'].str.split('|', expand=True).stack().value_counts()
print (a)
book        3
fish        2
icecream    1
campfire    1
dtype: int64

或者Counter使用列表理解和扁平化：

^{pr2}$

网友

3楼 · 编辑于 2024-09-29 00:13:00

`pd.value_counts`

也可以将列表传递给value_counts函数。注I join除以|，然后再除以|。在

pd.value_counts('|'.join(df.column_name).split('|'))

book        3
fish        2
icecream    1
campfire    1
dtype: int64

`get_dummies`

这是因为数据是用|作为分隔符的。如果有不同的分隔符，请将其传递给get_dummies调用df.column_name.str.get_dummies(sep='|').sum()

^{pr2}$

如果你想把结果排序

df.column_name.str.get_dummies().sum().sort_values(ascending=False)

book        3
fish        2
icecream    1
campfire    1
dtype: int64

`pd.factorize`和{}

请注意，我join整个列并再次拆分。在

f, u = pd.factorize('|'.join(df.column_name).split('|'))
pd.Series(np.bincount(f), u)

book        3
fish        2
icecream    1
campfire    1
dtype: int64

要排序，我们可以像上面那样使用sort_values。或者这个

f, u = pd.factorize('|'.join(df.column_name).split('|'))
counts = np.bincount(f)
a = counts.argsort()[::-1]
pd.Series(counts[a], u[a])

book        3
fish        2
campfire    1
icecream    1
dtype: int64

`pd.value_counts`

`get_dummies`

`pd.factorize`和{}

相关问题更多 >

编程相关推荐

热门问题

热门文章

计算一个列在Pandas中包含某个值的次数

pd.value_counts

get_dummies

pd.factorize和{}

相关问题 更多 >

编程相关推荐

热门问题

热门文章

`pd.value_counts`

`get_dummies`

`pd.factorize`和{}

相关问题更多 >