在按多个其他列分组的情况下，计算dataframe列中的唯一提及次数

body score id created subreddit type mentions 3860 There are much better industrials stocks than ... 1 NaN 2021-03-13 20:32:08+00:00 stocks comment {GE} 3776 I guy I work with told me about PENN about 9 m... 1 NaN 2021-03-13 20:29:30+00:00 investing comment {PENN} 4122 [mp4 link](https://preview.redd.it/ieae3z7suum... 2 NaN 2021-03-13 20:28:43+00:00 StockMarket comment {KB} 2219 If you cant decide, then just buy $GME options 1 NaN 2021-03-13 20:28:12+00:00 wallstreetbets comment {GME} 2229 This sub the most wholesome fucking thing in t... 2 NaN 2021-03-13 20:27:57+00:00 wallstreetbets comment {GME}

ticker subreddit type count GME wallstreetbets comment 5 GME wallstreetbets title 4 GME investing comment 3 GME investing title 2

1条回答

网友

1楼 · 发布于 2024-09-26 18:19:56

听起来像一个简单的groupby应该做到：

df.groupby(['mentions','subreddit','type']).count()

产生

                                        body    score   id  created
mentions    subreddit        type               
{GE}        stocks           comment    1       1       0   1
{GME}       wallstreetbets   comment    2       2       0   2
{KB}        StockMarket      comment    1       1       0   1
{PENN}      investing        comment    1       1       0   1

相关问题更多 >

编程相关推荐

热门问题

热门文章