嵌套列表到数据帧的dict

{'user_1': {'postID_1': ['#fitfam', '#gym', '#bro'], 'postID_2': ['#swol', '#anotherhashtag']}, 'user_2': {'postID_78': ['#ripped', '#bro', '#morehashtags'], 'postID_1': ['#buff', '#othertags']}, 'user_3': ...and so on }

+------------+------------+--------+-----+-----+------+-----+ | UserID_key | PostID_key | fitfam | gym | bro | swol | ... | +------------+------------+--------+-----+-----+------+-----+ | user_1 | postID_1 | 1 | 1 | 1 | 0 | ... | | user_1 | postID_2 | 0 | 0 | 0 | 1 | ... | | user_2 | postID_78 | 0 | 0 | 1 | 0 | ... | | user_2 | postID_1 | 0 | 0 | 0 | 0 | ... | | user_3 | ... | ... | ... | ... | ... | ... | +------------+------------+--------+-----+-----+------+-----+

1条回答

网友

1楼 · 发布于 2024-09-28 21:02:46

在my answer to another question的基础上，可以使用pd.concat构建和连接子帧，然后使用stack和get_dummies：

(pd.concat({k: pd.DataFrame.from_dict(v, orient='index') for k, v in dct.items()})
   .stack()
   .str.get_dummies()
   .sum(level=[0, 1]))

                  #anotherhashtag  #bro  #buff  #fitfam  #gym  #morehashtags  #othertags  #ripped  #swol
user_1 postID_1                 0     1      0        1     1              0           0        0      0
       postID_2                 1     0      0        0     0              0           0        0      1
user_2 postID_78                0     1      0        0     0              1           0        1      0
       postID_1                 0     0      1        0     0              0           1        0      0

相关问题更多 >

编程相关推荐

热门问题

热门文章