从pandas数据帧中提取字典值

f1 = pd.read_json('https://raw.githubusercontent.com/ansymo/msr2013-bug_dataset/master/data/v02/eclipse/short_desc.json') print(f1.head()) short_desc 1 [{'when': 1002742486, 'what': 'Usability issue... 10 [{'when': 1002742495, 'what': 'API - VCM event... 100 [{'when': 1002742586, 'what': 'Would like a wa... 10000 [{'when': 1014113227, 'what': 'getter/setter c... 100001 [{'when': 1118743999, 'what': 'Create Help Ind...

2条回答

网友

1楼 · 编辑于 2024-09-21 03:26:01

不要初始化数据帧并尝试将其分配给列-列应该是pd.Series。在

您应该直接指定列表理解，如下所示：

f1['desc'] = [x[0]['what'] for x in f1['short_desc']]

作为替代，我将提出一个不涉及任何lambda函数的解决方案，使用operator和pd.Series.apply：

^{pr2}$

网友

2楼 · 编辑于 2024-09-21 03:26:01

或者您可以尝试apply（PS:apply将其视为时间成本函数）

f1['short_desc'].apply(pd.Series)[0].apply(pd.Series)

Out[864]: 
                                                     what        when   who
1         Usability issue with external editors (1GE6IRL)  1002742486    21
10                 API - VCM event notification (1G8G6RR)  1002742495    10
100     Would like a way to take a write lock on a tea...  1002742586    24
10000   getter/setter code generation drops "F" in ".....  1014113227   331
100001  Create Help Index Fails with seemingly incorre...  1118743999  9571

相关问题更多 >

编程相关推荐

热门问题

热门文章