如何将规范化的json数据从多个文件导入到一个数据帧中？

import os from glob import glob PATH = 'dir/filepath' files = [y for x in os.walk(PATH) for y in glob(os.path.join(x[0], 'file*'))] for file in files: with open(issuefile, 'r') as f: data = f.read() data_json = json_normalize(json.loads(data)) type = ' '.join(issuefile.split('/')[3] data_json['type'] = type # append to data frame for typeA and typeB if 'typeA' in type: # append to typeA dataframe else: # append to typeB dataframe

1条回答

网友

1楼 · 发布于 2024-09-29 02:27:05

我认为在将这些文件读入pandas之前，应该首先将它们连接在一起，下面是如何在bash中实现（也可以在Python中实现）：

cat `find *typeA` > typeA
cat `find *typeB` > typeB

然后可以使用io.json.json_normalize将其导入熊猫：

import json
with open('typeA') as f:
    data = [json.loads(l) for l in f.readlines()]
    dfA = pd.io.json.json_normalize(data)

dfA

#          that this.first this.second
# 0  otherthing      thing       thing
# 1  otherthing      thing       thing
# 2  otherthing      thing       thing

相关问题更多 >

编程相关推荐

热门问题

热门文章