Google Sheets API and Pandas: rows returned by the API have inconsistent lengths

2 Answers

User

Answer 1 · edited 2024-09-29 21:52:15

In case anyone is interested, here is how I solved this problem.

First, we need to get all the data from the Sheets API:

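# `service` is an authorized Sheets API client and `document` is the
# spreadsheet ID; both are created elsewhere

# define the names of the tabs I want to get
ranges = ['tab1', 'tab2']

# Call the Sheets API
request = service.spreadsheets().values().batchGet(spreadsheetId=document, ranges=ranges)
response = request.execute()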
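
For reference, the Sheets API leaves out trailing empty cells, so a row in `values` can be shorter than the header row. The sketch below uses a hand-written dictionary in the shape of a `batchGet` response (the spreadsheet ID, range, and cell values are made up for illustration) and a hypothetical helper, `short_rows`, that reports which rows are shorter than the header.

```python
# Hand-written stand-in for a batchGet response; all data here is invented.
sample_response = {
    "spreadsheetId": "example-id",
    "valueRanges": [
        {
            "range": "tab1!A1:C4",
            "majorDimension": "ROWS",
            "values": [
                ["name", "city", "notes"],  # header row, 3 columns
                ["Ann", "Oslo", "vip"],     # full row
                ["Bob", "Rome"],            # trailing empty cell omitted
                ["Cy"],                     # two trailing empty cells omitted
            ],
        }
    ],
}


def short_rows(values):
    """Return (row_index, length) pairs for rows shorter than the header."""
    if not values:
        return []
    width = len(values[0])
    return [
        (i, len(row))
        for i, row in enumerate(values[1:], start=1)
        if len(row) < width
    ]


report = short_rows(sample_response["valueRanges"][0]["values"])
print(report)  # [(2, 2), (3, 1)]
```

Rows 2 and 3 are the ones that need padding before the data can be lined up with the column headings.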

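Next, I want to go through every row and make sure each row's list has the same number of elements as the first row, which holds the column headings.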

import pandas as pd

# response is the result of the Sheets API call above.
# Each entry under "valueRanges" holds one tab: the column
# headings in the first row, followed by the data rows.

def extract_case_data(response, keyword):
    # return a DataFrame for the first range whose name contains keyword
    for obj in response["valueRanges"]:
        if keyword in obj["range"]:
            values = pad_data(obj["values"])
            df = pd.DataFrame(values[1:], columns=values[0])
            return df
    return None

Finally, here is the function that pads the data:

def pad_data(data: list) -> list:
    # the first row holds the column headings and sets the target length
    width = len(data[0])

    # build a new list, starting with the column headings;
    # this is the list which we will return
    return_data = [data[0]]

    for row in data[1:]:
        # append None to rows shorter than the heading row;
        # copy the row so the API response is not modified in place
        difference = width - len(row)
        new_row = list(row) + [None] * max(difference, 0)
        return_data.append(new_row)
    return return_data
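
As a cross-check on the padding step, here is a minimal, self-contained sketch of a more compact variant. The name `pad_rows` and the `fill` parameter are illustrative assumptions, not part of the answer's code. It builds new lists instead of modifying the input, and rows that are already long enough are copied unchanged.

```python
def pad_rows(values, fill=None):
    """Return new rows, each padded with `fill` to the header's length."""
    if not values:
        return []
    width = len(values[0])
    # [fill] * n is an empty list when n <= 0, so long rows are copied as-is
    return [list(row) + [fill] * (width - len(row)) for row in values]


raw = [["name", "city", "notes"], ["Ann", "Oslo"], ["Bob"]]
padded = pad_rows(raw)
print(padded)
print(raw[1])  # the input row is unchanged: ['Ann', 'Oslo']
```

Passing `fill=""` instead of the default pads with empty strings, which may be preferable when the columns are text.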

I am certainly not claiming this is the best or most elegant solution, but it did the job for me.

Hope this helps someone else.

User

Answer 2 · edited 2024-09-29 21:52:15

Same idea, perhaps a bit simpler.

Get the raw values:

result = service.spreadsheets().values().get(spreadsheetId=spreadsheet_id, range=data_range).execute()
raw_values = result.get('values', [])

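Then pad each row while iterating:

# expected_length is the number of columns you expect,
# for example the length of the header row
padded_values = []
for row in raw_values:
    padded_values.append(row + [''] * (expected_length - len(row)))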
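
One detail worth checking when padding inside a loop: assigning to the loop variable creates a new list each time and leaves the outer list untouched, so the padded rows have to be collected somewhere. A minimal sketch with made-up data, assuming the first row is the header and therefore sets `expected_length`:

```python
raw_values = [["a", "b", "c"], ["1", "2"], ["3"]]
expected_length = len(raw_values[0])  # assumption: first row is the header

# Rebinding the loop variable builds a new list each time,
# but raw_values still holds the original short rows.
for row in raw_values:
    row = row + [""] * (expected_length - len(row))
unchanged_lengths = [len(row) for row in raw_values]

# Collecting the padded rows into a new list keeps the result.
padded_values = [row + [""] * (expected_length - len(row)) for row in raw_values]
padded_lengths = [len(row) for row in padded_values]

print(unchanged_lengths)  # [3, 2, 1]
print(padded_lengths)     # [3, 3, 3]
```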
