Renaming column names that change over time

User

Reply #1 · edited 2024-10-04 09:20:53

I would create a column-mapper dictionary that you can add to over time:

col_map = {
  "postcode": "PostCode",
  "brands": "brand",
}

col_order = ["PostCode", "brand"]

renamed_df = df.rename(columns=lambda x: col_map.get(x, x))  # <- renames the cols to the dict values
output = renamed_df.reindex(columns=col_order)  # <- reorders the cols based on the config list

Note that col_map.get(x, x) falls back to the column name as given when it is not in the mapping, e.g. "brand" stays "brand".

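If you would rather have unexpected columns stand out so that you can spot the problem and update col_map, you can use df.columns.map(col_map) instead. Be aware that this does not raise on its own: names missing from the dict come back as NaN, so check the result for missing values.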
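If failing loudly on unexpected names is the goal, one option is an explicit check before renaming. The following is a minimal sketch, assuming the col_map and col_order shown above; the standardize helper and the sample frame are hypothetical and only there for illustration.

```python
import pandas as pd

# Mapping and order from the answer; extend col_map as new spellings appear.
col_map = {"postcode": "PostCode", "brands": "brand"}
col_order = ["PostCode", "brand"]


def standardize(df, col_map, col_order):
    """Rename known columns and raise on names the mapping does not cover."""
    # A column is acceptable if it is a known variant or already canonical.
    known = set(col_map) | set(col_map.values())
    unknown = [c for c in df.columns if c not in known]
    if unknown:
        raise KeyError(f"Unmapped columns, update col_map: {unknown}")
    renamed = df.rename(columns=col_map)
    return renamed.reindex(columns=col_order)


# Hypothetical sample data.
df = pd.DataFrame({"brands": ["exp1"], "postcode": ["abde"]})
output = standardize(df, col_map, col_order)
print(list(output.columns))  # ['PostCode', 'brand']
```

A frame with a header such as "zip" would raise a KeyError here instead of silently passing through, which makes it obvious when col_map needs a new entry.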

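User

Reply #2 · edited 2024-10-04 09:20:53

There is no definitive answer to this question; it all depends on how much the headers vary.

Let's imagine that order and plurals are the only things that change. You can map a cleaning function over the column names and then sort the columns: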

def clean_name(s):
    # make lowercase
    s = s.lower()
    # remove trailing 's'
    s = s.rstrip('s')
    return s

df.columns = df.columns.map(clean_name)
df = df.sort_index(axis=1)

Example input:

  PostCode brands
0     abde   exp1

Output:

  brand postcode
0  exp1     abde
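To show why this helps with files collected over time, here is a small sketch that normalizes two frames whose headers differ in case, plural and order, then stacks them. The normalize helper and both sample frames are hypothetical; clean_name behaves like the function above.

```python
import pandas as pd


def clean_name(s):
    # make lowercase, then strip trailing 's' characters
    return s.lower().rstrip("s")


def normalize(df):
    """Return a copy with cleaned, alphabetically sorted column names."""
    out = df.copy()
    out.columns = out.columns.map(clean_name)
    return out.sort_index(axis=1)


# Two hypothetical extracts with inconsistent headers.
old = pd.DataFrame({"PostCode": ["abde"], "brands": ["exp1"]})
new = pd.DataFrame({"brand": ["exp2"], "postcode": ["fghi"]})

# After normalization both frames share the same columns and can be stacked.
combined = pd.concat([normalize(old), normalize(new)], ignore_index=True)
print(list(combined.columns))  # ['brand', 'postcode']
```

One thing to keep in mind: rstrip("s") removes every trailing "s", so a header like "address" becomes "addre". That is harmless as long as the same function is applied to every file, but the cleaned names are not always proper words.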

User

Reply #3 · edited 2024-10-04 09:20:53

You can standardize the DataFrame's column names like this:

>>> df.rename(columns={c: "PostCode" if "postcode" in c.lower() else "Brand" for c in df.columns})
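The one-liner above renames every column that does not contain "postcode" to "Brand", which is fine for a two-column frame but would create duplicate names with more columns. A possible variant, not part of the original answer, looks up each name in a keyword table and leaves unrecognized columns alone. The keywords dict, the canonical helper and the sample frame are all assumptions made for this sketch.

```python
import pandas as pd

# Hypothetical keyword table: substring to look for -> canonical column name.
keywords = {"postcode": "PostCode", "brand": "Brand"}


def canonical(name):
    """Return the canonical name for a header, or the header itself if no keyword matches."""
    lowered = name.lower()
    for key, target in keywords.items():
        if key in lowered:
            return target
    return name  # leave unrecognized columns untouched


# Hypothetical sample data with a third, unrelated column.
df = pd.DataFrame({"POSTCODE": ["abde"], "brands": ["exp1"], "price": [3]})
renamed = df.rename(columns=canonical)
print(list(renamed.columns))  # ['PostCode', 'Brand', 'price']
```

Passing a function to rename(columns=...) applies it to each header in turn, so the comprehension over df.columns is not needed in this form.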
