Pandas系列部分替换

网友

1楼 · 编辑于 2024-05-19 09:33:27

您可以使用numpy.where并在替换和降低字符串后检查字典键中是否存在值。如果是，我们想替换它，如果不是-我们不做任何更改：

import numpy as np

val = df["state"].str.replace(' ','').str.lower()

df['state'] = np.where(val.isin(STATE_MAP_DICT.keys()),
                       val.replace(STATE_MAP_DICT),
                       df['state']
                       )

输出：

                state
0       Uttar Pradesh
1   Jammu and Kashmir
2   Jammu and Kashmir
3         pondicherry

网友

2楼 · 编辑于 2024-05-19 09:33:27

您可以使用map方法和fillna

df["state"] = df["state"].astype(str).str.replace(' ','').str.lower().map(STATE_MAP_DICT).fillna(df["state"])

输出：

    state
0   Uttar Pradesh
1   Jammu and Kashmir
2   Jammu and Kashmir
3   pondicherry

网友

3楼 · 编辑于 2024-05-19 09:33:27

只需将以下条目添加到词典中：

{
  'pondicherry': 'Pondicherry',
  'uttarpradesh' 'Uttar Pradesh'
}

或者，使用自定义函数代替str：

def f(s):
  s2= s.replace(' ', '').lower()
  if s2 in STATE_MAP_DICT:
    return STATE_MAP_DICT[s2]
  return s

df['state'] = df["state"].apply(f)

相关问题更多 >

编程相关推荐

热门问题

热门文章

Pandas系列部分替换

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >