Python find-and-replace tool using pandas and a dictionary

mycolumns = ["Col1", "Col2"]
mydictionary = {'A': 'B', 'B': 'C', 'C': 'D'}
for x in mycolumns:
    # 1. If the mycolumn value exists in the headerlist of the file
    if x in headerlist:
        # 2. Get column coordinate
        col = df.columns.get_loc(x) + 1
        # 3. iterate through the rows underneath that header
        for ind in df.index:
            # 4. log the row coordinate
            rangerow = ind + 2
            # 5. get the original value of that coordinate
            oldval = df[x][ind]
            for count, y in enumerate(oldval):
                # 6. generate replacement value
                newval = df.replace({y: mydictionary}, inplace=True, regex=True, value=None)
                print("old: " + str(oldval) + " new: " + str(newval))
                # 7. update the cell
                ws.cell(row=rangerow, column=col).value = newval
            else:
                print("not in the string")
    else:
        # print(df)
        print("column doesn't exist in workbook, moving on")
else:
    print("done")
wb.save(filepath)
wb.close()

1 Answer

User

#1 · Posted 2024-09-28 21:54:52

"newval is never created and I don't know why."

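DataFrame.replace with inplace=True returns None: it modifies the DataFrame itself, so the newval assigned from it in the question is always None.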

>>> import pandas as pd
>>> df = pd.DataFrame({'Code1': ['ABC1', 'B5CD', 'C3DE']})
>>> df = df.replace('ABC1','999')
>>> df
  Code1
0   999
1  B5CD
2  C3DE
>>> q = df.replace('999','zzz', inplace=True) 
>>> print(q)
None
>>> df
  Code1
0   zzz
1  B5CD
2  C3DE
>>>
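As a minimal sketch of the two ways to get at the replaced value, the snippet below contrasts the non-inplace call, which returns a new DataFrame, with the inplace call, after which the new value has to be read back from df itself. The variable names replaced and newval are chosen for illustration only.

```python
import pandas as pd

df = pd.DataFrame({"Code1": ["ABC1", "B5CD", "C3DE"]})

# Without inplace=True, replace() returns a new DataFrame and leaves df unchanged
replaced = df.replace("ABC1", "999")

# With inplace=True, df itself is modified, so do not keep the return value;
# read the updated cell from df instead
df.replace("B5CD", "zzz", inplace=True)
newval = df.loc[1, "Code1"]

print(replaced)
print(newval)
```

In the question's loop this means either dropping inplace=True and using the returned object, or keeping it and reading the cell from df afterwards.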

Another approach is to use str.translate on the column (through its str accessor) to translate the whole Series at once.

>>> df = pd.DataFrame({'Code1': ['ABC1', 'B5CD', 'C3DE']})
>>> mydictionary = {'A': 'B', 'B': 'C', 'C': 'D'}
>>> table = str.maketrans('ABC','BCD')
>>> df
  Code1
0  ABC1
1  B5CD
2  C3DE
>>> df.Code1.str.translate(table)
0    BCD1
1    C5DD
2    D3DE
Name: Code1, dtype: object
>>>
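The same idea can be sketched for the column loop from the question. This assumes every key in mydictionary is a single character, which lets str.maketrans build the table directly from the dictionary. The DataFrame contents here are made-up stand-ins for the workbook data, and writing the values back through openpyxl is left out.

```python
import pandas as pd

mycolumns = ["Col1", "Col2"]
mydictionary = {"A": "B", "B": "C", "C": "D"}

# Hypothetical data standing in for the workbook contents
df = pd.DataFrame({"Col1": ["ABC1", "B5CD"], "Other": ["AAA", "CCC"]})

# str.maketrans accepts a dict whose keys are single characters
table = str.maketrans(mydictionary)

for name in mycolumns:
    if name not in df.columns:
        print(f"column {name!r} doesn't exist, moving on")
        continue
    # Each character is mapped exactly once, so an 'A' that becomes 'B'
    # is not mapped again to 'C'
    df[name] = df[name].str.translate(table)

print(df)
```

Because translate maps every character in a single pass, the overlapping keys and values in mydictionary do not cascade into one another.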
