通过pandas datafram将换行替换为str列的空格

>>> import pandas as pd >>> lol = [[1,2,'abc','foo\nbar'], [3,1, 'def\nhaha', 'love it\n']] >>> pd.DataFrame(lol) 0 1 2 3 0 1 2 abc foo\nbar 1 3 1 def\nhaha love it\n

>>> import pandas as pd >>> lol = [[1,2,'abc','foo\nbar'], [3,1, 'def\nhaha', 'love it\n']] >>> replace_and_strip = lambda x: x.replace('\n', ' ').strip() >>> lol2 = [[replace_and_strip(col) if type(col) == str else col for col in list(row)] for idx, row in pd.DataFrame(lol).iterrows()] >>> pd.DataFrame(lol2) 0 1 2 3 0 1 2 abc foo bar 1 3 1 def haha love it

3条回答

网友
                    
                    

                    

                    1楼 ·

                    
                        编辑于 2024-10-01 13:31:51

您可以使用以下两种正则表达式替换方法：
>>> df.replace({ r'\A\s+|\s+\Z': '', '\n' : ' '}, regex=True, inplace=True)
>>> df
   0  1         2        3
0  1  2       abc  foo bar
1  3  1  def haha  love it
>>> 
详细信息
'\A\s+|\s+\Z'->；''的作用类似于strip()删除所有前导和尾随空格：
\A\s+-匹配字符串开头的一个或多个空白符号
|-或
\s+\Z-匹配字符串末尾的一个或多个空白符号
'\n'->；' '将用空格替换任何换行符。在

网友
                    
                    

                    

                    2楼 ·

                    
                        编辑于 2024-10-01 13:31:51

使用^{}-第一个和最后一个条带，然后替换\n：
df = df.replace({r'\s+$': '', r'^\s+': ''}, regex=True).replace(r'\n',  ' ', regex=True)
print (df)
   0  1         2        3
0  1  2       abc  foo bar
1  3  1  def haha  love it

网友
                    
                    

                    

                    3楼 ·

                    
                        编辑于 2024-10-01 13:31:51

您可以select_dtypes来选择object类型的列，并在这些列上使用applymap。在

因为这些函数没有inplace参数，所以这是对数据帧进行更改的解决方法：

strs = lol.select_dtypes(include=['object']).applymap(lambda x: x.replace('\n', ' ').strip())
lol[strs.columns] = strs
lol
#   0  1         2        3
#0  1  2       abc  foo bar
#1  3  1  def haha  love it

`相关问题更多 >`

`编程相关推荐`

`热门问题`

`热门文章`