在.apply（）for dataframes中使用for循环

remove_l = [r'[A-Za-z]+(?:\/)', r'Today, ', '-'] def remove(x): for phrase in remove_l: if re.search(phrase, x): if phrase == '-': new = x.replace(phrase, ' ') else: new = x[re.search(phrase, x).span()[1]:].strip() return new else: return x #check up on items #60, 330, 347, 411, 647 #idx = nocountries_df[nocountries_df.Name.str.contains('\/')].Name.index nocountries_df.Name.apply(lambda x: remove(x))

1条回答

网友

1楼 · 发布于 2024-09-28 21:58:01

这是一个缩进问题，当它到达第一个返回值（在for循环中）时，它返回该值：

def remove(x):
    for phrase in remove_l:
        if re.search(phrase, x):
            if phrase == '-':
                new = x.replace(phrase, ' ')
            else: 
                new = x[re.search(phrase, x).span()[1]:].strip()
            return new  # <- returns here (in first phase) 
        else: 
            return x  # <- or returns here (in first phase)

如果要在for循环之后返回，那么在for循环中更改x可能是最简单的方法：

def remove(x):
    for phrase in remove_l:
        if re.search(phrase, x):
            if phrase == '-':
                x = x.replace(phrase, ' ')
            else: 
                x = x[re.search(phrase, x).span()[1]:].strip()
    return x

相关问题更多 >

编程相关推荐

热门问题

热门文章