在应用另一个函数时更改列值

oci,citing,cited,creation,timespan,journal_sc,author_sc 0200100000236252421370109080537010700020300040001-020010000073609070863016304060103630305070563074902,"10.1002/pol.1985.170230401","10.1007/978-1-4613-3575-7_2",1985-04,P2Y,no,no ...

def do_process_citation_data(f_path): global my_ocan my_ocan = pd.read_csv("citations.csv", names=['oci', 'citing', 'cited', 'creation', 'timespan', 'journal_sc', 'author_sc'], parse_dates=['creation', 'timespan']) my_ocan = my_ocan.iloc[1:] # to remove the first row iloc - to select data by row numbers my_ocan['creation'] = pd.to_datetime(my_ocan['creation'], format="%Y-%m-%d", yearfirst=True) return my_ocan def parse(): mydict = dict() mydict2 = dict() i = 1 r = 1 for x in my_ocan['oci']: mydict[x] = str(my_ocan['timespan'][i]) i +=1 print(mydict) for key, value in mydict.items(): is_negative = value.startswith('-') if is_negative: date_info = re.findall(r"P(?:(\d+)Y)?(?:(\d+)M)?(?:(\d+)D)?$", value[1:]) else: date_info = re.findall(r"P(?:(\d+)Y)?(?:(\d+)M)?(?:(\d+)D)?$", value) year, month, day = [int(num) if num else 0 for num in date_info[0]] if date_info else [0,0,0] daystotal = (year * 365) + (month * 30) + day if not is_negative: #mydict2[key] = daystotal return daystotal else: #mydict2[key] = -daystotal return -daystotal #print(mydict2) #return mydict2

1条回答

网友

1楼 · 发布于 2024-10-01 02:36:19

一个df['timespan'].apply(parse)（如@Dan所提到的）应该是有效的。您只需修改parse函数，即可将字符串作为参数接收，并在末尾返回解析后的字符串。大概是这样的：

import pandas as pd

def parse_postal_code(postal_code):
    # Splitting postal code and getting first letters
    letters = postal_code.split('_')[0]
    return letters


# Example dataframe with three columns and three rows
df = pd.DataFrame({'Age': [20, 21, 22], 'Name': ['John', 'Joe', 'Carla'], 'Postal Code': ['FF_222', 'AA_555', 'BB_111']})

# This returns a new pd.Series
print(df['Postal Code'].apply(parse_postal_code))

# Can also be assigned to another column
df['Postal Code Letter'] = df['Postal Code'].apply(parse_postal_code)

print(df['Postal Code Letter'])

相关问题更多 >

编程相关推荐

热门问题

热门文章