设定列表的值为数据帧列表

DF1_LIST2: row1 row2 row3 row4 5 55 12 3 51 11 3 52 11 9 59 11 DF2_LIST2: row1 row2 row3 row4 9 91 7 5 1 23 3 24 56 9 68 21

DF1_LIST2: row1 row2 row3 row4 jan18 5 55 12 jan18 3 51 11 jan18 3 52 11 jan18 9 59 11 DF2_LIST2: row1 row2 row3 row4 feb18 9 91 7 feb18 5 1 23 feb18 3 24 56 feb18 9 68 21

import pandas as pd import os from os import listdir from os.path import isfile, join import glob # Get File Names mypath = "//DGMS/Desktop/uploaded" onlyfiles = [f for f in listdir(mypath) if isfile(join(mypath, f))] # Get dates onlyfiles = [name.split("_")[0] for name in onlyfiles] df_of_names = pd.DataFrame(onlyfiles) # Get File Contents all_files = glob.glob(os.path.join(mypath, "*.xls*")) contentdataframes = [pd.read_excel(f) for f in all_files] for dfs in contentdataframes: dfs.insert(0,"date*","") dfs.insert(1,"apply*","") for date in onlyfiles: for dfs in contentdataframes: for row in dfs.itertuples(index=True): dfs.set_value(row,0,date)

2条回答

网友

1楼 · 编辑于 2024-06-30 17:05:13

您可以通过^{}从完整路径中提取文件名。然后用^{}换行：

import os

def extract_name(x):
    return os.path.splitext(fp)[0].split('_')[0]

dfs = [pd.read_excel(fp).assign(row1=extract_name(fp)) for fp in all_files]

网友

2楼 · 编辑于 2024-06-30 17:05:13

使用^{}在每个DataFrame中添加新列：

d = [pd.read_excel(f).assign(row1=os.path.basename(f).split('.')[0].split('_')[0])
     for f in all_files]

编辑：

如果希望处理列并且.assign处理多个列的可读性较差，可以使用loop处理每个DataFrame并最后附加到list：

contentdataframes = []
for f in all_files:
    df = pd.read_excel(f)
    df['col1'] = 10
    df['col2'] = 'string1'
    df['row1'] = os.path.basename(f).split('.')[0].split('_')[0]
    contentdataframes.append(df)

相关问题更多 >

编程相关推荐

热门问题

热门文章