对于循环，通过具有相同标头的目录中的csv文件进行解析，并在每个循环文件后打印新列

import os import pandas as pd #RUN THIS AFTER MAKING ALL THE CSV FILEs dfmaster = pd.DataFrame() directory = "/content/drive/My Drive/" for filename in os.listdir(directory): fullpath = os.path.join(directory, filename) if os.path.isfile(fullpath) and fullpath.endswith(".csv"): dfchild = pd.read_csv(fullpath) select_cols = ['var1', 'var2'] #define columns you want to explort dfmaster = dfchild[select_cols] #####Problem here, After each file print to two new columns I don't know what to input here print(dfmaster.reset_index(drop=True)) dfmaster.to_csv("/content/drive/My Drive/Subsurface_A.csv", index=False)

1条回答

网友

1楼 · 发布于 2024-09-24 06:20:42

您可以在循环外部创建一个空df，然后沿每个循环的列连续连接：

#empty df
finaldf = pd.Dataframe()
for filename in os.listdir(directory):
    fullpath = os.path.join(directory, filename)
    if os.path.isfile(fullpath) and fullpath.endswith(".csv"):
        dfchild = pd.read_csv(fullpath)
        select_cols = ['obs', 'temperature']
        #define columns you want to explort
        dfmaster = dfchild[select_cols]

        #new line
        finaldf = pd.concat([finaldf, dfmaster],axis = 1)


    print(finaldf.reset_index(drop=True))
    finaldf.to_csv("/content/drive/My Drive/Subsurface_A.csv", index=False)

相关问题更多 >

编程相关推荐

热门问题

热门文章