循环Pandas目录

2条回答

网友

1楼 · 编辑于 2024-06-17 04:40:49

不是想偷答案。如果我有足够的代表，我会把这个放在@Asif Ali的回答下面的评论里

假设所有输入.csv文件都遵循以下格式： “修改了\u文件名}.csv的\u{rest\u”

您希望输出为： “sum{same\u rest\u of the \u file\u name}.csv”

import os
import glob

path = "./your/path"
files = glob.glob(os.path.join(path, "*.csv"))

for file in files:
    df = pd.read_csv(file)
    df_new = df.groupby('miRNA')['read_count'].sum()
    print(df_new)
    df_new.to_csv(file.split('modified')[:-1] + \
                  'sum' + \
                  '_'.join(file.split('modified')[-1:]))

网友

2楼 · 编辑于 2024-06-17 04:40:49

尝试查看glob模块

from glob import glob
import os

path = "./your/path"
files = glob(os.path.join(path, "*.csv"))

dataframes = []
for file in files:
    df = pd.read_csv(file)
    # rest you would want to append these to dataframes
    dataframes.append(df)

然后，使用pd.concat连接数据帧并执行groupby操作

编辑1: 根据评论中提到的要求：

results = {}
for file in files:
    df = pd.read_csv(file)
    # perform operation
    df_new = df.groupby('miRNA')['read_count'].sum()
    results[file] = df_new

相关问题更多 >

编程相关推荐

热门问题

热门文章

循环Pandas目录

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >