递归导航文件系统以成对分析文件问题的回答

递归导航文件系统以成对分析文件

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

（回答编辑后的问题。） 在shell中实现这一点比较困难（可读性较差），因此我求助于Python： <pre><code>#!/usr/bin/env python3 import os import re import pprint from sets import Set from subprocess import call group1 = {} # collect here the filenames for _1 group2 = {} # collect here the filenames for _2 for root, directories, filenames in os.walk('.'): for filename in filenames: ff = os.path.join(root,filename) if filename.endswith("_1.txt"): base = re.sub('_1\.txt$','', ff) group1[base] = ff if filename.endswith("_2.txt"): base = re.sub('_2\.txt$','', ff) group2[base] = ff #pprint.pprint(group1) #pprint.pprint(group2) # find common ones: the dirs which contain the files with the common prefix: list1 = Set(group1.keys()).intersection(Set(group2.keys())) #pprint.pprint(list1) # call the myscript.py cwd = os.getcwd() for base in list1: path, filename = os.path.split(base) #print path," ",filename try: os.chdir(path) call(['echo', 'myscript.py', filename+"_1.txt", filename+"_2.txt", "outputfile"]) finally: os.chdir(cwd) </code></pre> （为糟糕的Python风格感到抱歉：我实际上是一个Perl程序员。） <hr/> <blockquote> Most recursive solutions I have seen so far use either find or grep for each individual file however I need the location as well, to get them in pairs and write to disk at the appropriate place. </blockquote> 不要迭代文件-遍历目录。shell中的示例： ^{pr2}$ 或者，您仍然可以迭代文件，让<code>find</code>为我们检查其中一个文件。然后从找到的文件名中提取目录： <pre><code>find -type f -name xyz_1.gz -print | while read FN; do DIR=`dirname $FN` test -r $DIR/xyz_2.gz -a -r $DIR/some_other_file || continue ( cd $DIR; myscript.py xyz_1.gz xyz_2.gz outputfile ) done </code></pre> 此外，您还可以将开头的<code>cd $DIR</code>（<code>os.chdir()</code>）；将目录作为参数或env var传递到Python脚本本身，并检查输入文件（例如，如果文件不存在，则自动退出）。在

递归导航文件系统以成对分析文件

1 个回答

相关Python问题