使用检查点时将目录作为通配符

checkpoint secondstep: input: '{run}/firststep_done.txt', output: DIR = directory('{run}/secondstep') shell: 'mkdir -p {output.DIR} ;' 'mkdir -p {wildcards.run}/secondstep/projectA ;' 'touch {wildcards.run}/secondstep/projectA/file_arbitrary.1 ;' 'touch {wildcards.run}/secondstep/projectA/file_arbitrary.2 ;' 'mkdir -p {wildcards.run}/secondstep/projectB ;' 'touch {wildcards.run}/secondstep/projectB/file_arbitrary.1 ;' 'touch {wildcards.run}/secondstep/projectB/file_arbitrary.2 ;'

def resolve_project(wildcards): checkpoint_output=checkpoints.secondstep.get(**wildcards).output[0] return expand('{run}/report/{project}/arbitrary.all', run=wildcards.run, project=glob_wildcards(os.path.join(checkpoint_output, "{project}")).project)

runs = ['run1', 'run2'] rule all: input: expand('{run}/report/{run}_done', run = runs) rule firststep: output: '{run}/firststep_done.txt' shell: 'touch {output} ;' checkpoint secondstep: input: '{run}/firststep_done.txt', output: DIR = directory('{run}/secondstep') shell: 'mkdir -p {output.DIR} ;' 'mkdir -p {wildcards.run}/secondstep/projectA ;' 'touch {wildcards.run}/secondstep/projectA/file_arbitrary.1 ;' 'touch {wildcards.run}/secondstep/projectA/file_arbitrary.2 ;' 'mkdir -p {wildcards.run}/secondstep/projectB ;' 'touch {wildcards.run}/secondstep/projectB/file_arbitrary.1 ;' 'touch {wildcards.run}/secondstep/projectB/file_arbitrary.2 ;' rule intermediate: input: directory('{run}/secondstep/{project}') output: '{run}/report/{project}/arbitrary.all' shell: 'echo "blabla" > {output}' def resolve_project(wildcards): checkpoint_output=checkpoints.secondstep.get(**wildcards).output[0] return expand('{run}/report/{project}/arbitrary.all', run=wildcards.run, project=glob_wildcards(os.path.join(checkpoint_output, "{project}")).project) rule aggregate: input: resolve_project output: '{run}/report/{run}_done' shell: 'cat {input} > {output}'

1条回答

网友

1楼 · 发布于 2024-10-02 12:29:03

你需要的是wildcard_constraints:https://snakemake.readthedocs.io/en/stable/tutorial/additional_features.html#constraining-wildcards

这允许您定义一个正则表达式，将通配符限制为使用正则表达式定义的内容。例如：

wildcard_constraints:
    project="[^/]+"

定义约束有几种方法：全局、规则或内联。下面是一个内联约束的示例：output: '{run}/report/{project,[^/]+}/arbitrary.all'

相关问题更多 >

编程相关推荐

热门问题

热门文章