How can I access the retry attempt number in Snakemake Python code?

def getTargetFiles(files, attempted):
  do stuff
  return modified-target-files

rule do_things_rule:
  input: ...
  output: getTargetFiles("file/path.txt", resources.attempt)
  resources:
    attempt=lambda wildcards, attempt: attempt,

1 Answer

User

#1 · Posted on 2024-10-02 22:36:35

Here is a minimal working example with which I can reproduce your error:

def getTargetFiles(files, attempted):
  # "test.txt" with attempted=1 becomes "test-1.txt"
  return f"{files[:-4]}-{attempted}.txt"

rule do_things_rule:
  resources:
    nr = lambda wildcards, attempt: attempt
  output:
    # This is the line that triggers the error: resources is not known here
    getTargetFiles("test.txt", resources.nr)
  shell:
    'echo "Failing on purpose to produce file '
    '{output} at attempt {resources.nr}'
    '"; exit 1 '

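In fact, output does not know about resources. I assume this is because the output has to be known before the rule runs (see below). By contrast, if you replace getTargetFiles("test.txt", resources.nr) with getTargetFiles("test.txt", 1), the rule runs the correct number of attempts, and the shell command can access resources.nr.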
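The timing difference can be illustrated with a small plain-Python sketch. This is a toy model under the assumption that the output expression is evaluated once when the rule is declared, while a resource callable is stored and called later for each attempt; it does not reproduce Snakemake's actual machinery.

```python
# Toy model of the timing difference; this is NOT Snakemake's real machinery.

def getTargetFiles(files, attempted):
    # Same helper as in the example: ("test.txt", 1) -> "test-1.txt"
    return f"{files[:-4]}-{attempted}.txt"

# The output expression is an ordinary Python expression, so it is evaluated
# right away, when the rule is declared. Only values that already exist at
# that point (such as the literal 1) can be used in it.
output = getTargetFiles("test.txt", 1)

# A resource given as a callable is only stored at declaration time. It is
# called later, once per attempt, with the current attempt number.
nr = lambda wildcards, attempt: attempt

for attempt in (1, 2, 3):  # pretend the job is retried three times
    print(f"attempt {nr(None, attempt)} targets {output}")
```

The file name stays test-1.txt on every iteration, while the value returned by nr changes with each attempt.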

As far as I understand, there is a fundamental reason for this problem.

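Snakemake workflows are "defined in terms of rules that define how to create output files from input files. Dependencies between the rules are determined automatically." (quoted from the Tutorial) This means that Snakemake needs to know which output file a rule will create before the rule runs. Only then can it decide whether the rule needs to run at all. Therefore the attempt number should, at least in general, not be part of the output file name.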
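To make the name-matching argument concrete, here is a deliberately simplified sketch in plain Python. RULE_OUTPUTS and find_producer are hypothetical names invented for this illustration; Snakemake's real dependency resolution is far more involved.

```python
# Toy model: each rule declares its output names up front (hypothetical data).
RULE_OUTPUTS = {
    "do_things_rule": ["test-1.txt"],
    "combine": ["test-combined.txt"],
}

def find_producer(target, rule_outputs):
    """Return the name of the rule that declares `target` as output, or None."""
    for rule_name, outputs in rule_outputs.items():
        if target in outputs:
            return rule_name
    return None

# A declared name can be matched to a rule before anything runs.
print(find_producer("test-1.txt", RULE_OUTPUTS))

# A name that only exists after a retry was never declared, so no rule matches.
print(find_producer("test-2.txt", RULE_OUTPUTS))
```

In this toy model, a file name that depends on the attempt number cannot be matched, because the attempt is not known when the names are declared.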

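Maybe you want to combine the different files from the failed attempts? However, if a rule fails, there will be no output file, even if you force one to be created: Snakemake deletes the file. (See the example below.)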

def getTargetFiles(files, attempted):
  return f"{files[:-4]}-{attempted}.txt"

# First rule, so it is the default target; it needs test-1.txt
rule combine:
  input:
    'test-1.txt'
  output:
    'test-combined.txt'
  shell:
    'cat test-[0-9]*.txt > test-combined.txt'

rule do_things_rule:
  resources:
    nr = lambda wildcards, attempt: attempt
  output:
    getTargetFiles("test.txt", 1)
  shell:
    # Create the output file, then fail anyway
    'touch {output}; '
    'echo "Failing on purpose to produce file '
    '{output} at attempt {resources.nr}'
    '"; exit 1 '

How about keeping the attempt number out of the file name and using resources.nr in the shell command instead?
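One possible shape of that suggestion, as a sketch only: the output name is fixed, and the attempt number is used inside the shell command. To keep the example in plain Python, the rule is held in a string; SNAKEFILE_TEMPLATE and render_snakefile are hypothetical helpers for this illustration, while the rule text reuses do_things_rule and resources.nr from the examples above.

```python
# The rule text below is Snakemake syntax stored in a Python string.
# OUTPUT_NAME is a placeholder that render_snakefile fills in.
SNAKEFILE_TEMPLATE = """\
rule do_things_rule:
    output:
        "OUTPUT_NAME"
    resources:
        nr=lambda wildcards, attempt: attempt
    shell:
        'echo "written at attempt {resources.nr}" > {output}'
"""

def render_snakefile(output_name="test.txt"):
    """Return the rule text with a fixed, attempt-independent output name."""
    # str.replace is used so the Snakemake braces need no escaping.
    return SNAKEFILE_TEMPLATE.replace("OUTPUT_NAME", output_name)

if __name__ == "__main__":
    print(render_snakefile())
```

Saved as a Snakefile, the printed rule declares a single output name that Snakemake can match in advance, and the attempt number ends up in the file's content rather than in its name. Retries themselves would still have to be enabled separately, for example with the --restart-times command-line option, assuming a Snakemake version that provides it.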

I hope this solves your problem.
