Appending to an existing multiple sequence alignment

1 answer

User

#1 · Posted 2024-10-01 19:29:39

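If your main concern is the time it takes to re-run the alignment (recomputing the PWI matrix should be computationally cheap), then MUSCLE can do what you want, through a process commonly called "profile-profile alignment".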

Profile-profile alignment

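When the -profile flag is passed, the alignments are "re-aligned to each other, keeping input columns intact and inserting columns of gaps where needed":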

If you have two existing alignments of related sequences you can use the -profile option of MUSCLE to align those two sequences. Typical usage is:
   muscle -profile -in1 one.afa -in2 two.afa -out both.afa
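
To make "keeping input columns intact" concrete, here is a minimal sketch of a check that a combined alignment still contains an input profile unchanged, apart from inserted all-gap columns. The helper names `drop_gap_only_columns` and `columns_preserved` are hypothetical, the data is made up for illustration rather than real MUSCLE output, and alignments are assumed to be plain dicts mapping sequence id to an aligned string of equal length.

```python
def drop_gap_only_columns(rows):
    """Remove every column that is '-' in all rows."""
    if not rows:
        return []
    keep = [i for i in range(len(rows[0])) if any(r[i] != "-" for r in rows)]
    return ["".join(r[i] for i in keep) for r in rows]


def columns_preserved(original, merged):
    """True if `merged`, restricted to the ids in `original`, differs from
    `original` only by inserted all-gap columns.

    Both arguments map sequence id -> aligned string. `merged` may hold
    extra sequences (the other profile); those are ignored.
    """
    ids = sorted(original)
    before = drop_gap_only_columns([original[i] for i in ids])
    after = drop_gap_only_columns([merged[i] for i in ids])
    return before == after


# Toy data (made up for illustration, not real MUSCLE output)
profile = {"a": "AC-GT", "b": "ACCGT"}
combined = {"a": "AC--GT", "b": "AC-CGT", "new": "ACTCGT"}
print(columns_preserved(profile, combined))
```

In the toy data the third column of `combined` is a gap in both original sequences, so it counts as an inserted gap column and the original columns survive untouched.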

Implementation in Biopython

Biopython has a wrapper around MUSCLE, but I find it simpler to call MUSCLE using subprocess and then parse the result back into a MultipleSeqAlignment:

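import subprocess
from io import StringIO

from Bio import AlignIO

# Do profile-profile alignment (one sequence to many aligned)
# These flags follow the MUSCLE 3.x command line
seq_fn = "influenza_seq.fasta"
aligned_fn = "520_influenza_seqs.afasta"
cmd = ['muscle', '-clwstrict', '-profile', '-in1', seq_fn, '-in2', aligned_fn]
# text=True returns stdout/stderr as str instead of bytes (Python 3.7+)
aligner = subprocess.Popen(cmd, stdout=subprocess.PIPE, stderr=subprocess.PIPE,
                           text=True)
stdout, stderr = aligner.communicate()

# Get resulting alignment (MultipleSeqAlignment)
# Bio.Alphabet was removed in Biopython 1.78, so no alphabet argument is passed
alignment = AlignIO.read(StringIO(stdout), "clustal")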
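
The snippet above collects stderr but never looks at it, so a failed run would only show up later as a parse error. A minimal sketch of a stricter runner follows, assuming the aligner signals failure with a non-zero exit status; `run_aligner` is a hypothetical helper name, and the command at the bottom is a stand-in so the sketch runs without MUSCLE installed.

```python
import subprocess
import sys


def run_aligner(cmd):
    """Run `cmd` and return its stdout as text; raise if it exits non-zero."""
    # capture_output and text both require Python 3.7+
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        raise RuntimeError(
            f"{cmd[0]} exited with status {result.returncode}:\n{result.stderr}"
        )
    return result.stdout


# Stand-in command (not MUSCLE) so the sketch is runnable anywhere
print(run_aligner([sys.executable, "-c", "print('ok')"]))
```

With the `cmd` list built above, `stdout = run_aligner(cmd)` could stand in for the `Popen` and `communicate()` pair before the result is handed to `AlignIO.read`.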
