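<p>I like @Chris_Rands's <a href="https://stackoverflow.com/a/49052433/778533">Biopython solution</a> better for small files, but here is a solution that uses only the batteries included with Python and is memory-efficient. It assumes that the FASTA and FASTQ files contain the same number of reads in the same order.</p>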
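<pre><code>with open('Input.fasta') as fasta, open('Input.fastq') as fastq, open('DesiredOutput.fastq', 'w') as fo:
    for i, line in enumerate(fastq):
        # The second line of each 4-line FASTQ record (i % 4 == 1) is the sequence.
        if i % 4 == 1:
            # Read two FASTA lines (header, then sequence); only the sequence is kept.
            for j in range(2):
                line = fasta.readline()
        # Write the line: either the replaced sequence or the original FASTQ line.
        print(line, end='', file=fo)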
</code></pre>
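<p>To try the same idea without touching real files, here is a minimal sketch that wraps the loop in a function and runs it on in-memory data. The function name <code>replace_sequences</code>, the sample reads, and the added check are my own illustration, not part of the snippet above. Like that snippet, it assumes each FASTA record is exactly two lines (a header and a single-line sequence).</p>

```python
import io


def replace_sequences(fasta, fastq, out):
    """Copy FASTQ records to `out`, taking each sequence line from `fasta`.

    Hypothetical helper: assumes two-line FASTA records (header + sequence)
    in the same order as the FASTQ records.
    """
    for i, line in enumerate(fastq):
        if i % 4 == 1:  # sequence line of a 4-line FASTQ record
            header = fasta.readline()  # FASTA header, discarded
            line = fasta.readline()    # FASTA sequence, replaces the FASTQ one
            if not header.startswith('>') or not line:
                raise ValueError(
                    f'FASTA ran out or is malformed at FASTQ line {i + 1}')
        out.write(line)


# Made-up sample data: two reads in each file, same order.
fasta = io.StringIO('>read1\nAAAA\n>read2\nCCCC\n')
fastq = io.StringIO('@read1\nACGT\n+\nIIII\n@read2\nTGCA\n+\nIIII\n')
out = io.StringIO()

replace_sequences(fasta, fastq, out)
result = out.getvalue()
print(result, end='')
```

<p>Because the function only needs objects that can be iterated and read line by line, the handles returned by <code>open()</code> can be passed in just the same, and memory use stays at one line at a time. The <code>ValueError</code> check is an optional safeguard for the case where the FASTA file has fewer reads than the FASTQ file.</p>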