<p>因为我没有学生物学,所以我仍然对你的问题感到困惑</p>
<p>这个答案纯粹是为了<code>souce_text == expected_text</code></p>
<pre><code>from io import StringIO
from Bio import SeqIO # pip install biopython # https://biopython.org/wiki/Download
import re
source_text = """\
>lcl|NC_045512.2_gene_6 [gene=ORF6] [locus_tag=GU280_gp06][db_xref=GeneID:43740572] [location=27202..27387] [gbkey=Gene]
ATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAG
"""
expected_text = """\
ORF6
ATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAG
"""
regex = re.compile("\[gene=[\w]*\] ") # \w: [a-zA-Z0-9_]
result = ''
for record in SeqIO.parse(StringIO(source_text), 'fasta'):
# print(record.name)
gene_name = regex.search(record.description).group() # [ORF6]
gene_name = gene_name[gene_name.find('=')+1: -2] # ORF6
print(gene_name)
print(record.seq)
result += gene_name + '\n' + record.seq + '\n'
if result == expected_text:
print('ok')
</code></pre>
<pre><code>ORF6
ATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAG
ok
</code></pre>
<h2>参考文献</h2>
<p>以下是不熟悉<code>biopython</code>的人的参考资料</p>
<ul>
<li>什么是<a href="http://biopython.org/DIST/docs/api/Bio.SeqIO-module.html" rel="nofollow noreferrer">SeqIO.parse</a></li>
<li>什么是<a href="https://biopython.org/wiki/SeqIO" rel="nofollow noreferrer">fasta</a></li>
</ul>
<hr/>
<ul>
<li>更多<a href="https://www.ncbi.nlm.nih.gov/sviewer/viewer.fcgi?db=nuccore&val=973795115&page_size=5&fmt_mask=0&report=gene_fasta&retmode=file" rel="nofollow noreferrer">test data</a></li>
</ul>