<p>这应该完全符合您的要求:</p>
<pre><code>infile = open('out3.txt', 'r')
outfile = open('out5.txt', 'w')
for line in infile:
line = (line[line.find('[') + 1:])[:-2] + "\n"
outfile.write(line)
infile.close()
outfile.close()
</code></pre>
<p><strong>out3.txt</strong></p>
<pre><code>RAH71880.1 phenol monooxygenase [Aspergillus aculeatinus CBS 121060]
PVV21043.1 phenol 2-monooxygenase [gamma proteobacterium symbiont of Ctena orbiculata]
PVV21041.1 phenol hydroxylase [gamma proteobacterium symbiont of Ctena orbiculata]
PYH66749.1 phenol monooxygenase [Aspergillus vadensis CBS 113365]
PYH31415.1 phenol monooxygenase [Aspergillus neoniger CBS 115656]
PUB86175.1 phenol 2-monooxygenase [gamma proteobacterium symbiont of Ctena orbiculata]
PUB86141.1 phenol 2-monooxygenase [gamma proteobacterium symbiont of Ctena orbiculata]
PUB86139.1 phenol hydroxylase [gamma proteobacterium symbiont of Ctena orbiculata]
PUB79626.1 phenol hydroxylase [gamma proteobacterium symbiont of Ctena orbiculata]
PUB79624.1 phenol 2-monooxygenase [gamma proteobacterium symbiont of Ctena orbiculata]
PUB72973.1 phenol 2-monooxygenase [gamma proteobacterium symbiont of Ctena orbiculata]
PUB72971.1 phenol hydroxylase [gamma proteobacterium symbiont of Ctena orbiculata]
PWY90296.1 phenol monooxygenase [Aspergillus sclerotioniger CBS 115572]
PWY63616.1 phenol monooxygenase [Aspergillus eucalypticola CBS 122712]
</code></pre>
<p><strong>out5.txt</strong></p>
<pre><code>Aspergillus aculeatinus CBS 121060
gamma proteobacterium symbiont of Ctena orbiculata
gamma proteobacterium symbiont of Ctena orbiculata
Aspergillus vadensis CBS 113365
Aspergillus neoniger CBS 115656
gamma proteobacterium symbiont of Ctena orbiculata
gamma proteobacterium symbiont of Ctena orbiculata
gamma proteobacterium symbiont of Ctena orbiculata
gamma proteobacterium symbiont of Ctena orbiculata
gamma proteobacterium symbiont of Ctena orbiculata
gamma proteobacterium symbiont of Ctena orbiculata
gamma proteobacterium symbiont of Ctena orbiculata
Aspergillus sclerotioniger CBS 115572
Aspergillus eucalypticola CBS 122712
</code></pre>
<hr/>
<p><strong>编辑</p>
<p>如果您只想打印出唯一的行,可以这样更新源代码:</p>
<pre><code>infile = open('out3.txt', 'r')
outfile = open('out5.txt', 'w')
unique = []
for line in infile:
line = (line[line.find('[') + 1:])[:-2] + "\n"
if line not in unique:
unique.append(line)
outfile.write(line)
infile.close()
outfile.close()
</code></pre>
<p>然后您将得到如下输出(out5.txt):</p>
<pre><code>Aspergillus aculeatinus CBS 121060
gamma proteobacterium symbiont of Ctena orbiculata
Aspergillus vadensis CBS 113365
Aspergillus neoniger CBS 115656
Aspergillus sclerotioniger CBS 115572
Aspergillus eucalypticola CBS 122712
</code></pre>