<p>如果你用</p>
<pre><code> with open('file.txt', 'r') as f:
a = f.read()
a = a.split('\n')
</code></pre>
<p>将提供以下输出:</p>
<pre><code>['Degardome Category: 4',
'Degradome p-value: 0.00120246641531374',
'T-Plot file: T-plots IGR/Seq_5744_249_Supercontig_2.10_1257006_264_TPlot.pdf',
'',
'Position Reads Category',
'264 1 4 <<<<<<<<<<',
'914 1 4',
'987 4 0',
' -',
' -',
'',
"5' UUGGAGGUGGCUGGACGGAUG 3' Transcript: Supercontig_2.10_1395094:908-928 Slice Site:919",
' ||||o||||oo|o|',
"3' AUUAAUAACCGGCCUGUUUGC 5' Query: Seq_1950_218",
'HV2.fasta_dd.txt',
'Degardome Category: 4',
'Degradome p-value: 0.00360306320817827',
'T-Plot file: T-plots-IGR/Seq_1950_218_Supercontig_2.10_1395094_919_TPlot.pdf',
'',
'Position Reads Category',
'919 1 4 <<<<<<<<<<',
' -',
' -',
'',
"5' AGAAGGGGAAGAGUGGAGGAGAG 3' Transcript: Supercontig_2.10_1543625:626-648 Slice Site:637",
' |||o|oo||||o| o||o||',
"3' ACUUUCUUUUCUUAA UCUUUC 5' Query: Seq_2171_593"]
</code></pre>
<p>现在初始化一个空字符串并连接所有相关部分:</p>
<pre><code>In [4]: t = ''
In [5]: for line in a:
...: if 'Degardome Category:' in line:
...: t += line + ' '
...: if 'Degradome p-value:' in line:
...: t += line + ' '
...: if 'Query' in line:
...: t += line.replace('Query:', '') + '\n'
</code></pre>
<p>最后,根据新行拆分字符串:</p>
<pre><code>In [6]: out = [i for i in t.split('\n') if i]
In [7]: out
Out[7]:
["Degardome Category: 4 Degradome p-value: 0.00120246641531374 3'
AUUAAUAACCGGCCUGUUUGC 5' Seq_1950_218",
"Degardome Category: 4 Degradome p-value: 0.00360306320817827 3'
ACUUUCUUUUCUUAA UCUUUC 5' Seq_2171_593"]
</code></pre>