<p>You can use a regex:</p>
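<pre><code>import re

start = []
genes = []
with open(file) as f:  # file: path to the input file
    for line in f:
        m = re.search(r'\bexon\s+(\d+)\b.*?\s+gene_id\s+"([^"]*)"', line)
        if m is None:  # skip lines without an exon/gene_id pair
            continue
        st, gen = m.groups()  # exon start position, gene id
        start.append(st)
        genes.append(gen)
print(set(start))
print(set(genes))
</code></pre>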
<p><a href="https://regex101.com/r/gP1tP2/1" rel="nofollow">DEMO</a></p>
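<p>To try the pattern without an input file, here is a minimal sketch. It assumes GTF-like, tab-separated lines; the sample records and the <code>extract</code> helper are made up for illustration and are not taken from the question.</p>

```python
import re

# Same pattern as above, compiled once for reuse.
PATTERN = re.compile(r'\bexon\s+(\d+)\b.*?\s+gene_id\s+"([^"]*)"')

def extract(lines):
    """Collect unique exon start positions and gene ids (hypothetical helper)."""
    starts, genes = set(), set()
    for line in lines:
        m = PATTERN.search(line)
        if m is None:  # not an exon line, or no gene_id attribute
            continue
        st, gen = m.groups()
        starts.add(st)
        genes.add(gen)
    return starts, genes

# Hypothetical sample records, assumed to resemble the real input.
sample = [
    'chr1\tsrc\texon\t11874\t12227\t.\t+\t.\tgene_id "DDX11L1"; transcript_id "NR_046018";',
    'chr1\tsrc\tCDS\t12613\t12721\t.\t+\t0\tgene_id "DDX11L1";',
    'chr1\tsrc\texon\t14362\t14829\t.\t-\t.\tgene_id "WASH7P";',
]

starts, genes = extract(sample)
print(sorted(starts))
print(sorted(genes))
```

<p>The <code>CDS</code> line is skipped because the pattern requires the word <code>exon</code> followed by a number. The captured start positions are strings; convert them with <code>int()</code> if you need to sort or compare them numerically.</p>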