擅长:python、mysql、java
<p>使用<code>str.split()</code>拆分行</p>
<p>使用<code>collections.defaultdict(list)</code>来跟踪重复</p>
<p>例如</p>
<pre><code>>>> lines = """PAIR 1MFK 1 URANIUM 82 HELIUM 112 3.6997
... PAIR 2JGH 2 PLUTONIUM 98 POTASSIUM 88 5.3003
... PAIR 345G 3 SODIUM 23 CARBON 14 1.664
... PAIR 4IG5 4 LITHIUM 82 ARGON 99 2.5506 """.split("\n")
>>>
>>> from collections import defaultdict
>>> D = defaultdict(list)
>>> for line in map(str.split, lines):
... k = line[4]
... D[k].append(line)
...
>>> [v for k,v in D.items() if len(v) > 1]
[[['PAIR', '1MFK', '1', 'URANIUM', '82', 'HELIUM', '112', '3.6997'], ['PAIR', '4IG5', '4', 'LITHIUM', '82', 'ARGON', '99', '2.5506']]]
>>> [(k,v) for k,v in D.items() if len(v) > 1]
[('82', [['PAIR', '1MFK', '1', 'URANIUM', '82', 'HELIUM', '112', '3.6997'], ['PAIR', '4IG5', '4', 'LITHIUM', '82', 'ARGON', '99', '2.5506']])]
</code></pre>