<p>Thanks, everyone. Your suggestions were very helpful. I revised the code as follows:</p>
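<pre><code># `data` is assumed to already hold the input text, one record per line.
SEP = '\\o'  # literal backslash + "o"; a bare '\o' is an invalid escape sequence
SubPatent2count = {}
for line in data.split('\n'):
    # "Map" step: emit the first space-separated field with a count of 1.
    Mapper_data = ["%s%s%d" % (line.split(' ')[0], SEP, 1)]
    # "Reduce" step: parse each emitted record and add it to the running total.
    for record in Mapper_data:
        Sub_Patent, count = record.strip().split(SEP, 1)
        try:
            count = int(count)
        except ValueError:
            continue  # skip records whose count is not an integer
        try:
            SubPatent2count[Sub_Patent] = SubPatent2count[Sub_Patent] + count
        except KeyError:
            SubPatent2count[Sub_Patent] = count  # first occurrence of this key
for Sub_Patent in SubPatent2count:
    print('%s\t%s' % (Sub_Patent, SubPatent2count[Sub_Patent]))
</code></pre>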
<p>The result is as follows:</p>
<pre><code>1 2 (1 occurs 2 times)
2 4
3 2
4 3
</code></pre>
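<p>For comparison, here is a minimal sketch of the same count using <code>collections.Counter</code> from the standard library. It is not the code from the post: the function name <code>count_first_fields</code> and the sample input are hypothetical, and it assumes the goal is simply to count how often each first space-separated field occurs. Unlike the code above, it skips blank lines.</p>

```python
from collections import Counter


def count_first_fields(text):
    """Count how often each first space-separated field occurs in `text`."""
    counts = Counter()
    for line in text.split('\n'):
        line = line.strip()
        if not line:
            continue  # skip blank lines (an assumption; the original counts them)
        counts[line.split(' ')[0]] += 1
    return counts


# Hypothetical sample input, only for illustration.
sample = "1 a\n2 b\n1 c\n2 d\n"
for key, n in sorted(count_first_fields(sample).items()):
    print('%s\t%d' % (key, n))
```

<p>This drops the intermediate step of formatting each record into a string and parsing it back, which the original keeps to mimic a mapper/reducer pair. If that separation matters, the two-step version above may still be the better fit.</p>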