使用计算值创建字典

textNP = 'stopped traffic bklyn bqe 278 wb manhattan brtillary stx29 wb cadman pla hope oufootball makes safe manhattan kansas tomorrow boomersooner beatwildcats theyhateuscuztheyaintus hatersgonnahate rt bringonthecats bring cats exclusive live footage oklahoma trying get manhattan http colktsoyzvvz rt jonfmorse bring cats exclusive live footage oklahoma trying get manhattan'

txtU = set(textNP) lntxt = len(textNP) lntxtS = len(txtU) matrixNP = {} for b1, i1 in txtU: for b2, i2 in txtU: if i1< i2: bb1 = b1+b2 bb2 = b2+b1 freq = 0 for k in textNP: for j in textNP: if k < j: kj = k+j if kj == bb1 | kj == bb2: freq +=1 matrixNP[i1][i2] = freq matrixNP[i2][i1] = freq elif i1 == i2: matrixNP[i1][i1] = 1

1条回答

网友

1楼 · 发布于 2024-09-28 05:24:31

您是否正在查找2个单词的所有组合，如果是这样，您可以使用itertools.combinations和collections.Counter来执行您想要的操作：

>>> from itertools import combinations
>>> from collections import Counter
>>> N = 5
>>> c = Counter(tuple(sorted(a)) for a in combinations(textNP.split(), 2))
>>> c.most_common(N)
[(('manhattan', 'rt'), 8),
 (('exclusive', 'manhattan'), 8),
 (('footage', 'manhattan'), 8),
 (('manhattan', 'oklahoma'), 8),
 (('bring', 'manhattan'), 8)]

或者，如果要查找所有成对的连续单词，则可以创建成对函数：

>>> from itertools import tee
>>> from collections import Counter
>>> def pairwise(iterable):
...     a, b = tee(iterable)
...     next(b, None)
...     return zip(a, b)    # itertools.izip() in python2
>>> N = 5
>>> c = Counter(tuple(sorted(a)) for a in pairwise(textNP.split()))
>>> c.most_common(N)
[(('get', 'manhattan'), 2),
 (('footage', 'live'), 2),
 (('get', 'trying'), 2),
 (('bring', 'cats'), 2),
 (('exclusive', 'live'), 2)]

我也不认为骑自行车在名单上。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章