查找两个列表之间的相似性。根据列表中值的位置分别给出权重。

a=["are","you","are","you","why"] b=['you',"are","you",'are',"why"] li=[] va=[] fi=[] weightOfStatic=1/len(a) for i in range(len(a)): if a[i]==b[i]: print("true1", weightOfStatic,a[i],b[i]) fi.append({"static":i, "dynamic":i,"Weight":weightOfStatic}) li.append([weightOfStatic,a[i],b[i]]) va.append(li) else: for j in range(len(b)): if a[i]==b[j]: weightOfDynamic = weightOfStatic*(1-(1/len(b))*abs(i-j)) fi.append({"static":i, "dynamic":j,"Weight":weightOfDynamic}) print("true2 and index diiference between words =%d"% abs(i-j),weightOfDynamic, i,j) li.append([weightOfDynamic,a[i],b[j]]) va.append(weightOfDynamic) sim_value=sum(va) print("The similarity value is = %f" %(sim_value))

{'static': 0, 'dynamic': 1, 'Weight': 0.160} here 0 should not match with 3 again {'static': 0, 'dynamic': 3, 'Weight': 0.079} {'static': 1, 'dynamic': 0, 'Weight': 0.160} same for 1 and 2 {'static': 1, 'dynamic': 2, 'Weight': 0.160} dynamic 1 is already overhere {'static': 2, 'dynamic': 1, 'Weight': 0.160} {'static': 2, 'dynamic': 3, 'Weight': 0.160} dynamic 0 is already over {'static': 3, 'dynamic': 0, 'Weight': 0.079} {'static': 3, 'dynamic': 2, 'Weight': 0.160} [0.2, 'why', 'why']

{'static': 0, 'dynamic': 1, 'Weight': 0.160} {'static': 1, 'dynamic': 0, 'Weight': 0.160} {'static': 2, 'dynamic': 3, 'Weight': 0.160} {'static': 3, 'dynamic': 2, 'Weight': 0.160} [0.2, 'why', 'why']

1条回答

网友

1楼 · 发布于 2024-09-30 20:32:21

首先，我“美化”了你的代码，让它看起来更像Python我觉得你有点过分复杂了。实际上，它甚至没有为我运行，因为你试图对一个包含int和list的列表求和

a = ['are','you','are','you','why']
b = ['you','are','you','are','why']

total_weight = 0
weight_of_static = 1/len(a)
for i, a_word in enumerate(a):
    if a_word == b[i]:
        print('{0} <-> {1} => static\t\t// weight: {2:.2f}'.format(a_word, b[i], weight_of_static))
        total_weight += weight_of_static
    else:
        distances = []
        for j, b_word in enumerate(b):
            if a_word == b_word:
                distances.append(abs(i - j))

        dynamic_weight = weight_of_static*(1 - ( 1 / len(b)) * min(distances))
        total_weight += dynamic_weight
        print('{0} <-> {1} => not static\t// weight: {2:.2f}'.format(a_word, b[i], dynamic_weight))

print('The similarity value is = {0:.2f}'.format(total_weight))

因此，首先我声明一个total_weight变量来跟踪权重。
然后我充分利用枚举函数，这样我就可以有索引和元素
如果这两个词在同一个索引中是相同的，那么很简单：）
如果没有，那么我们将像您一样遍历第二个列表，但是我们必须跟踪距离变量中的匹配项，因为a[3]将匹配b[0]，而不是更接近的b[2]
之后，我们就用你的公式来计算动态权重（我把它留得有点冗长，这样你就可以看得更清楚了）。唯一的区别是我们使用最小的距离（min(distance)）

这是我的示例输出：

$ python similarity.py
are <-> you => not static       // weight: 0.16
you <-> are => not static       // weight: 0.16
are <-> you => not static       // weight: 0.16
you <-> are => not static       // weight: 0.16
why <-> why => static           // weight: 0.20
The similarity value is = 0.84

我希望这有帮助

相关问题更多 >

编程相关推荐

热门问题

热门文章