Measuring similarity between binary lists

from sklearn.metrics.cluster import normalized_mutual_info_score

l1 = [1,0,1]
l2 = [0,1,0]
print(normalized_mutual_info_score(l1, l2))

l1 = [0,0,0]
l2 = [0,0,0]
print(normalized_mutual_info_score(l1, l2))

2 answers

User

#1 · edited 2024-09-28 01:30:08

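No, a plot does not make sense here. What you are doing is basically an inner product between vectors. Under this metric, l1 and l2 are treated as vectors in a 3D space (in this case), and the score measures whether they point in the same direction and have similar lengths. The output is a single scalar value, so there is nothing to plot.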
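To make the inner-product reading concrete, here is a minimal sketch in plain Python. The function names `inner_product` and `cosine_similarity` are made up for illustration. Note that this only illustrates the vector view described in this answer; it is not what sklearn's `normalized_mutual_info_score` computes, since that function measures agreement between cluster labelings.

```python
import math

def inner_product(u, v):
    """Sum of element-wise products of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))

def cosine_similarity(u, v):
    """Cosine of the angle between u and v, or None if either is all zeros."""
    norm_u = math.sqrt(inner_product(u, u))
    norm_v = math.sqrt(inner_product(v, v))
    if norm_u == 0 or norm_v == 0:
        return None  # direction is undefined for a zero vector
    return inner_product(u, v) / (norm_u * norm_v)

print(inner_product([1, 0, 1], [0, 1, 0]))      # 0
print(cosine_similarity([1, 0, 1], [0, 1, 0]))  # 0.0
print(cosine_similarity([0, 0, 0], [0, 0, 0]))  # None
```

Either way the result is one number per pair of lists, which is why there is nothing to draw for a single pair.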

If you want to show the individual contribution of each component, you could do something like this:

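import matplotlib.pyplot as plt
# True where the two lists agree at a position, False where they differ
contributions = [a==b for a, b in zip(l1, l2)]
plt.plot(list(range(len(contributions))), contributions)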

But I'm still not sure that makes sense.
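If a single number is still wanted from those per-position matches, one option is the fraction of positions where the two lists agree. This is a sketch under the assumption that both lists are non-empty and of equal length; `match_contributions` and `match_fraction` are hypothetical helper names, not library functions.

```python
def match_contributions(l1, l2):
    """Return 1 for each position where l1 and l2 agree, else 0."""
    return [int(a == b) for a, b in zip(l1, l2)]

def match_fraction(l1, l2):
    """Fraction of positions at which the two lists hold the same value."""
    contributions = match_contributions(l1, l2)
    return sum(contributions) / len(contributions)

print(match_contributions([1, 0, 1], [0, 1, 0]))  # [0, 0, 0]
print(match_fraction([1, 0, 1], [0, 1, 0]))       # 0.0
print(match_fraction([0, 0, 0], [0, 0, 0]))       # 1.0
print(match_fraction([1, 0, 1], [1, 1, 1]))       # 0.6666666666666666
```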

User

#2 · edited 2024-09-28 01:30:08

import numpy as np
import matplotlib.pyplot as plt

def unpackbits(a, n):
    ''' Unpacks an integer `a` to n-length binary list. ''' 
    return [a >> i & 1 for i in range(n-1,-1,-1)]


def similarity(a, b, n):
    ''' Similarity between n-length binary lists obtained from unpacking
    the integers a and b. '''
    a_unpacked = unpackbits(a, n)
    b_unpacked = unpackbits(b, n)
    return np.sum(np.isclose(a_unpacked, b_unpacked))/n


# Plot
n = 3
x = np.arange(2**n+1)
y = np.arange(2**n+1)
xx, yy = np.meshgrid(x, x)
z = np.vectorize(similarity)(yy[:-1,:-1], xx[:-1,:-1], n)

labels = [unpackbits(i, n) for i in x]
cmap = plt.get_cmap('binary', n+1)  # plt.cm.get_cmap was removed in Matplotlib 3.9

fig, ax = plt.subplots()
pc = ax.pcolor(x, y, z, cmap=cmap, edgecolor='k', vmin = 0, vmax=1)
ax.set_xticks(x + 0.5)
ax.set_yticks(y + 0.5)
ax.set_xlim(0, 2**n)
ax.set_ylim(0, 2**n)
ax.set_xticklabels(labels, rotation=45)
ax.set_yticklabels(labels)
cbar = fig.colorbar(pc, ax=ax, ticks=[i/n for i in range(n+1)])
cbar.ax.set_ylabel('similarity', fontsize=14)
ax.set_aspect('equal', adjustable='box')
plt.tight_layout()
plt.show()
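The code above comes without commentary, so here is a small check of what its two helpers return, without the plotting. It is a plain-Python restatement (NumPy's `isclose` is swapped for a direct `==` comparison, which is equivalent for 0/1 integers), intended only as a sketch of the idea: each integer from 0 to 2**n - 1 stands for one n-bit list, and the similarity is the fraction of matching bits.

```python
def unpackbits(a, n):
    """Unpack integer a into an n-length list of bits, most significant first."""
    return [a >> i & 1 for i in range(n - 1, -1, -1)]

def similarity(a, b, n):
    """Fraction of matching bits between the n-bit forms of a and b."""
    a_bits = unpackbits(a, n)
    b_bits = unpackbits(b, n)
    return sum(x == y for x, y in zip(a_bits, b_bits)) / n

print(unpackbits(5, 3))     # [1, 0, 1]
print(unpackbits(2, 3))     # [0, 1, 0]
print(similarity(5, 2, 3))  # 0.0
print(similarity(5, 7, 3))  # 0.6666666666666666
print(similarity(0, 0, 3))  # 1.0
```

The grid in the plot is this `similarity` value evaluated for every pair of 3-bit lists, so the two pairs from the question appear as the cells for (5, 2) and (0, 0).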
