将scipy.sparse.csr.csr_矩阵转换为列表列表

1条回答

网友

1楼 · 发布于 2024-07-05 12:31:14

我不知道tf-idf需要什么，但我可能可以帮助稀疏的结束。

生成稀疏矩阵：

In [526]: M=sparse.random(4,10,.1)
In [527]: M
Out[527]: 
<4x10 sparse matrix of type '<class 'numpy.float64'>'
    with 4 stored elements in COOrdinate format>
In [528]: print(M)
  (3, 1)    0.281301619779
  (2, 6)    0.830780358032
  (1, 1)    0.242503399296
  (2, 2)    0.190933579917

现在将其转换为coo格式。这已经是（我可以给random一个格式参数）。在任何情况下，coo格式的值都存储在3个数组中：

In [529]: Mc=M.tocoo()
In [530]: Mc.data
Out[530]: array([ 0.28130162,  0.83078036,  0.2425034 ,  0.19093358])
In [532]: Mc.row
Out[532]: array([3, 2, 1, 2], dtype=int32)
In [533]: Mc.col
Out[533]: array([1, 6, 1, 2], dtype=int32)

看起来你想忽略Mc.row，并以某种方式加入其他人。

例如作为字典：

In [534]: {k:v for k,v in zip(Mc.col, Mc.data)}
Out[534]: {1: 0.24250339929583264, 2: 0.19093357991697379, 6: 0.83078035803205375}

或二维数组中的列：

In [535]: np.column_stack((Mc.col, Mc.data))
Out[535]: 
array([[ 1.        ,  0.28130162],
       [ 6.        ,  0.83078036],
       [ 1.        ,  0.2425034 ],
       [ 2.        ,  0.19093358]])

（也是np.array((Mc.col, Mc.data)).T）

或者只是数组列表[Mc.col, Mc.data]，或者列表列表[Mc.col.tolist(), Mc.data.tolist()]，等等

你能从那里拿走吗？

相关问题更多 >

编程相关推荐

热门问题

热门文章

将scipy.sparse.csr.csr_矩阵转换为列表列表

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >