我正在尝试用word2vec和numpy对一些文档进行集群。在
w2v = W2VTransformer()
X_train = w2v.fit_transform(X_train)
当我运行fit或fit_变换时,我得到以下错误:
Exception in thread Thread-8: Traceback (most recent call last): File "C:\Users\lperona\AppData\Local\Continuum\anaconda3\lib\threading.py", line 916, in _bootstrap_inner self.run() File "C:\Users\lperona\AppData\Local\Continuum\anaconda3\lib\threading.py", line 864, in run self._target(*self._args, **self._kwargs) File "C:\Users\lperona\AppData\Local\Continuum\anaconda3\lib\site-packages\gensim\models\base_any2vec.py", line 99, in _worker_loop tally, raw_tally = self._do_train_job(data_iterable, job_parameters, thread_private_mem) File "C:\Users\lperona\AppData\Local\Continuum\anaconda3\lib\site-packages\gensim\models\word2vec.py", line 539, in _do_train_job tally += train_batch_cbow(self, sentences, alpha, work, neu1, self.compute_loss) File "gensim/models/word2vec_inner.pyx", line 458, in gensim.models.word2vec_inner.train_batch_cbow ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()
(X_train是一个2D numpy字符串数组)
有人知道解决办法吗? 谢谢你
在使用numpy数组时,gensim的word2vec似乎有一些问题。把数据转换成python列表对我有帮助。在
相关问题 更多 >
编程相关推荐