拟合方法gensim.sklearn_api接口.w2vmodel.W2VTransformer在输入字符串的二维数组时引发错误

2024-10-02 22:35:30 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在尝试用word2vec和numpy对一些文档进行集群。在

w2v = W2VTransformer() X_train = w2v.fit_transform(X_train)

当我运行fit或fit_变换时,我得到以下错误:

Exception in thread Thread-8: Traceback (most recent call last): File "C:\Users\lperona\AppData\Local\Continuum\anaconda3\lib\threading.py", line 916, in _bootstrap_inner self.run() File "C:\Users\lperona\AppData\Local\Continuum\anaconda3\lib\threading.py", line 864, in run self._target(*self._args, **self._kwargs) File "C:\Users\lperona\AppData\Local\Continuum\anaconda3\lib\site-packages\gensim\models\base_any2vec.py", line 99, in _worker_loop tally, raw_tally = self._do_train_job(data_iterable, job_parameters, thread_private_mem) File "C:\Users\lperona\AppData\Local\Continuum\anaconda3\lib\site-packages\gensim\models\word2vec.py", line 539, in _do_train_job tally += train_batch_cbow(self, sentences, alpha, work, neu1, self.compute_loss) File "gensim/models/word2vec_inner.pyx", line 458, in gensim.models.word2vec_inner.train_batch_cbow ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()

(X_train是一个2D numpy字符串数组)

有人知道解决办法吗? 谢谢你


Tags: inpyselfliblocallinetrainword2vec