Converting a 2D sparse matrix to a 3D matrix



<p>I want to convert a 2D sparse matrix to a 3D matrix because I need to feed it to a Conv1D layer, which expects a 3D tensor.</p>
<p>Here is the input for the Conv1D layer:</p>
<pre><code>from scipy.sparse import hstack
other_features_train = hstack((X_train_state_ohe, X_train_teacher_ohe, X_train_grade_ohe, X_train_category_ohe, X_train_subcategory_ohe,X_train_price_norm,X_train_number_norm))
other_features_cv = hstack((X_cv_state_ohe, X_cv_teacher_ohe, X_cv_grade_ohe,X_cv_category_ohe,X_cv_subcategory_ohe,X_cv_price_norm,X_cv_number_norm))
other_features_test = hstack((X_test_state_ohe, X_test_teacher_ohe, X_test_grade_ohe,X_test_category_ohe,X_test_subcategory_ohe,X_test_price_norm,X_test_number_norm))

print(other_features_train.shape)
print(other_features_cv.shape)
print(other_features_test.shape)
</code></pre>
<p>Shapes of the train, cv, and test data:</p>
<pre><code>(49041, 101)
(24155, 101)
(36052, 101)
</code></pre>
<p>This is my model architecture:</p>
<pre><code>tf.keras.backend.clear_session()

vec_size = 300

input_model_1 = Input(shape=(300,),name='essay')
embedding = Embedding(vocab_size_essay, vec_size, weights=[word_vector_matrix], input_length = max_length, trainable=False)(input_model_1)
lstm = LSTM(16)(embedding)
flatten_1 = Flatten()(lstm)

input_model_2 = Input(shape=(101, ),name='other_features')
conv_layer1 = Conv1D(32, 3, strides=1, padding='valid', kernel_initializer='glorot_uniform', activation='relu')(input_model_2)
conv_layer2 = Conv1D(32, 3, strides=1, padding='valid', kernel_initializer='glorot_uniform', activation='relu')(conv_layer1)
conv_layer3 = Conv1D(32, 3, strides=1, padding='valid', kernel_initializer='glorot_uniform', activation='relu')(conv_layer2)
flatten_2 = Flatten()(conv_layer3)

concat_layer = concatenate(inputs=[flatten_1, flatten_2],name='concat')

dense_layer_1 = Dense(units=32, activation='relu', kernel_initializer='he_normal', name='dense_layer_1')(concat_layer)
dropout_1 = Dropout(0.2)(dense_layer_1)

dense_layer_2 = Dense(units=32, activation='relu', kernel_initializer='he_normal', name='dense_layer_2')(dropout_1)
dropout_2 = Dropout(0.2)(dense_layer_2)

dense_layer_3 = Dense(units=32, activation='relu', kernel_initializer='he_normal', name='dense_layer_3')(dropout_2)

output = Dense(units=2, activation='softmax', kernel_initializer='glorot_uniform', name='output')(dense_layer_3)

model_3 = Model(inputs=[input_model_1,input_model_2],outputs=output)
</code></pre>
<p>When I try to pass a 2D array, I get this error:</p>
<pre><code>---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
&lt;ipython-input-18-44c8f6f0caa7&gt; in &lt;module&gt;
      9
     10 input_model_2 = Input(shape=(101, ),name='other_features')
---&gt; 11 conv_layer1 = Conv1D(32, 3, strides=1, padding='valid', kernel_initializer='glorot_uniform', activation='relu')(input_model_2)
     12 conv_layer2 = Conv1D(32, 3, strides=1, padding='valid', kernel_initializer='glorot_uniform', activation='relu')(conv_layer1)
     13 conv_layer3 = Conv1D(32, 3, strides=1, padding='valid', kernel_initializer='glorot_uniform', activation='relu')(conv_layer2)

~\AppData\Local\Programs\Python\Python37\lib\site-packages\tensorflow_core\python\keras\engine\base_layer.py in __call__(self, inputs, *args, **kwargs)
    810         # are casted, not before.
    811         input_spec.assert_input_compatibility(self.input_spec, inputs,
--&gt; 812                                               self.name)
    813       graph = backend.get_graph()
    814       with graph.as_default(), backend.name_scope(self._name_scope()):

~\AppData\Local\Programs\Python\Python37\lib\site-packages\tensorflow_core\python\keras\engine\input_spec.py in assert_input_compatibility(input_spec, inputs, layer_name)
    175                          'expected ndim=' + str(spec.ndim) + ', found ndim=' +
    176                          str(ndim) + '. Full shape received: ' +
--&gt; 177                          str(x.shape.as_list()))
    178     if spec.max_ndim is not None:
    179       ndim = x.shape.ndims

ValueError: Input 0 of layer conv1d is incompatible with the layer: expected ndim=3, found ndim=2. Full shape received: [None, 101]

model_3.summary()
model_3.compile(loss = "binary_crossentropy", optimizer=Adam()
</code></pre>
<p>Compiling the model:</p>
<pre><code>model_3.compile(loss = "binary_crossentropy", optimizer=Adam(), metrics=["accuracy"])
</code></pre>
<p>Fitting the model:</p>
<pre><code>model_3.fit(train_features,y_train_ohe,batch_size=16,epochs=10,validation_data=(cv_features,y_cv_ohe))

train_features = [train_text, other_features_train]
cv_features = [cv_text, other_features_cv]
test_featues = [test_text, other_features_test]
</code></pre>
<p>Text features:</p>
<pre><code>train_text = X_train['essay'].tolist()
cv_text = X_cv['essay'].tolist()
test_text = X_test['essay'].tolist()

token = Tokenizer()
token.fit_on_texts(train_text)

vocab_size_essay = len(token.word_index) + 1
print("No. of unique words = ", vocab_size_essay)

encoded_train_text = token.texts_to_sequences(train_text)
encoded_cv_text = token.texts_to_sequences(cv_text)
encoded_test_text = token.texts_to_sequences(test_text)

#print(encoded_test_text[:5])

max_length = 300

train_text = pad_sequences(encoded_train_text, maxlen=max_length, padding='post')
cv_text = pad_sequences(encoded_cv_text, maxlen=max_length, padding='post')
test_text = pad_sequences(encoded_test_text, maxlen=max_length, padding='post')

print("\n")
print(train_text.shape)
print(cv_text.shape)
print(test_text.shape)
</code></pre>
<p>Shapes of the text features:</p>
<pre><code>No. of unique words = 41468

(49041, 300)
(24155, 300)
(36052, 300)
</code></pre>
<p>So I want the other features in these shapes:</p>
<pre><code>(49041,101,1)
(24155,101,1)
(36052,101,1)
</code></pre>
<p>Please suggest how to do this.</p>


1 Answer


<h2>Solution</h2>
<p>The solution requires being clear about a few concepts, which I explain in the sections below.</p>
<ul>
<li>What <code>Conv1D</code> expects as input</li>
<li>What modifications can be made to a <code>keras</code> model to allow sparse input matrices</li>
<li>Converting a 2D <code>numpy</code> array to a 3D <code>numpy</code> array</li>
<li>Converting back and forth between sparse and non-sparse (dense) arrays using
<ul>
<li><code>scipy.sparse.coo_matrix</code> for 2D <code>numpy</code> arrays</li>
<li><code>sparse.COO</code> for 3D <code>numpy</code> arrays</li>
</ul>
</li>
</ul>
<h2>Using sparse matrices as input to <code>tf.keras</code> models</h2>
<ul>
<li><p>One option is to convert the sparse input matrix to non-sparse (dense) format with the <code>toarray()</code> method, which returns a regular <code>numpy</code> array. (<code>todense()</code> also densifies the matrix, but for <code>scipy.sparse</code> matrices it returns a <code>numpy.matrix</code> rather than an <code>ndarray</code>.) See the kaggle discussions <a href="https://www.kaggle.com/c/talkingdata-mobile-user-demographics/discussion/22567" rel="nofollow noreferrer">[3]</a> and <a href="https://www.kaggle.com/c/walmart-recruiting-trip-type-classification/discussion/18154" rel="nofollow noreferrer">[4]</a>.</p>
</li>
<li><p>Another option is to write your own custom layer that handles both sparse and dense inputs by subclassing the <code>tf.keras.layers.Layer</code> class. See this article <a href="https://medium.com/dailymotion/how-to-design-deep-learning-models-with-sparse-inputs-in-tensorflow-keras-fd5e754abec1" rel="nofollow noreferrer">[2]</a>.</p>
</li>
<li><p>It looks like <code>tensorflow.keras</code> now allows model training with sparse weights, so it has some ability to handle sparsity. You may want to look through the documentation <a href="https://www.tensorflow.org/model_optimization/guide/pruning/train_sparse_models" rel="nofollow noreferrer">[1]</a> on this.</p>
</li>
</ul>
<h2>Adding a new axis to a numpy array</h2>
<p>You can add another axis to a numpy array with <code>np.newaxis</code>, as follows:</p>
<pre class="lang-py prettyprint-override"><code>import numpy as np

## Make a 2D array
a2D = np.zeros((10,10))

# Make a few elements non-zero in a2D
aa = a2D.flatten()
aa[[0,13,41,87,98]] = np.random.randint(1,10,size=5)
a2D = aa.reshape(a2D.shape)

# Make 3D array from 2D array by adding another axis
a3D = a2D[:,:,np.newaxis]

#print(a2D)
print('a2D.shape: {}\na3D.shape: {}'.format(a2D.shape, a3D.shape))
</code></pre>
<p><strong>Output</strong>:</p>
<pre><code>a2D.shape: (10, 10)
a3D.shape: (10, 10, 1)
</code></pre>
<p>That said, take a look at the links in the References section.</p>
<h2>Sparse arrays</h2>
<p>Because a sparse array has very few non-zero values, a regular numpy array that is converted to a sparse array is stored in one of several sparse formats:</p>
<ul>
<li><code>csr_matrix</code>: row-wise arrays of non-zero values and indices</li>
<li><code>csc_matrix</code>: column-wise arrays of non-zero values and indices</li>
<li><code>coo_matrix</code>: a table with three columns
<ul>
<li>row</li>
<li>column</li>
<li>non-zero value</li>
</ul>
</li>
</ul>
<p><strong>Scipy sparse matrices require 2D input matrices</strong></p>
<p>However, the <code>scipy.sparse</code> implementations of the three formats above only accept a 2D non-sparse matrix as input.</p>
<pre class="lang-py prettyprint-override"><code># Reuses np and a2D from the previous snippet
from scipy.sparse import csr_matrix, coo_matrix

coo_a2D = coo_matrix(a2D)
coo_a2D.shape # output: (10, 10)

# scipy.sparse matrices only accept 2D input
# the following line will throw an !!! ERROR !!!
coo_a3D = coo_matrix(coo_a2D.toarray()[:,:,np.newaxis])
</code></pre>
<h3>Sparse matrix from a 3D non-sparse input matrix</h3>
<p>Yes, you can do this with the <a href="https://github.com/pydata/sparse" rel="nofollow noreferrer"><code>sparse</code></a> library. It also supports <code>scipy.sparse</code> and <code>numpy</code> arrays. To convert a sparse matrix to non-sparse (dense) format (<em>this is not the dense layer of a neural network</em>), use the <code>todense()</code> method.</p>
<pre class="lang-py prettyprint-override"><code>## Installation
# pip install -U sparse

import sparse

## Create sparse coo_matrix from a
# 3D numpy array (dense format)
coo_a3D = sparse.COO(a3D)

## Test that
# coo_a3D == coo made from (coo_a2D + newaxis)
print(
    (coo_a3D == sparse.COO(coo_a2D.toarray()[:,:,np.newaxis])).all()
) # output: True

## Convert to dense (non-sparse) format
# use: coo_a3D.todense()
print((a3D == coo_a3D.todense()).all()) # output: True
</code></pre>
<p><a href="https://i.stack.imgur.com/VXYgK.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/VXYgK.png" alt="scipy.sparse.coo_matrix vs. sparse.COO"/></a></p>
<p><a href="https://sparse.pydata.org/en/latest/" rel="nofollow noreferrer">Source</a></p>
<h3>PyTorch: <code>torch.sparse</code> 🔥 ⭐</h3>
<p>The PyTorch library also provides ways to work with sparse tensors.</p>
<ul>
<li><p>Documentation for <code>torch.sparse</code>: <a href="https://pytorch.org/docs/stable/sparse.html#sparse-coo-docs" rel="nofollow noreferrer">https://pytorch.org/docs/stable/sparse.html#sparse-coo-docs</a></p>
<p><a href="https://pytorch.org/docs/stable/sparse.html#sparse-coo-docs" rel="nofollow noreferrer"><img src="https://i.imgur.com/e3GSn4O.png" alt="warning: torch.sparse"/></a></p>
</li>
</ul>
<h2>References</h2>
<ol>
<li><p><a href="https://www.tensorflow.org/model_optimization/guide/pruning/train_sparse_models" rel="nofollow noreferrer">Train sparse TensorFlow models with Keras</a></p>
</li>
<li><p><a href="https://medium.com/dailymotion/how-to-design-deep-learning-models-with-sparse-inputs-in-tensorflow-keras-fd5e754abec1" rel="nofollow noreferrer">How to design deep learning models with sparse inputs in Tensorflow Keras</a></p>
</li>
<li><p><a href="https://www.kaggle.com/c/talkingdata-mobile-user-demographics/discussion/22567" rel="nofollow noreferrer">Neural network for sparse matrices</a></p>
</li>
<li><p><a href="https://www.kaggle.com/c/walmart-recruiting-trip-type-classification/discussion/18154" rel="nofollow noreferrer">Training Neural network with scipy sparse matrix?</a></p>
</li>
<li><p><a href="https://sparse.pydata.org/en/latest/" rel="nofollow noreferrer">Documentation of the <code>sparse</code> library</a></p>
</li>
</ol>
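
Putting the dense-conversion and new-axis steps above together for the shapes in the question, a minimal sketch might look like the following. The two random blocks and the helper name to_conv1d_input are hypothetical stand-ins (they are not from the original post), and the sketch assumes the stacked feature matrix is small enough to hold in memory once densified.

```python
import numpy as np
from scipy.sparse import hstack, random as sparse_random

# Hypothetical stand-ins for the question's one-hot and normalized feature blocks.
block_a = sparse_random(8, 99, density=0.05, format="csr", random_state=0)
block_b = sparse_random(8, 2, density=0.5, format="csr", random_state=1)

# Same stacking step as in the question; the result is a 2D scipy sparse matrix.
other_features = hstack((block_a, block_b))


def to_conv1d_input(sparse_2d):
    """Densify a 2D scipy sparse matrix and append a channel axis of size 1."""
    dense_2d = sparse_2d.toarray()        # ndarray of shape (samples, features)
    return dense_2d[:, :, np.newaxis]     # shape (samples, features, 1)


other_features_3d = to_conv1d_input(other_features)
print(other_features.shape, other_features_3d.shape)
```

With arrays of this shape, the second model input would presumably be declared as Input(shape=(101, 1), name='other_features') so that Conv1D receives the 3D tensor it expects; the rest of the model in the question is assumed unchanged.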
