值错误:输入数组的样本数应与目标数组相同。找到166个输入样本和4个目标样本

2024-09-28 16:59:34 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在和Keras DL库合作对图像数据集进行分类。我在训练模型时遇到了一个错误。在

我正在处理的数据集没有大量的数据,因此训练集包含166个图像。我不确定这个错误,但我想无论如何我必须改变标签集的形状来修复它。代码如下:

import tensorflow as tf
import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout, Activation, Flatten
from tensorflow.keras.layers import Conv2D, MaxPooling2D


DIR = '/home/.../'
IMG_H = 256
IMG_W = 256
IMG_CH = 1


loadFile = DIR + 'Img.npz'
X = np.load(loadFile)
trainImgSet = X['trainImgSet']
trainLabelSet = X['trainLabelSet']
testImgSet = X['testImgSet']

print('Shape of trainImgSet: {}'.format(trainImgSet.shape))
print('Shape of trainLabelSet: {}'.format(trainLabelSet))
#print('Shape of testImgSet:{}'.format(testImgSet))


model = tf.keras.Sequential()
model.add(tf.keras.layers.Conv2D(256, (3, 3), input_shape = (IMG_H, IMG_W, IMG_CH)))
model.add(tf.keras.layers.Activation('relu'))
model.add(tf.keras.layers.MaxPooling2D(pool_size=(1, 1)))

model.add(tf.keras.layers.Conv2D(256, (3, 3)))
model.add(tf.keras.layers.Activation('relu'))
model.add(tf.keras.layers.MaxPooling2D(pool_size=(1, 1)))

model.add(tf.keras.layers.Flatten())

model.add(tf.keras.layers.Dense(64))

model.add(tf.keras.layers.Dense(1))
model.add(tf.keras.layers.Activation('sigmoid'))

model.compile(loss='binary_crossentropy', 
                optimizer='adam',
                metrics=['accuracy'])

model.summary()

#train the CNN
model.fit(trainImgSet, trainLabelSet, batch_size=10, epochs=5, validation_split=0.1)



Here is the error:
Traceback (most recent call last):
  File "/home/Code/DeepCl.py", line 49, in <module>
    model.fit(trainImgSet, trainLabelSet, batch_size=10, epochs=5, validation_split=0.1)
  File "anaconda3/lib/python3.6/site-packages/tensorflow/python/keras/engine/training.py", line 1536, in fit
    validation_split=validation_split)
  File "/anaconda3/lib/python3.6/site-packages/tensorflow/python/keras/engine/training.py", line 992, in _standardize_user_data
    class_weight, batch_size)
  File "/anaconda3/lib/python3.6/site-packages/tensorflow/python/keras/engine/training.py", line 1169, in _standardize_weights
    training_utils.check_array_lengths(x, y, sample_weights)
  File "/anaconda3/lib/python3.6/site-packages/tensorflow/python/keras/engine/training_utils.py", line 426, in check_array_lengths
    'and ' + str(list(set_y)[0]) + ' target samples.')
ValueError: Input arrays should have the same number of samples as target arrays. Found 166 input samples and 4 target samples.

Tags: inpyimportaddimgsizemodellayers
2条回答

我打电话的时候发生了这样的事:

X_train, y_train, X_test, y_test = sklearn.model_selection.train_test_split(...)

而不是

^{pr2}$

(请注意,第二个和第三个变量在LHS上猛击。)

这导致我的数据和标签长度不等:

即len(X_列车)!=len(y_序列)和len(X_测试)!=len(y_检验)。在

在这里

  • 训练样本数不等于标签数。

  • 有144个训练样本,但只有4个标签。

  • 训练和测试数据的形状必须具有相同数量的样本。

  • 例如,训练数据具有形状( 100 , 256 , 256 , 1 )。测试数据的形状应为( 100 , 1 )

相关问题 更多 >