如何正确重塑神经网络图像

如何解决如何正确重塑神经网络图像

我训练了我的NN模型并使用下一个代码将其保存：

from tensorflow import keras
import tensorflow.keras.layers as layers
import cv2
import numpy as np

inputs = keras.Input(shape=(784,),name='img')

x = layers.Dense(64,activation='relu')(inputs)
x = layers.Dense(64,activation='relu')(x)
outputs = layers.Dense(10,activation='softmax')(x)

model = keras.Model(inputs=inputs,outputs=outputs,name='mnist_model')
# model.summary()

(x_train,y_train),(x_test,y_test) = keras.datasets.mnist.load_data()
x_train = x_train.reshape(60000,784).astype('float32') / 255
x_test = x_test.reshape(10000,784).astype('float32') / 255

model.compile(loss='sparse_categorical_crossentropy',optimizer=keras.optimizers.RMSprop(),metrics=['accuracy'])
history = model.fit(x_train,y_train,batch_size=64,epochs=5,validation_split=0.2)
test_scores = model.evaluate(x_test,y_test,verbose=2)
model.save("myFirstModel.model")
# print('Test loss:',test_scores[0])
# print('Test accuracy:',test_scores[1])
# print("-"*20)

因此您可以看到训练图像的大小为28x28像素。但是，当我加载训练有素的模型并尝试预测自手机拍摄的自定义图片（1620x2160像素）上的值时，会返回一个错误：

model = keras.models.load_model('myFirstModel.model')
model.summary()

img = cv2.imread('C:/Users/Horseman.mini/Desktop/7.jpg',0) # it is a picture of number 7
# img = cv2.resize(img,(28,28)) # if I use this - error 1 occures
img = np.resize(img,(-1,28,1)) # if I use this - error 2 occures
img_array = np.asarray(img)
Xnew = np.array([img_array])
# make a prediction
ynew = model.predict(Xnew)
result = np.argmax(ynew,axis=1)
print("predicted : ",result)

error 1

ValueError: Input 0 of layer dense is incompatible with the layer: expected axis -1 of input shape to have value 784 but received input with shape [None,28]

error 2

ValueError: cannot reshape array of size 3498416 into shape (28,1)

中子

如何正确重塑图像以使该模型正常工作而不会出现这种错误。
对于w x h大小不同的图像（不仅是大小为28x28像素的图像），我该如何预测数字（在这种情况下）？

编辑1

试图将w和h更改为784，现在没有以前的错误，除了一个新错误，可能下一个是由于该错误。但是它返回了我，我的7.jpg数是2。我更改为0.jpg（即0），并且预测结果仍然是2。所以输出由于某种原因是错误的。 >

model = keras.models.load_model('myFirstModel.model')
model.summary()

img = cv2.imread('C:/Users/Horseman.mini/Desktop/7.jpg',0)
img = cv2.resize(img,(784,784))

img_array = np.asarray(img)

Xnew = np.array([img_array])
ynew = model.predict(Xnew)
result = np.argmax(ynew)
print("predicted : ",result) # always result is 2

output

WARNING:tensorflow:Model was constructed with shape (None,784) for input Tensor("img:0",shape=(None,784),dtype=float32),but it was called on an input with incompatible shape (None,784,784).
predicted :  2

解决方法

在模型中，您指定了input_shape = 784，因此您需要预测的图像尺寸也为784而不是28 X 28，因此将其调整为784。对模型进行训练的尺寸为784像素的图像因此，无论原始图像的尺寸如何，您都必须在使用前将其重塑为784像素。只是对未来的警告。您正在使用cv2读取图像，但是由于它们是灰度图像，所以可以。但是，如果它们是彩色图像，请记住cv2会将图像读取为bgr图像而不是rgb。因此，如果您对rgb图像进行了训练，但在bgr图像中进行了预测，则无法获得准确的结果。您必须使用

将图像转换为RGB

img = cv2.cvtColor(img,cv2.COLOR_BGR2RGB)

最近我自己犯了这个错误，很容易忘记转换图像

如何正确重塑神经网络图像

如何解决如何正确重塑神经网络图像

解决方法

相关推荐