ValueError:`logits`和`labels`必須具有相同的形狀，收到((None,10)vs(None,1))-有解無憂

我正在運行一個對合模型（基于此示例），并且在訓練階段我經常遇到錯誤。這是我的錯誤：

ValueError: `logits` and `labels` must have the same shape, received ((None, 10) vs (None, 1)).

以下是資料集加載的相關代碼：

    train_datagen = ImageDataGenerator(
        rescale=1./255,
        shear_range=0.2,
        zoom_range=0.2,
        horizontal_flip=True)
    test_datagen = ImageDataGenerator(rescale=1./255)
    train_ds = train_datagen.flow_from_directory(
        'data/train',
        target_size=(150, 150),
        batch_size=128,
        class_mode='binary')
    test_ds = test_datagen.flow_from_directory(
        'data/test',
        target_size=(150, 150),
        batch_size=64,
        class_mode='binary')`

這是訓練的代碼：

    print("building the involution model...")

    inputs = keras.Input(shape=(224, 224, 3))
    x, _ = Involution(channel=3, group_number=1, kernel_size=3, stride=1, reduction_ratio=2, name="inv_1")(inputs)
    x = keras.layers.ReLU()(x)
    x = keras.layers.MaxPooling2D((2, 2))(x)
    x, _ = Involution(
    channel=3, group_number=1, kernel_size=3, stride=1, reduction_ratio=2, name="inv_2")(x)
    x = keras.layers.ReLU()(x)
    x = keras.layers.MaxPooling2D((2, 2))(x)
    x, _ = Involution(
    channel=3, group_number=1, kernel_size=3, stride=1, reduction_ratio=2, name="inv_3")(x)
    x = keras.layers.ReLU()(x)
    x = keras.layers.Flatten()(x)
    x = keras.layers.Dense(64, activation="relu")(x)
    outputs = keras.layers.Dense(10)(x)

    inv_model = keras.Model(inputs=[inputs], outputs=[outputs], name="inv_model")

    print("compiling the involution model...")
    inv_model.compile(
    optimizer="adam",
    loss=keras.losses.BinaryCrossentropy(from_logits=True),
    metrics=["accuracy"],
    )

    print("inv model training...")
    inv_hist = inv_model.fit(train_ds, epochs=20, validation_data=test_ds)`

模型本身與 Keras 使用的相同，除了使用我自己的資料集而不是 CIFAR 資料集（模型適用于這個資料集）之外，我沒有進行任何更改。所以我確信我的資料加載有錯誤，但我無法確定那是什么。

型號總結：

Model: "inv_model"
_________________________________________________________________
 Layer (type)                Output Shape              Param #   
=================================================================
 input_14 (InputLayer)       [(None, 224, 224, 3)]     0         
                                                                 
 inv_1 (Involution)          ((None, 224, 224, 3),     26        
                              (None, 224, 224, 9, 1,             
                             1))                                 
                                                                 
 re_lu_39 (ReLU)             (None, 224, 224, 3)       0         
                                                                 
 max_pooling2d_26 (MaxPoolin  (None, 112, 112, 3)      0         
 g2D)                                                            
                                                                 
 inv_2 (Involution)          ((None, 112, 112, 3),     26        
                              (None, 112, 112, 9, 1,             
                             1))                                 
                                                                 
 re_lu_40 (ReLU)             (None, 112, 112, 3)       0         
                                                                 
 max_pooling2d_27 (MaxPoolin  (None, 56, 56, 3)        0         
 g2D)                                                            
                                                                 
 inv_3 (Involution)          ((None, 56, 56, 3),       26        
                              (None, 56, 56, 9, 1, 1)            
                             )                                   
                                                                 
 re_lu_41 (ReLU)             (None, 56, 56, 3)         0         
                                                                 
 flatten_15 (Flatten)        (None, 9408)              0         
                                                                 
 dense_26 (Dense)            (None, 64)                602176    
                                                                 
 dense_27 (Dense)            (None, 10)                650       
                                                                 
=================================================================

uj5u.com熱心網友回復：

當您呼叫該train_datagen.flow_from_directory()函式時，您使用class_mode='binary'了這意味著您的影像標簽僅為 0 和 1，而您總共有 10 個預測，即最終輸出層中有 10 個神經元。因此標簽和 logits 不匹配。解決方案：使用class_mode='categorical'which 意味著標簽的數量與類的數量一樣多。在 test_datagen 中也做同樣的事情。

轉載請註明出處，本文鏈接：https://www.uj5u.com/gongcheng/524104.html

標籤：张量流喀拉斯深度学习神经网络卷积神经网络

上一篇：如何在Tensorflow張量中增加某些值？

下一篇：tensorflowDepthwiseConv2D中的depth_multiplier如何作業？