Better variable naming in Darknet models. #1297

sebastian-sz · 2023-01-21T09:16:54Z

Currently when we inspect Darknet variables:

from keras_cv.models import DarkNet51
m = DarkNet51(include_rescaling=False, include_top=True, classes=1000, input_shape=(224, 224, 3))
for v in m.variables:
    print(v.name)

we will get very unclear information about the variables:

batch_normalization_56/moving_variance:0
conv2d_57/kernel:0
batch_normalization_57/gamma:0

essentially a flat list of convs + BN. This makes inspecting variables or transfering weights very difficult if not impossible.

The problem:

This will propagate on all Darknet based futere implementations: YoloV7 YoloV8, YoloX etc.

The fix:

In Darknet Conv Block the name variable is unused. One could change:
from

    model_layers = [  # line 50
        layers.Conv2D(
            filters,
            kernel_size,
            strides,
            padding="same",
            use_bias=use_bias,
        ),
        layers.BatchNormalization(),
    ]

to

    model_layers = [
        layers.Conv2D(
            filters,
            kernel_size,
            strides,
            padding="same",
            use_bias=use_bias,
            name=name+"_conv"
        ),
        layers.BatchNormalization(name=name+"_bn"),
    ]

After fix, the weights seem to be loading without issue on my machine and the variables have beautiful meaningful names:

dark5_conv4_conv/kernel:0
dark5_conv4_bn/gamma:0
dark5_conv4_bn/beta:0
dark5_conv4_bn/moving_mean:0
dark5_conv4_bn/moving_variance:0

The text was updated successfully, but these errors were encountered:

LukeWood · 2023-01-21T20:24:19Z

Thanks Sebastian! @quantumalaviya for more comments.

feel free to raise a PR addressing this

quantumalaviya · 2023-01-21T20:48:23Z

Sure, we can do this. Thanks!

This makes inspecting variables or transfering weights very difficult if not impossible.

When you say "transferring weights" here, does it mean model.load_weights() is difficult or did you mean something else?

sebastian-sz · 2023-01-21T21:04:42Z

@quantumalaviya Thanks!

does it mean model.load_weights()

Not really. Recently I played around with some Yolo implementations in Tensorflow and wanted to reuse existing darknet weights from this repo, but I gave up after noticing that it's a flat list of mostly integer names (with batchnorms appearing mostly before convs).

I was thinking that if we have this option to add names to layers / variables it is much more readable and allows such weights reuse.

quantumalaviya · 2023-01-21T21:25:38Z

Ah, got it.
I will update #1296 keeping this in mind, thanks!

sebastian-sz mentioned this issue Jan 21, 2023

[YOLOX] Step 2/? : Setting up YoloX structure, add internal layers and update iou losses #1296

Merged

5 tasks

sebastian-sz mentioned this issue Jan 21, 2023

Added missing name to DarknetConvBlock variables. #1298

Merged

5 tasks

LukeWood closed this as completed in #1298 Jan 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Better variable naming in Darknet models. #1297

Better variable naming in Darknet models. #1297

sebastian-sz commented Jan 21, 2023

LukeWood commented Jan 21, 2023

Uh oh!

quantumalaviya commented Jan 21, 2023

Uh oh!

sebastian-sz commented Jan 21, 2023

Uh oh!

quantumalaviya commented Jan 21, 2023

Uh oh!

Better variable naming in Darknet models. #1297

Better variable naming in Darknet models. #1297

Comments

sebastian-sz commented Jan 21, 2023

The problem:

The fix:

LukeWood commented Jan 21, 2023

Uh oh!

quantumalaviya commented Jan 21, 2023

Uh oh!

sebastian-sz commented Jan 21, 2023

Uh oh!

quantumalaviya commented Jan 21, 2023

Uh oh!