[RFC] Registration mechanism for models #6330

datumbox · 2022-07-28T18:09:43Z

Note: This PR is meant to document the RFC with code examples and it's not mean to be merged. Some tests are failing because they are not compatible with the mechanism. This will be resolved on the follow up PRs that will actually introduce the mechanism to TorchVision.

Objective

Create a model registration mechanism responsible for:

Listing the names of all available models under a specific module
Getting a model (initializing) from its name
Getting the weight enum of the model from its name or model builder
Registering models under a specific name

Background

As discussed at #5088, a model registration mechanism is needed to be able to quickly list the available modules.

Currently we use hacky workarounds in order to list our models:

vision/test/test_models.py

Lines 24 to 30 in 6ca9c76

    
           def get_models_from_module(module): 
        
               # TODO add a registration mechanism to torchvision.models 
        
               return [ 
        
                   v 
        
                   for k, v in module.__dict__.items() 
        
                   if callable(v) and k[0].lower() == k[0] and k[0] != "_" and k != "get_weight" 
        
               ]

The same applies for initializing models from their names:

vision/references/classification/train.py

Lines 223 to 225 in e288f6c

    
           print("Creating model") 
        
           model = torchvision.models.__dict__[args.model](weights=args.weights, num_classes=num_classes) 
        
           model.to(device)

vision/torchvision/models/detection/backbone_utils.py

Line 112 in 6ca9c76

    
           backbone = resnet.__dict__[backbone_name](weights=weights, norm_layer=norm_layer)

Overview

Offer a mechanism that stores all models on a single private global dictionary along with public getter/listing methods located under torchvision.models that will allow users to interact with the mechanism. This proposal aligns closely with the registration mechanism proposed at the Datasets V2 API, offering a similar user experience.

List models

Listing all available models:

>>> torchvision.models.list_models()
['alexnet', 'mobilenet_v3_large', 'mobilenet_v3_small', 'quantized_mobilenet_v3_large', ...]

Listing available models from a specific module:

>>> torchvision.models.list_models(module=torchvision.models)
['alexnet', 'mobilenet_v3_large', 'mobilenet_v3_small', ...]
>>> torchvision.models.list_models(module=torchvision.models.quantization)
['quantized_mobilenet_v3_large', ...]

Get models

Getting an initialized model with pre-trained weights from its name:

>>> torchvision.models.get_model("quantized_mobilenet_v3_large", weights="DEFAULT")
QuantizableMobileNetV3(
  (features): Sequential(
   ....
   )
)

Get weights

Getting the Weight Enum class of a model from its name:

>>> torchvision.models.get_model_weights("quantized_mobilenet_v3_large")
<enum 'MobileNet_V3_Large_QuantizedWeights'>

Getting the Weight Enum class of a model from its callable (model builder):

>>> torchvision.models.get_model_weights(torchvision.models.quantization.mobilenet_v3_large)
<enum 'MobileNet_V3_Large_QuantizedWeights'>

Register models

To register a model we use a special decorator and provide an arbitrary name:

@register_model(name="mobilenet_v3_large")
def mobilenet_v3_large(
    *, weights: Optional[MobileNet_V3_Large_Weights] = None, progress: bool = True, **kwargs: Any
) -> MobileNetV3:
    pass

@register_model(name="quantized_mobilenet_v3_large")
def mobilenet_v3_large(
    *,
    weights: Optional[Union[MobileNet_V3_Large_QuantizedWeights, MobileNet_V3_Large_Weights]] = None,
    progress: bool = True,
    quantize: bool = False,
    **kwargs: Any,
) -> QuantizableMobileNetV3:
    pass

If you are registering a method under its method name, you can omit passing a string. The following call will register the method under the name "some_model":

@register_model()
def some_model():
    pass

Registering a model under an existing name throws an error.

Other details

We will keep the registration mechanism private. Perhaps on the future we can consider allowing users to use it to register custom models but that's not part of this RFC.
Unfortunately the names of the methods under torchvision.models.quantization conflict with those under torchvision.models. As a result we need to offer the ability to register them under arbitrary names that resolve the conflicts.
The current registration mechanism will throw an error if one tries to register a model under an existing name. It's possible to extend the mechanism to allow overwrites (similar to ClassyVision) but given that the mechanism is private we don't have an immediate need of this feature.

Alternatives Considered

We considered offering separate registration APIs per submodule (aka one for torchvision.models, one for torchvision.models.detection etc). This would resolve the issue of name conflicts across submodules but would lead to having separate public methods per package. Moreover this would lead to an awkward design since methods like torchvision.models.get_weight() already support all model subpackages.

NicolasHug

Thanks @datumbox , just question from me for now

Some wasys we wanted to use the registration mechanism were:

define which weight correspond to pretrained=True so we can document this automatically
get the available weights from a model name / builder

Do we intend to plan for this now, or later?

torchvision/models/_api.py

torchvision/models/__init__.py

datumbox · 2022-07-29T08:58:10Z

@NicolasHug Thanks for the feedback.

define which weight correspond to pretrained=True so we can document this automatically

I don't think this is something that the registration mechanism should handle because the pretrained=True will go away in a few versions. I think this is still useful but probably needs to be implemented on a different annotation than register_model.

get the available weights from a model name / builder

Yes we can do that. We already have a private method that can achieve exactly this called _get_enum_from_fn() but we can make one public using the string names.

NicolasHug

I don't think this is something that the registration mechanism should handle because the pretrained=True will go away in a few versions

In my mind, it doesn't really matter when pretrained=True will be fully removed. This kind of info will be relevant and useful even when it's completely removed. For example for people looking at code (e.g. from past papers implementations) that still use this syntax, they would need to know which weight it corresponds to.

We can do it in another decorator / registration mechanism, but it looks like it might have some overlap with this one.

about the "quantized_xyz" name: it's a bit of a shame that the name doesn't match the model builder's name, but as discussed earlier I don't think there's a much better way that what you did here.

torchvision/models/_api.py

datumbox · 2022-07-29T14:03:23Z

Thanks everyone for their feedback and input. I will close this PR and send a new one with all the proposed changes. This way we will keep this live RFC doc easy to read.

datumbox added 3 commits July 28, 2022 18:40

Model registration mechanism.

0d62aaf

Add overwrite options to the dataset prototype registration mechanism.

0e7eb8a

Adding example models.

1520566

datumbox added module: models new feature labels Jul 28, 2022

datumbox requested a review from NicolasHug July 28, 2022 18:09

facebook-github-bot added the cla signed label Jul 28, 2022

datumbox changed the title ~~[RFC] Registration mechanism for models~~ [RFC] [NOMERGE] Registration mechanism for models Jul 28, 2022

datumbox marked this pull request as draft July 28, 2022 18:10

datumbox added 4 commits July 29, 2022 08:31

Fix module filtering

2e16077

Fix linter

a02c124

Fix docs

eedf8df

Make name optional if same as model builder

a91a5b4

NicolasHug reviewed Jul 29, 2022

View reviewed changes

torchvision/models/_api.py Show resolved Hide resolved

torchvision/models/_api.py Outdated Show resolved Hide resolved

torchvision/models/__init__.py Outdated Show resolved Hide resolved

datumbox added 2 commits July 29, 2022 10:03

Apply updates from code-review.

abbe23e

fix minor bug

1eb8159

NicolasHug approved these changes Jul 29, 2022

View reviewed changes

torchvision/models/_api.py Show resolved Hide resolved

Adding getter for model weight enum

924388e

NicolasHug reviewed Jul 29, 2022

View reviewed changes

torchvision/models/_api.py Outdated Show resolved Hide resolved

datumbox added 2 commits July 29, 2022 10:51

Support both strings and callables on get_model_weight.

bd2327a

linter fixes

a815a63

datumbox changed the title ~~[RFC] [NOMERGE] Registration mechanism for models~~ [RFC] Registration mechanism for models Jul 29, 2022

Fixing mypy.

9e4e62c

nairbv reviewed Jul 29, 2022

View reviewed changes

torchvision/models/_api.py Outdated Show resolved Hide resolved

jdsgomes reviewed Jul 29, 2022

View reviewed changes

torchvision/models/_api.py Show resolved Hide resolved

Renaming get_model_weight to get_model_weights

0209327

datumbox closed this Jul 29, 2022

datumbox deleted the models/registration branch July 29, 2022 14:03

This was referenced Jul 29, 2022

Add registration mechanism for models #6333

Merged

MaxVit model #6342

Merged

[FEEDBACK] Model Registration beta API #6365

Open

Expose on Hub the public methods of the registration API #6364

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RFC] Registration mechanism for models #6330

[RFC] Registration mechanism for models #6330

Uh oh!

datumbox commented Jul 28, 2022 •

edited

Loading

Uh oh!

NicolasHug left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

datumbox commented Jul 29, 2022

Uh oh!

NicolasHug left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

datumbox commented Jul 29, 2022

Uh oh!

Uh oh!

	def get_models_from_module(module):
	# TODO add a registration mechanism to torchvision.models
	return [
	v
	for k, v in module.__dict__.items()
	if callable(v) and k[0].lower() == k[0] and k[0] != "_" and k != "get_weight"
	]

	print("Creating model")
	model = torchvision.models.__dict__[args.model](weights=args.weights, num_classes=num_classes)
	model.to(device)

[RFC] Registration mechanism for models #6330

[RFC] Registration mechanism for models #6330

Uh oh!

Conversation

datumbox commented Jul 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Objective

Background

Overview

List models

Get models

Get weights

Register models

Other details

Alternatives Considered

Uh oh!

NicolasHug left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

datumbox commented Jul 29, 2022

Uh oh!

NicolasHug left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

datumbox commented Jul 29, 2022

Uh oh!

Uh oh!

datumbox commented Jul 28, 2022 •

edited

Loading