Using an ImageClassificationTrainer with a dataset that only contains one class of images #4660

antoniovs1029 · 2020-01-15T19:53:26Z

Issue

What did you do?
I tried to train a model that uses the ImageClassificationTrainer, using a dataset that only contains images labeled as 'dog'.
What happened?
I got a System.InvalidOperationException: 'Metadata KeyValues does not exist' while fitting the model, which isn't very informative about the problem. If I didn't know my dataset only contained one label, or if I were to split the dataset in a way that the training set had only one label, then getting that exception wouldn't help to fix the problem.

Also notice that the exception is thrown after training is done, so an user would have already invested time on it before noticing that something is wrong.

What did you expect?
Probably an exception which tells me that there is only one class in my dataset, as done by other multiclass trainers such as the Multiclass LightGBM which throws a "System.InvalidOperationException: 'LightGBM Error, code is -1, error message is 'Number of classes should be specified and greater than 1 for multiclass training'.'", or the LinearMulticlassModelParametersBase which throws a System.ArgumentOutOfRangeException: 'Must be at least 2. Parameter name: numClasses' in here.

If I changed my dataset to include other labels, then the exception is gone and it works as expected.

Source code / logs

dataset.zip

using Microsoft.ML.Data;
using Microsoft.ML.Vision;

namespace Microsoft.ML.Samples
{
    public class ModelInput
    {
        [ColumnName("Label"), LoadColumn(0)]
        public string Label { get; set; }


        [ColumnName("ImageSource"), LoadColumn(1)]
        public string ImageSource { get; set; }
    }

    public static class Program
    {
        // private static string TRAIN_DATA_FILEPATH = @"C:\Users\anvelazq\Desktop\issue19\dogs-cats-horses.tsv"; // This one works because it has multiple classes
        private static string TRAIN_DATA_FILEPATH = @"C:\Users\anvelazq\Desktop\issue19\only-dogs.tsv"; // This one doesn't work because it has only one class

        private static MLContext mlContext = new MLContext(seed: 1);

        public static void Main()
        {
            IDataView trainingDataView = mlContext.Data.LoadFromTextFile<ModelInput>(
                                            path: TRAIN_DATA_FILEPATH,
                                            hasHeader: true,
                                            separatorChar: '\t',
                                            allowQuoting: true,
                                            allowSparse: false);

            var dataProcessPipeline = mlContext.Transforms.Conversion.MapValueToKey("Label", "Label")
                                      .Append(mlContext.Transforms.LoadRawImageBytes("ImageSource_featurized", null, "ImageSource"))
                                      .Append(mlContext.Transforms.CopyColumns("Features", "ImageSource_featurized"));

            var trainer = mlContext.MulticlassClassification.Trainers.ImageClassification(new ImageClassificationTrainer.Options() { LabelColumnName = "Label", FeatureColumnName = "Features" })
                                      .Append(mlContext.Transforms.Conversion.MapKeyToValue("PredictedLabel", "PredictedLabel"));
            var trainingPipeline = dataProcessPipeline.Append(trainer);

            var model = trainingPipeline.Fit(trainingDataView); // System.InvalidOperationException: 'Metadata KeyValues does not exist'

            var transformed = model.Transform(trainingDataView);
            var transformedPreview = transformed.Preview();
        }
    }
}

Nugets used:

The text was updated successfully, but these errors were encountered:

antoniovs1029 · 2020-01-15T21:59:33Z

I believe the core of this problem is in the ImageClassificationTrainer, on this line:

machinelearning/src/Microsoft.ML.Vision/ImageClassificationTrainer.cs

Line 606 in c4e4263

_classCount = labelCount == 1 ? 2 : (int)labelCount;

Since labelCount holds the number of labels found on the dataset, this line hardcodes to handle a dataset with only 1 label as if it had 2 classes. I am still unsure as to why this was hardcoded this way, since, later on (after training) an exception is thrown in the KeyToValueTransformer that is supposed to be added at the end of the pipeline, in here:

machinelearning/src/Microsoft.ML.Data/Transforms/KeyToValue.cs

Line 231 in c4e4263

Host.Check(typeVals != null, "Metadata KeyValues does not exist");

It's thrown because it can't find the KeyValues Annotation inside the PredictedLabel column of the output schema that results after training the ImageClassificationTrainer.

So, for instance, if I were to remove the KeyToValue trainer at the end of the pipeline of my source code, then, for the case of the dataset with only 1 label, the pipeline trains without problem, and the PredictedLabel Column has the following Annotations:

Whereas, if I train it using a dataset with more than 1 label, then the annotations are as follows (notice the KeyValues annotation):

The exact reason of why the KeyValues annotation isn't there when the dataset had only 1 label is still not a 100% clear to me. It seems to me it has to do with how Annotations are supposed to be propagated in multiclass trainers (see issue #3090 and PR #3101).

Particularly, I also noticed that the Annotations for the Score column also changes depending if there was only 1 label in the dataset or not (if there were more than 1 labels, then the Score column has annotations called SlotNames and TrainingLabelValues, if the dataset had only 1 label then those annotations are not included in the schema). For the case of the Score column, I did figure out why were those annotations missing... and the reason is in the MulticlassClassificationScorer :

machinelearning/src/Microsoft.ML.Data/Scorers/MulticlassClassificationScorer.cs

Lines 484 to 489 in b045e4d

    
           var resultMapper = mapper; 
        
           if (CanWrapTrainingLabels(resultMapper, type)) 
        
               resultMapper = LabelNameBindableMapper.CreateBound<T>(env, (ISchemaBoundRowMapper)resultMapper, type as VectorDataViewType, getter, AnnotationUtils.Kinds.TrainingLabelValues, CanWrapTrainingLabels); 
        
           if (CanWrapSlotNames(resultMapper, type)) 
        
               resultMapper = LabelNameBindableMapper.CreateBound<T>(env, (ISchemaBoundRowMapper)resultMapper, type as VectorDataViewType, getter, AnnotationUtils.Kinds.SlotNames, CanWrapSlotNames); 
        
           return resultMapper;

If CanWrapTrainingLabels and CanWrapSlotNames return true, then the TrainingLabelValues and SlotNames annotations get added to the Score column respectively. For the case of having a dataset with 1 label, those methods return false, pretty much, because there's a mismatch between the size of the Score vector (which is 2, because the ImageClassificationTrainer was trained as if there were 2 classes) and the number of labels found by the ValueToKeyTransformer (which would be only 1). Generally this mismatch wouldn't happen, and it only happens in here because of how the case of having a dataset with 1 label was handled in ImageClassificationTrainer.

And even though I haven't fully figured out how does the missing annotations in the Score column affect the PredictedLabel annotations, I would still think it's better to simply throw an exception in ImageClassificationTrainer when labelCount is 1, as is done in the other multiclass trainers I linked to in the first post.

antoniovs1029 · 2020-01-15T22:21:22Z

So I've just talked with @codemzs and we decided the best option is to simply throw an exception in here if labelCount is 1:

machinelearning/src/Microsoft.ML.Vision/ImageClassificationTrainer.cs

Line 606 in c4e4263

_classCount = labelCount == 1 ? 2 : (int)labelCount;

Instead of trying to make all the different changes required to support the corner case of having only 1 class represented on the dataset.

…nly 1 class (#4662) * Throw exception when dataset contains only 1 label. Fixes #4660

antoniovs1029 self-assigned this Jan 15, 2020

antoniovs1029 added bug Something isn't working P0 Priority of the issue for triage purpose: IMPORTANT, needs to be fixed right away. labels Jan 15, 2020

antoniovs1029 mentioned this issue Jan 15, 2020

Throw exception in ImageClassificationTrainer when dataset contains only 1 class #4662

Merged

antoniovs1029 closed this as completed in #4662 Jan 25, 2020

antoniovs1029 added a commit that referenced this issue Jan 25, 2020

Throw exception in ImageClassificationTrainer when dataset contains o…

0618515

…nly 1 class (#4662) * Throw exception when dataset contains only 1 label. Fixes #4660

ghost locked as resolved and limited conversation to collaborators Mar 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using an ImageClassificationTrainer with a dataset that only contains one class of images #4660

Using an ImageClassificationTrainer with a dataset that only contains one class of images #4660

antoniovs1029 commented Jan 15, 2020 •

edited

Loading

antoniovs1029 commented Jan 15, 2020

antoniovs1029 commented Jan 15, 2020 •

edited

Loading

Using an ImageClassificationTrainer with a dataset that only contains one class of images #4660

Using an ImageClassificationTrainer with a dataset that only contains one class of images #4660

Comments

antoniovs1029 commented Jan 15, 2020 • edited Loading

Issue

Source code / logs

Nugets used:

antoniovs1029 commented Jan 15, 2020

antoniovs1029 commented Jan 15, 2020 • edited Loading

antoniovs1029 commented Jan 15, 2020 •

edited

Loading

antoniovs1029 commented Jan 15, 2020 •

edited

Loading