The hashed values stored in the annotations indices are skewed by -1

Look at the [hash extension sample](https://github.com/dotnet/machinelearning/blob/master/docs/samples/Microsoft.ML.Samples/Dynamic/Transforms/Conversion/Hash.cs#L58) and compare the hashed values with the values stored in the annotations of the "CategoryHashed" column. 

Notice how the indices in the annotations are skewed by -1 from the values in the dataview. 

// Category  CategoryHashed   Age     AgeHashed
// MLB        36206            18      127
// NFL        19015            14      62
// NFL        **_19015_**            15      43
// MLB        36206            18      127
// MLS        **6013**             14      62

versus [the annotations values:](https://github.com/dotnet/machinelearning/blob/master/docs/samples/Microsoft.ML.Samples/Dynamic/Transforms/Conversion/Hash.cs#L75)


// Output Data
// 
// The original value of the **6012** category is MLS
// The original value of the **_19014_** category is NFL
// The original value of the 36205 category is MLB




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

The hashed values stored in the annotations indices are skewed by -1 #3072

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

The hashed values stored in the annotations indices are skewed by -1 #3072

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions