How to build transform-only pipelines #642

AndreasHoersken · 2018-08-03T10:56:18Z

I found comments, that there should be a way to use transforms without the need of a trainer/learner (i.e., building a "processing / transform - pipeline instead of an LearningPipeline, cp. #259 (comment)). Unfourtunately, I could not find out, how to achieve this.

In my usecase, I want to determine similarity of documents with n-gram vectorization and cosine distance. The functionalty for featurization is given by the TextFeaturizer (https://docs.microsoft.com/de-de/dotnet/api/microsoft.ml.transforms.textfeaturizer). In this usecase I don't want to do a training (yet), but am interessted in the output in the result of the TextFeaturizer itself.

Accessing the results of partial steps could be helpful for debugging LearningPipelines too (cp. discussion here: #259).
@TomFinley

Zruty0 · 2018-08-05T22:33:36Z

As far as I know, this is currently not possible.

However, once we build the final API (see #583), you will be able to access the output of transforms without having to train a model.

In fact, the 'trained model' will be just one form of 'transformer', and you will be able to have as many of them chained together as you want (including 0), and mix and match them with other transformers, like TextFeaturizer.

AndreasHoersken · 2018-08-06T07:29:08Z

Thanks! I'm looking forward to the final API.

AndreasHoersken closed this as completed Aug 6, 2018

AndreasHoersken reopened this Aug 6, 2018

AndreasHoersken closed this as completed Aug 6, 2018

ghost locked as resolved and limited conversation to collaborators Mar 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to build transform-only pipelines #642

How to build transform-only pipelines #642

AndreasHoersken commented Aug 3, 2018

Zruty0 commented Aug 5, 2018

AndreasHoersken commented Aug 6, 2018

How to build transform-only pipelines #642

How to build transform-only pipelines #642

Comments

AndreasHoersken commented Aug 3, 2018

Zruty0 commented Aug 5, 2018

AndreasHoersken commented Aug 6, 2018