Skip to content

How to build transform-only pipelines #642

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
AndreasHoersken opened this issue Aug 3, 2018 · 2 comments
Closed

How to build transform-only pipelines #642

AndreasHoersken opened this issue Aug 3, 2018 · 2 comments

Comments

@AndreasHoersken
Copy link

I found comments, that there should be a way to use transforms without the need of a trainer/learner (i.e., building a "processing / transform - pipeline instead of an LearningPipeline, cp. #259 (comment)). Unfourtunately, I could not find out, how to achieve this.

In my usecase, I want to determine similarity of documents with n-gram vectorization and cosine distance. The functionalty for featurization is given by the TextFeaturizer (https://docs.microsoft.com/de-de/dotnet/api/microsoft.ml.transforms.textfeaturizer). In this usecase I don't want to do a training (yet), but am interessted in the output in the result of the TextFeaturizer itself.

Accessing the results of partial steps could be helpful for debugging LearningPipelines too (cp. discussion here: #259).
@TomFinley

@Zruty0
Copy link
Contributor

Zruty0 commented Aug 5, 2018

As far as I know, this is currently not possible.

However, once we build the final API (see #583), you will be able to access the output of transforms without having to train a model.

In fact, the 'trained model' will be just one form of 'transformer', and you will be able to have as many of them chained together as you want (including 0), and mix and match them with other transformers, like TextFeaturizer.

@AndreasHoersken
Copy link
Author

Thanks! I'm looking forward to the final API.

@ghost ghost locked as resolved and limited conversation to collaborators Mar 29, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants