[Feature Request] Random Spatial Transforms #287

esube · 2017-10-10T16:38:08Z

I just saw the refactored transforms and they look much better. I had issues with the PIL backend of the previous version, so I used to avoid the torchvision transforms entirely and implement the ones I needed locally, mostly with OpenCV.

I also saw that color transforms were added in #275. These are all great! Is there a plan to add spatial transforms such as translation, rotation, shear, etc. (warping augmentations in general)? They are crucial when training on limited datasets, as in attribute prediction, person re-id, extreme classification, etc.

alykhantejani · 2017-10-10T19:10:46Z

Hi @esube, this is something we could potentially do and, as PyTorch now has support for spatial transformers, this could be implemented as just sampling an affine transformation matrix.

I'll try to put something together for this in the next week or so.

esube · 2017-10-10T20:14:09Z

@alykhantejani Thanks for your fast response, as always. Yeah, the easiest way to implement this is with a generic affine transform matrix, just like OpenCV's warp function. Better yet, you might even want to accept the affine matrix as user input, with default behavior for the specific specializations, i.e. translation, rotation, shear, etc.
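To make the "generic matrix plus specializations" idea concrete, here is a minimal sketch in plain Python. The helper names (`translation`, `rotation`, `shear`, `compose`, `apply`) are hypothetical and are not part of torchvision or OpenCV; the point is only that each specialization is a particular 3x3 homogeneous matrix, and a user-supplied matrix can be used in exactly the same way.

```python
import math

# Hypothetical helpers for illustration; not an existing torchvision API.

def translation(tx, ty):
    """3x3 homogeneous matrix that shifts points by (tx, ty)."""
    return [[1.0, 0.0, tx], [0.0, 1.0, ty], [0.0, 0.0, 1.0]]

def rotation(degrees):
    """3x3 homogeneous matrix that rotates points about the origin."""
    r = math.radians(degrees)
    c, s = math.cos(r), math.sin(r)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def shear(degrees):
    """3x3 homogeneous matrix that shears along the x axis."""
    k = math.tan(math.radians(degrees))
    return [[1.0, k, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

def compose(*mats):
    """Multiply matrices left to right; the rightmost one acts on a point first."""
    out = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    for m in mats:
        out = [[sum(out[i][k] * m[k][j] for k in range(3)) for j in range(3)]
               for i in range(3)]
    return out

def apply(matrix, x, y):
    """Map the point (x, y) through the top two rows of the matrix."""
    return (matrix[0][0] * x + matrix[0][1] * y + matrix[0][2],
            matrix[1][0] * x + matrix[1][1] * y + matrix[1][2])

# Rotate by 90 degrees first, then translate by (10, 5).
m = compose(translation(10, 5), rotation(90))
point = apply(m, 1, 0)
```

Because everything reduces to one matrix, a single warp routine could serve both the user-supplied case and the named specializations.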

alykhantejani · 2017-11-01T23:30:25Z

I think this functionality will be added as part of #303

daavoo · 2017-11-08T16:46:06Z

I have code somewhere that implements this functionality using PIL.Image.transform and Image.AFFINE. I need to find some time to adapt the code to the current transforms API, but I'm a little busy right now. Maybe next week.
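For readers unfamiliar with that PIL call, here is a minimal sketch of a translation done with `Image.transform` and `Image.AFFINE`. This is not the code referred to above, and the `translate` helper is a hypothetical name. The detail worth noting is that Pillow's six coefficients describe the mapping from output pixels back to input pixels, so a shift has to be negated.

```python
from PIL import Image

def translate(img, tx, ty):
    """Shift image content by (tx, ty) pixels (hypothetical helper).

    Image.AFFINE takes (a, b, c, d, e, f) and, for each OUTPUT pixel (x, y),
    reads the input at (a*x + b*y + c, d*x + e*y + f). That is the inverse
    mapping, which is why the offsets are negated here.
    """
    return img.transform(img.size, Image.AFFINE, (1, 0, -tx, 0, 1, -ty))

# Small grayscale image with one bright pixel in the top-left corner.
img = Image.new("L", (4, 4), 0)
img.putpixel((0, 0), 255)

# Move the content 1 pixel right and 2 pixels down.
shifted = translate(img, 1, 2)
```

Pixels that map outside the source image are filled with the default fill value (black here).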

daavoo · 2017-12-07T11:31:45Z

Opened a PR with random translation: #363.

@esube @alykhantejani Question: do we want a "generic" RandomAffine transform that lets the user specify the 6 parameters of the affine matrix, or is it better to have specific transforms like RandomRotation, RandomTranslation, etc., or both?

alykhantejani · 2017-12-07T11:57:29Z

I think both would be good, i.e. RandomRotate + RandomTranslate are useful and often clearer if you just want translation, for example. However, the power of the full affine matrix is good too (underneath, it can just call the rotation + translation functions).
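A rough sketch of how "both" could fit together, assuming the general sampler simply delegates to the two specific ones. All function names and signatures below are hypothetical and do not reflect the API that was eventually merged; the output is the 6-parameter tuple `(a, b, c, d, e, f)` discussed above.

```python
import math
import random

# Hypothetical samplers for illustration; not the torchvision API.

def sample_rotation(max_degrees, rng):
    """Draw an angle in [-max_degrees, max_degrees]; return the 2x2 rotation entries."""
    angle = math.radians(rng.uniform(-max_degrees, max_degrees))
    c, s = math.cos(angle), math.sin(angle)
    return (c, -s, s, c)

def sample_translation(max_shift, rng):
    """Draw a shift with |tx| <= max_shift[0] and |ty| <= max_shift[1]."""
    tx = rng.uniform(-max_shift[0], max_shift[0])
    ty = rng.uniform(-max_shift[1], max_shift[1])
    return (tx, ty)

def sample_affine(max_degrees, max_shift, rng):
    """General case: reuse the rotation and translation samplers underneath."""
    a, b, d, e = sample_rotation(max_degrees, rng)
    tx, ty = sample_translation(max_shift, rng)
    return (a, b, tx, d, e, ty)

# A seeded generator keeps the augmentation reproducible.
rng = random.Random(0)
params = sample_affine(30, (4, 2), rng)
```

Users who only want translation would call `sample_translation` directly, while `sample_affine` covers the combined case without duplicating logic.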
