
[RFC] Improve Transforms testing codebase #2881

Closed
@vfdev-5

Description


🚀 Feature

The current way of testing transformations is not satisfactory due to random configurations, duplicated code, and an unclear structure (test_functional_tensor.py, test_transforms.py, test_transforms_tensor.py). This feature request proposes rewriting the transforms tests to tackle these problems.

Motivation

A structured approach to transforms tests provides:

  • better coverage
  • a simpler way to add tests for new transforms
  • extensibility to other transforms that could cover more input data types (images, masks, bboxes, etc.)

Pitch

What do we need:

  • Tests for functional operations (torchvision.transforms.functional)
    • code coverage checks for incorrect input
    • representative and deterministic operation configs to cover documented API
    • all possible input types:
      • PIL, accimage?, torch.Tensor, batch of tensors
      • tensor's device: CPU/CUDA
      • tensor dtype: integrals and floats, plus half on CUDA
    • ensure the same works for the torchscripted transform
    • check result correctness against stored or precomputed references
    • consistency check across results: torchscripted, tensor, and PIL
  • Tests for Transforms (torchvision.transforms)
    • code coverage checks for incorrect input
    • representative and deterministic transform configs to cover documented API
    • tests for generated random parameters
    • ensure the same works for the torchscripted transform
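Several bullets above boil down to comparing outputs across backends (PIL, tensor, torchscripted). One way to share that check is a single assertion helper; a minimal stdlib sketch follows (the helper name and tolerance are hypothetical — real tests would compare actual tensor/PIL/scripted outputs, e.g. via torch.testing):

```python
def assert_outputs_consistent(outputs, tol=1):
    """Check that all outputs (e.g. from the PIL, tensor, and torchscripted
    paths) agree elementwise within a tolerance absorbing uint8 rounding."""
    reference = outputs[0]
    for other in outputs[1:]:
        assert len(other) == len(reference), "outputs differ in size"
        for a, b in zip(reference, other):
            assert abs(a - b) <= tol, f"values {a} and {b} differ by more than {tol}"


# three backends producing results that differ only by rounding
assert_outputs_consistent([[10, 20, 30], [10, 21, 30], [9, 20, 30]])
```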

How to do that:

Limitations:

  • pytest.parametrize cannot be used due to certain internal reasons.

1) Inspiration from "Simplify and organize test_ops" PR

The common part of testing a transformation can be defined in a base class, and derived classes configure the input type, device, etc.
Example from the referenced PR:

class RoIOpTester:
    # Defines functional tests to execute
    # on CPU, on CUDA, other options, etc.

class RoIPoolTester(RoIOpTester, unittest.TestCase):

    def fn(self, *args, **kwargs):
        # function to test

    def get_script_fn(self, *args, **kwargs):
        # scripted function

    def expected_fn(self, *args, **kwargs):
        # reference function
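Applied to transforms, the same pattern might look like the following stdlib-only sketch. All class names, configs, and stand-in functions here are hypothetical — a real tester would call into torchvision ops and reference implementations:

```python
import unittest


class TransformOpTester:
    """Hypothetical base class: shared test logic, configured by subclasses."""

    def fn(self, **kwargs):
        raise NotImplementedError  # the op under test

    def expected_fn(self, **kwargs):
        raise NotImplementedError  # the reference implementation

    def test_output_matches_reference(self):
        # shared check: every deterministic config matches the reference
        for config in self.configs:
            self.assertEqual(self.fn(**config), self.expected_fn(**config))


class ResizeTester(TransformOpTester, unittest.TestCase):
    # deterministic configs covering the documented API (hypothetical values)
    configs = [{"x": 2}, {"x": 10}]

    def fn(self, x):
        return x * 2          # stands in for e.g. F.resize(...)

    def expected_fn(self, x):
        return x + x          # stands in for a PIL/reference implementation
```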

2) Use torch.testing._internal

from torch.testing._internal.common_utils import TestCase, run_tests
from torch.testing._internal.common_device_type import dtypes, dtypesIfCUDA, instantiate_device_type_tests
from torch.testing import floating_types, floating_types_and_half, integral_types


class Tester(TestCase):

    @dtypes(*(floating_types() + integral_types()))
    @dtypesIfCUDA(*(floating_types_and_half() + integral_types()))
    def test_resize(self, device, dtype):
        img = self.create_image(h=12, w=16, device=device, dtype=dtype)
        ...


instantiate_device_type_tests(Tester, globals())

if __name__ == '__main__':
    run_tests()

This gives:

TesterCPU::test_resize_cpu_float32 PASSED [ 14%]
TesterCPU::test_resize_cpu_float64 PASSED [ 28%]
TesterCPU::test_resize_cpu_int16 PASSED   [ 42%]
TesterCPU::test_resize_cpu_int32 PASSED   [ 57%]
TesterCPU::test_resize_cpu_int64 PASSED   [ 71%]
TesterCPU::test_resize_cpu_int8 PASSED    [ 85%]
TesterCPU::test_resize_cpu_uint8 PASSED   [100%]

Problems:

  • dtypes works well for torch.Tensor, but there is no simple way to add PIL as a dtype
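One way around this would be a custom decorator that expands a test over arbitrary input-type tags, so "pil" can sit alongside tensor dtypes. A stdlib sketch of that idea (decorator and helper names are hypothetical; dtypes are plain strings standing in for torch dtypes):

```python
import unittest


def for_input_types(*input_types):
    """Hypothetical decorator: tag a test for expansion over input types."""
    def mark(fn):
        fn._input_types = input_types
        return fn
    return mark


def instantiate_input_type_tests(cls):
    """Replace each tagged test with one concrete method per input type,
    mimicking what instantiate_device_type_tests does for devices."""
    for name, fn in list(vars(cls).items()):
        types = getattr(fn, "_input_types", None)
        if types is None:
            continue
        delattr(cls, name)
        for t in types:
            def test(self, _fn=fn, _t=t):
                _fn(self, _t)
            setattr(cls, f"{name}_{t}", test)
    return cls


@instantiate_input_type_tests
class Tester(unittest.TestCase):

    @for_input_types("pil", "uint8", "float32")
    def test_resize(self, input_type):
        # a real test would build a PIL image or a tensor of this dtype
        self.assertIn(input_type, ("pil", "uint8", "float32"))
```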

3) The parameterized package

The package looks promising and could potentially work around the pytest limitation (to be confirmed by FB).

import unittest

from torch.testing import floating_types, integral_types
from parameterized import parameterized


class Tester(unittest.TestCase):

    @parameterized.expand(
        [("cuda", dt) for dt in floating_types() + integral_types()] +
        [("cpu", dt) for dt in floating_types() + integral_types()]
    )
    def test_resize(self, device, dtype):
        pass


if __name__ == "__main__":
    unittest.main()

This gives:

TestMathUnitTest::test_resize_00_cuda PASSED  [  7%]
TestMathUnitTest::test_resize_01_cuda PASSED  [ 14%]
TestMathUnitTest::test_resize_02_cuda PASSED  [ 21%]
TestMathUnitTest::test_resize_03_cuda PASSED  [ 28%]
TestMathUnitTest::test_resize_04_cuda PASSED  [ 35%]
TestMathUnitTest::test_resize_05_cuda PASSED  [ 42%]
TestMathUnitTest::test_resize_06_cuda PASSED  [ 50%]
TestMathUnitTest::test_resize_07_cpu PASSED   [ 57%]
TestMathUnitTest::test_resize_08_cpu PASSED   [ 64%]
TestMathUnitTest::test_resize_09_cpu PASSED   [ 71%]
TestMathUnitTest::test_resize_10_cpu PASSED   [ 78%]
TestMathUnitTest::test_resize_11_cpu PASSED   [ 85%]
TestMathUnitTest::test_resize_12_cpu PASSED   [ 92%]
TestMathUnitTest::test_resize_13_cpu PASSED   [100%]

Problems:

  • the project's adoption and maintenance are a concern: last commit in April 2020

4) Another approach, inspired by the torchaudio tests

Split the tests into three files, for example.

We could similarly add a file for PIL input, e.g. torchscript_consistency_pil_test.py.
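In torchaudio's layout, shared test definitions live in a common module and thin per-device files instantiate them. A minimal stdlib sketch of that structure (all class names are hypothetical, and the CUDA flag stands in for torch.cuda.is_available()):

```python
import unittest

CUDA_AVAILABLE = False  # stand-in for torch.cuda.is_available()


class TorchscriptConsistencyMixin:
    """Shared test definitions; thin per-device (and per-input-type) test
    files would each subclass this."""
    device = None

    def test_resize(self):
        # would run the scripted transform on self.device and compare outputs
        self.assertIsNotNone(self.device)


# in this layout, the subclass below would live in its own *_cpu_test.py file
class TestTorchscriptConsistencyCPU(TorchscriptConsistencyMixin, unittest.TestCase):
    device = "cpu"


# ...and this one in *_cuda_test.py, skipped where no GPU is present
@unittest.skipUnless(CUDA_AVAILABLE, "CUDA not available")
class TestTorchscriptConsistencyCUDA(TorchscriptConsistencyMixin, unittest.TestCase):
    device = "cuda"
```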

Open questions

  • How to do the operation's configuration injection?
img = ...
ref_fn = ...

for config in test_configs:
    output = fn(img, **config)
    true_output = ref_fn(img, **config)
    self.assertEqual(output, true_output)
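One stdlib option, given that pytest.parametrize is off the table, is to keep the configs as plain data on the test class and iterate them inside unittest's subTest, which still reports each failing config separately. A minimal sketch (the class, configs, and stand-in functions are all hypothetical):

```python
import unittest


class ResizeConfigTester(unittest.TestCase):
    # deterministic, representative configs covering the documented API
    # (hypothetical values; real configs would hold sizes, interpolation modes, ...)
    test_configs = [{"scale": 1}, {"scale": 2}, {"scale": 3}]

    @staticmethod
    def fn(img, scale):
        # stands in for the functional op under test
        return [v * scale for v in img]

    @staticmethod
    def ref_fn(img, scale):
        # stands in for a reference (e.g. PIL-based) implementation
        return [sum([v] * scale) for v in img]

    def test_against_reference(self):
        img = [1, 2, 3]
        for config in self.test_configs:
            # subTest labels each config in the failure report
            with self.subTest(**config):
                self.assertEqual(self.fn(img, **config),
                                 self.ref_fn(img, **config))
```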

Additional context

Recent bugs (e.g. #2869) show unsatisfactory test coverage for transforms.

cc @vfdev-5 @fmassa @datumbox @mthrok
