Add MovingMNIST #2690
Conversation
Thanks for the draft! I left a few comments.
@@ -435,6 +435,74 @@ def get_int(b: bytes) -> int:
    return int(codecs.encode(b, 'hex'), 16)


class MovingMNIST(VisionDataset):
    """MovingMNIST"""
Could you please add a link (http://www.cs.toronto.edu/~nitish/unsupervised_video/) to the dataset, like
`MNIST <http://yann.lecun.com/exdb/mnist/>`_ Dataset.
and define the docstring Args etc. as is done for other datasets.
Can't we inherit it from MNIST?
If we use this as a video dataset, we shouldn't. We probably need to use MNIST
to generate the training split though.
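Generating training sequences on-the-fly from MNIST digits could look roughly like this. This is only a hedged sketch: the 64x64 canvas, 20-frame sequences, and bouncing motion follow the dataset authors' description, but `generate_sequence` and the dummy digit array are illustrative names, not part of this PR.

```python
import numpy as np

def generate_sequence(digit, num_frames=20, canvas=64, seed=0):
    """Move a single digit patch across a canvas, bouncing off the edges."""
    rng = np.random.default_rng(seed)
    h, w = digit.shape
    # random start position and velocity
    y, x = rng.integers(0, canvas - h), rng.integers(0, canvas - w)
    dy, dx = rng.choice([-3, -2, 2, 3], size=2)
    frames = np.zeros((num_frames, canvas, canvas), dtype=np.uint8)
    for t in range(num_frames):
        frames[t, y:y + h, x:x + w] = digit
        # reverse velocity when the next step would leave the canvas
        if not 0 <= y + dy <= canvas - h:
            dy = -dy
        if not 0 <= x + dx <= canvas - w:
            dx = -dx
        y, x = y + dy, x + dx
    return frames

# dummy 28x28 "digit" standing in for a real MNIST image
seq = generate_sequence(np.full((28, 28), 255, dtype=np.uint8))
print(seq.shape)  # (20, 64, 64)
```

The real dataset overlays two digits per sequence and lets them occlude each other; extending the sketch to two patches is straightforward.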
Hey @vballoli,
thanks a lot for this PR. We discussed this internally and have some comments on this:
1. MovingMNIST is a video dataset, i.e. the individual frames have a strong temporal relation. Currently you are treating it as an image dataset. We should return a 4D tensor representing a single video (channels x depth x height x width) rather than an image with multiple channels (channels x height x width). This also means that we shouldn't treat the frames individually for the transforms.
2. You only implemented the test split, since this is the only generated data provided by the authors. Since the authors also provide the code to generate the sequences (see for example the implementation by tensorflow), it might be beneficial to also provide a train split. If we do this, we need to decide whether we want to generate the sequences ahead-of-time or on-the-fly. Depending on how large the training split will be, the former might be too much of a burden on memory (even the test split is ~1GB and resides completely in memory).
3. Although the authors split the sequences in two with 10 frames each, this should not be required, since MovingMNIST is marketed as an unsupervised dataset. This is especially important if we generate the training data with a deviating number of frames. IMO we should simply return a single video. Maybe we could provide a (static) method that implements the authors' split.
Let us know what you think and if you are willing to continue working on this.
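The interface proposed above could be sketched as follows. This is a hedged sketch, not the PR's implementation: NumPy arrays stand in for torch tensors, `MovingMNISTSketch` and `split` are illustrative names, and `data` is a dummy array in place of the real loaded sequences.

```python
import numpy as np

class MovingMNISTSketch:
    def __init__(self, data):
        # data: (num_videos, num_frames, height, width), uint8
        self.data = data

    def __len__(self):
        return len(self.data)

    def __getitem__(self, index):
        # return one whole video with a singleton channel dim:
        # (channels, depth, height, width)
        return self.data[index][None, ...]

    @staticmethod
    def split(video, n=10):
        # authors' convention: first n frames as input, remaining as target
        return video[:, :n], video[:, n:]

ds = MovingMNISTSketch(np.zeros((5, 20, 64, 64), dtype=np.uint8))
video = ds[0]                              # (1, 20, 64, 64)
inp, target = MovingMNISTSketch.split(video)
print(video.shape, inp.shape, target.shape)
```

Keeping the 10+10 split in a static method means the default `__getitem__` stays a single unsupervised video, as suggested in point 3.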
                self.file)))

    def __getitem__(self, index: int) -> Tuple[Any, Any]:
Without a length, the dataset is not iterable.
Suggested change:

    def __len__(self) -> int:
        return len(self.data)

    def __getitem__(self, index: int) -> Tuple[Any, Any]:
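A tiny demonstration of the point behind the suggestion: with both `__len__` and `__getitem__` defined, the dataset supports `len()` and plain iteration (the `ToyDataset` name here is purely illustrative).

```python
class ToyDataset:
    def __init__(self, data):
        self.data = data

    def __len__(self):
        return len(self.data)

    def __getitem__(self, index):
        return self.data[index]

ds = ToyDataset([10, 20, 30])
# iteration falls back to __getitem__ with increasing indices,
# stopping at IndexError; len() requires __len__
print(len(ds), list(ds))  # 3 [10, 20, 30]
```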
        with open(path, 'rb') as f:
            x = torch.tensor(np.load(f))
            assert(x.dtype == torch.uint8)
            return x
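For context on this loading step, a hedged sketch of handling the published test file: the authors' `mnist_test_seq.npy` reportedly stores frames first, shape `(num_frames, num_videos, height, width)` = `(20, 10000, 64, 64)`, so a transpose is needed to get one row per video. A small in-memory dummy array stands in for the real ~1GB file.

```python
import io
import numpy as np

# stand-in for the downloaded mnist_test_seq.npy (3 videos instead of 10000)
buf = io.BytesIO()
np.save(buf, np.zeros((20, 3, 64, 64), dtype=np.uint8))
buf.seek(0)

x = np.load(buf)
assert x.dtype == np.uint8
# move the video axis first -> (num_videos, num_frames, height, width)
videos = x.transpose(1, 0, 2, 3)
print(videos.shape)  # (3, 20, 64, 64)
```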
Nit:

            return x
To complement @pmeier's answer, if you could let the dataset return a single tensor of shape
@pmeier Is somebody working on closing this PR? If not, I can help.
Thanks @vballoli for this great contribution, and sorry for the long time to get a new reply! Unfortunately, there is a new datasets API in the works here, and new datasets should follow the new API design. That being said, rest assured, there are at least two options:
Hope one of the two options works for you @vballoli. 👌
PR in reference to #2676. Currently in draft since the dataset train-test split needs discussion. Inputs regarding how to better handle the train-test arguments are appreciated! @vfdev-5 @pmeier @fmassa