[DONATION] of new datasets #93

PaulRabich · 2023-10-27T08:26:23Z

Hi

In the Paper "Self-Supervised Contrastive Pre-Training For Time Series via Time-Frequency Consistency" found here https://arxiv.org/abs/2206.08496 they use the following datasets:

All of these datasets are published under the https://creativecommons.org/licenses/by/4.0/ licence.

Would it be possible to add them? And if yes, what are the next steps for uploading them?

TonyBagnall · 2023-11-01T12:20:31Z

hi, we would welcome these data. Are they all labelled classification problems? If so, the next stage is to get it into our format. If they are equal length, you get the data into memory so that

X = np.ndarray shape (n_cases, n_channels, n_timepoints)
y = np.ndarray shape n_cases

if unequal length make X a list of ndarray (n_channels, n_timepoints_i) where n_timepoints_i is the length of the ith case.

then you should be able to write them to aeon compatible format

     from aeon.datasets import write_to_tsfile
    write_to_tsfile(X, path = "your_directory", y=y, problem_name="your_filename.ts")

if there is a provided train test split, create trainX, trainy, testX, testy

     from aeon.datasets import write_to_tsfile
    write_to_tsfile(trainX, path = "your_directory", y=trainy, problem_name="your_filename_TRAIN.ts")
    write_to_tsfile(testX, path = "your_directory", y=tresty, problem_name="your_filename_TEST.ts")

you can check it works with this

     from aeon.datasets import load_from_tsfile
    X, y, meta = load_from_tsfile(full_file_path_and_name="your_directory\your_filename.ts", return_meta_data=True)

if there is no provided train test split we create one, but you need to be careful, if there are repetitions from the same subject (e.g. one person repeats a HAR task many times) you need to be clear if you are splitting so train and test do not contain the same person or not. Any problems, let us know

PaulRabich · 2023-11-06T14:48:39Z

Hello, the data comes with presplit train, validation and test sets.

I have converted them all into .ts files. And i can load them with the load_from_tsfile function.

What is the next step?

TonyBagnall · 2023-11-06T14:55:06Z

fantastic, next stage is to get them to us. How big are they? you can email to [email protected] or we can find another way. I will then list them on the site.

Is there a text description we could use? And preferably an image? I set up the pages something like this

https://timeseriesclassification.com/description.php?Dataset=AsphaltObstacles

Not sure how to handle validation set, would be tempted to merge it into train, since its really part of the training.

We will try out our standard suite of classifiers and they can go into the next batch release. Hoping to improve the website this year, will try get an intern as my web skills are not the best :)

TonyBagnall · 2023-11-18T11:03:03Z

got the data, thanks, will process it all next week.

PaulRabich · 2023-11-18T11:15:07Z

If all goes ok, and there is nothing more to do from my side, i would have another set of datasets

TonyBagnall · 2023-11-19T16:50:23Z

will post here as I do them, if you could check that would be great. Ive changed the names to conform to our standards but hopefully links make it clear.
https://timeseriesclassification.com/description.php?Dataset=Sleep
https://timeseriesclassification.com/description.php?Dataset=WalkingSittingStanding

TonyBagnall · 2023-11-20T11:46:42Z

https://timeseriesclassification.com/description.php?Dataset=FaultDetectionA

TonyBagnall · 2023-11-20T12:23:19Z

https://timeseriesclassification.com/description.php?Dataset=FaultDetectionB

TonyBagnall · 2023-11-20T12:31:46Z

https://timeseriesclassification.com/description.php?Dataset=NerveDamage

TonyBagnall · 2023-11-20T13:06:44Z

https://timeseriesclassification.com/description.php?Dataset=CardiacArrhythmia

TonyBagnall · 2023-11-20T13:38:39Z

and lastly
https://timeseriesclassification.com/description.php?Dataset=Epilepsy2

Not putting Gesture in as its really UWave. Happy to put more in if you have them @PaulRabich

TonyBagnall closed this as completed Nov 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DONATION] of new datasets #93

[DONATION] of new datasets #93

PaulRabich commented Oct 27, 2023

TonyBagnall commented Nov 1, 2023

Uh oh!

PaulRabich commented Nov 6, 2023

Uh oh!

TonyBagnall commented Nov 6, 2023

Uh oh!

TonyBagnall commented Nov 18, 2023

Uh oh!

PaulRabich commented Nov 18, 2023

Uh oh!

TonyBagnall commented Nov 19, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

[DONATION] of new datasets #93

[DONATION] of new datasets #93

Comments

PaulRabich commented Oct 27, 2023

TonyBagnall commented Nov 1, 2023

Uh oh!

PaulRabich commented Nov 6, 2023

Uh oh!

TonyBagnall commented Nov 6, 2023

Uh oh!

TonyBagnall commented Nov 18, 2023

Uh oh!

PaulRabich commented Nov 18, 2023

Uh oh!

TonyBagnall commented Nov 19, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!

TonyBagnall commented Nov 20, 2023

Uh oh!