Skip to content
This repository was archived by the owner on Sep 11, 2023. It is now read-only.

Create a proportion of examples without PV data, outside the UK #93

Open
JackKelly opened this issue Sep 5, 2021 · 5 comments
Open
Labels
enhancement New feature or request

Comments

@JackKelly
Copy link
Member

We currently only have PV data for the UK. We will at some point want to get PV data for elsewhere but, in the meantime, we'll need nowcasting_dataset to optionally output examples from outside the UK (to train the "image prediction" part of the model on the entire geospatial extent of the satellite imagery).

Maybe we should create two sets of batches on disk: one set which always has PV data (and is over the UK), and another set which is always from outside the UK (and doesn't have PV). Then the ML training script can mix-and-match examples on the fly to vary the ML training curriculum. To keep each batch ballanced, the ML training script will need to load at least two batches at once from disk (one with PV data, the other without) and create a single batch with a mixture of examples.

@JackKelly JackKelly added the enhancement New feature or request label Sep 5, 2021
@peterdudfield
Copy link
Contributor

Not sure this is essential. For WP1 there are some GSP where there are very few / No PV systems i.e Scotland

@peterdudfield peterdudfield removed this from the WP1 essential tasks milestone Sep 24, 2021
@JackKelly
Copy link
Member Author

@jacobbieker in order to train your models in SatFlow, do you think it's essential for the dataset to include training examples from outside the UK? (these examples wouldn't have any PV data yet...)

@jacobbieker
Copy link
Member

Its probably not essential, and for a model that will primarily be focused on the UK for now anyway, it probably doesn't matter as much!

@peterdudfield
Copy link
Contributor

Ill remove this from the NG project. Just to keep things really high priority in there

@JackKelly
Copy link
Member Author

This could probably be done as part of #202

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants