Possible extensions of total_size

I have some trouble understanding the `total_size` parameter and there is no mention of it in the docs.  From the examples I have seen so far the parameter should be set to the total size of the training data when doing mini-batch training.  This is simple to interpret if the data is just a 1d array but what should I put in there if I'm training a model on views of 2d data (subsampling in both dimensions)?  Should I then use `total_size=data.shape[0]*data.shape[1]`?

From the gitter conversation it seems that total_size for subsampling in more then 1 dimensions needs some extra support on the pymc3 side.  @ferrine suggested the following API:

```python
total_size = int # for shape[0] subsampling
total_size = [int, None, int] # for subsampling [shape[0], shape[2]]
total_size = [int, Ellipsis, int] # for subsampling [shape[0], shape[-1]]
```

In my 2d case I would write `total_size = [data.shape[0], data.shape[1]]`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Possible extensions of total_size #2125

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Possible extensions of total_size #2125

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions