Missing test h5ad files needed for continue pretraining pipeline #12

@WhenMelancholy

Hi Team, I greatly appreciate your work on this project. The code is very well organized and the documentation is comprehensive, which made it easy to get started. Thank you for making this valuable contribution open source.

While implementing the continue pretraining pipeline, I noticed that some h5ad files required for testing are missing. Specifically, I encountered the following error when trying to run the validation:

FileNotFoundError: [Errno 2] Unable to synchronously open file (unable to open file: name = '.../data/gNNpgpo6gATjuxTE7CCp.h5ad', errno = 2, error message = 'No such file or directory')

I also found that several other datasets are used for validation but are not included in the repository:

# testdatasets=['/R4ZHoQegxXdSFNFY5LGe.h5ad', '/SHV11AEetZOms4Wh7Ehb.h5ad',
# '/V6DPJx8rP3wWRQ43LMHb.h5ad', '/Gz5G2ETTEuuRDgwm7brA.h5ad', '/YyBdEsN89p2aF4xJY1CW.h5ad',
# '/SO5yBTUDBgkAmz0QbG8K.h5ad', '/r4iCehg3Tw5IbCLiCIbl.h5ad', '/SqvXr3i3PGXM8toXzUf9.h5ad',
# '/REIyQZE6OMZm1S3W2Dxi.h5ad', '/rYZ7gs0E0cqPOLONC8ia.h5ad', '/FcwMDDbAQPNYIjcYNxoc.h5ad',
# '/fvU5BAMJrm7vrgDmZM0z.h5ad', '/gNNpgpo6gATjuxTE7CCp.h5ad'],
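
In case it helps with reproducing this, here is a minimal sketch of how one might check which of these files are absent locally. The `find_missing` helper and the `data` directory name are my own assumptions for illustration, not part of the repository; the file names are copied from the list above with the leading `/` dropped so they can be joined to a directory.

```python
from pathlib import Path

# File names taken from the commented-out `testdatasets` list above,
# with the leading "/" removed so they join cleanly onto a directory.
TEST_DATASETS = [
    "R4ZHoQegxXdSFNFY5LGe.h5ad", "SHV11AEetZOms4Wh7Ehb.h5ad",
    "V6DPJx8rP3wWRQ43LMHb.h5ad", "Gz5G2ETTEuuRDgwm7brA.h5ad",
    "YyBdEsN89p2aF4xJY1CW.h5ad", "SO5yBTUDBgkAmz0QbG8K.h5ad",
    "r4iCehg3Tw5IbCLiCIbl.h5ad", "SqvXr3i3PGXM8toXzUf9.h5ad",
    "REIyQZE6OMZm1S3W2Dxi.h5ad", "rYZ7gs0E0cqPOLONC8ia.h5ad",
    "FcwMDDbAQPNYIjcYNxoc.h5ad", "fvU5BAMJrm7vrgDmZM0z.h5ad",
    "gNNpgpo6gATjuxTE7CCp.h5ad",
]


def find_missing(data_dir, names=TEST_DATASETS):
    """Return the entries of `names` that are not files under `data_dir`."""
    root = Path(data_dir)
    return [name for name in names if not (root / name).is_file()]


if __name__ == "__main__":
    # "data" is an assumed location; point this at the real data directory.
    missing = find_missing("data")
    print(f"{len(missing)} of {len(TEST_DATASETS)} test files are missing")
    for name in missing:
        print("  " + name)
```

On my checkout every file in the list is reported as missing, which matches the `FileNotFoundError` above.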

I wasn't able to locate these test files in the repository. Would you be willing to provide these h5ad files or instructions on how to obtain them? This would be extremely helpful for those of us looking to build upon your excellent work.

Thanks again for creating and sharing this project!

Labels: enhancement, question
