Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Datasets #13

Closed
wants to merge 12 commits into from
Closed

Conversation

ryanccarelli
Copy link
Contributor

@ryanccarelli ryanccarelli commented Oct 15, 2020

Wanted to start discussion on this first draft of Dataset class and Pipeline class functionality to .run(Dataset).

This approach introduces a filesystem described at the top of Dataset.

Changes are tested on a small folder of slides and it runs. (can create a dataset and run pipeline)

Some things aren't finished:
Do we want tiles written individually in a folder for each slide, or do we want to package the tiles for each slide (hdf5?)?

Pipeline.extract_tiles has awkward property that it takes dataset so it can add paths to tiles.

I will add parallelization to pipeline for Dataset objects.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants