-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[data request] Colorectal Histology (Collection of textures in colorectal cancer histology) #49
Labels
dataset request
Request for a new dataset to be added
Comments
tfds-copybara
pushed a commit
that referenced
this issue
Feb 22, 2019
I tested it with the script and it's working. The issue is that it add PIL as lazy dependency (to load tiff images), which I'm not sure how to make it works with Python3 (b/63250444). It shouldn't matter for external users (they can install Pillow) and should not matter for internal users (dataset is already generated). It will only be a problem if we try to generate the data using Python3. Hopefully when b/63250444 will be fixed it should works without issue. PR Title: Added colorectal histology image builders PR Body: As [requested](#49). Possible issue: the data is made available as `tif` files and I'm using `PIL` to open them (and to save the dummy data for generation) - is that a problem? From what I understand tensorflow doesn't support `tif` out of the box and I'm not particularly enthusiastic about rolling my own just for this purpose... PUBLIC: Merge of PR #51 Merge d5e6ca7 into f2ea390 PiperOrigin-RevId: 234951670
tfds-copybara
pushed a commit
that referenced
this issue
Feb 22, 2019
I tested it with the script and it's working. The issue is that it add PIL as lazy dependency (to load tiff images), which I'm not sure how to make it works with Python3 (b/63250444). It shouldn't matter for external users (they can install Pillow) and should not matter for internal users (dataset is already generated). It will only be a problem if we try to generate the data using Python3. Hopefully when b/63250444 will be fixed it should works without issue. PR Title: Added colorectal histology image builders PR Body: As [requested](#49). Possible issue: the data is made available as `tif` files and I'm using `PIL` to open them (and to save the dummy data for generation) - is that a problem? From what I understand tensorflow doesn't support `tif` out of the box and I'm not particularly enthusiastic about rolling my own just for this purpose... PUBLIC: Merge of PR #51 Merge d5e6ca7 into f2ea390 PiperOrigin-RevId: 234951670
Big thanks to @jackd for submitting this dataset. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The dataset serves as a much more interesting MNIST or CIFAR10 problem for biologists by focusing on histology tiles from patients with colorectal cancer. In particular, the data has 8 different classes of tissue (but Cancer/Not Cancer can also be an interesting problem).
https://www.kaggle.com/kmader/colorectal-histology-mnist/home
https://zenodo.org/record/53169#.XGL2CNFCfOR
Good example for people to see use case of AI for Good.
Can tried to help to do TFRecords files if needed (I am learning it)
Folks who would also like to see this dataset in
tensorflow/datasets
, please +1/thumbs-up so the developers can know which requests to prioritize.The text was updated successfully, but these errors were encountered: