Adding TMED-2 Echocardiogram Dataset #418
Open
Name:
Riya Verma, Sudha Gattu
NetID:
rverm2, sudhamg2
Type:
Dataset Contribution
Paper:
Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning
Citation to Original Paper:
@Article{huang2023detecting, title={Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning}, author={Huang, Zhe and Wessler, Benjamin S and Hughes, Michael C}, journal={arXiv preprint arXiv:2306.00003}, year={2023} }
Link:
https://arxiv.org/pdf/2306.00003
Description:
We added the EchoBagDataset class, which processes TMED-2 echocardiogram data for multiple instance learning tasks. It imports one table, echo_images, from a CSV file; the table contains study IDs and diagnosis labels. The class dynamically loads image files from a directory based on the patient-study identifiers listed in the table, and it provides functions to retrieve the bag of images and the corresponding label for each patient-study instance.
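For reviewers unfamiliar with the bag-per-study layout, here is a minimal sketch of the idea described above. It is not the code in pyhealth/datasets/echo.py: the class name `SimpleEchoBags`, the helper `build_demo`, the CSV column names `study_id` and `label`, the `<study_id>_<n>.png` file naming, and the label strings are all assumptions made for illustration.

```python
import csv
import tempfile
from pathlib import Path


class SimpleEchoBags:
    """Illustrative stand-in for a bag-style dataset (hypothetical, not the PR's class)."""

    def __init__(self, csv_path, image_dir):
        self.image_dir = Path(image_dir)
        with open(csv_path, newline="") as f:
            # Assumed columns: study_id, label
            self.rows = [(r["study_id"], r["label"]) for r in csv.DictReader(f)]

    def __len__(self):
        # One item per patient-study row in the table
        return len(self.rows)

    def __getitem__(self, idx):
        study_id, label = self.rows[idx]
        # Assumed naming convention: <study_id>_<n>.png
        bag = sorted(self.image_dir.glob(f"{study_id}_*.png"))
        return {"study_id": study_id, "bag": bag, "label": label}


def build_demo(root):
    """Create a tiny fake directory and CSV so the sketch can be run end to end."""
    root = Path(root)
    (root / "images").mkdir()
    for name in ["1s1_0.png", "1s1_1.png", "2s1_0.png"]:
        (root / "images" / name).touch()  # empty placeholder files, not real images
    csv_path = root / "echo_images.csv"
    csv_path.write_text("study_id,label\n1s1,no_AS\n2s1,early_AS\n")
    return SimpleEchoBags(csv_path, root / "images")


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        dataset = build_demo(tmp)
        item = dataset[0]
        print(item["study_id"], len(item["bag"]), item["label"])
```

Each item pairs a variable-length list of image paths with a single study-level label, which is the input shape a multiple instance learning model expects. The sketch returns paths rather than decoded images to stay dependency-free.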
Files to look at to test the implementation:
pyhealth/datasets/echo.py → new EchoBagDataset class
pyhealth/datasets/configs/echo.yaml → config file specifying table schema for the dataset
pyhealth/unittests/test_datasets/test_echo.py → unit tests for dataset loading and item retrieval