HALO Implementation: HALO_MIMIC3Dataset & HALO Model Classes #528
Adapted from: https://github.com/btheodorou99/HALO_Inpatient/tree/main
I've been using the following files to test this code via the NCSA cluster (they aren't included in the PR/push). Both files need to live at the root of `PyHealth/` (**not** `PyHealth/pyhealth/`).

Testing script (`halo_testing_script.py`):

```python
print("BEGIN: Testing")
import os
import subprocess
import sys

subprocess.check_call([sys.executable, "-m", "pip", "install", "-e", "."])
subprocess.check_call([sys.executable, "-m", "pip", "install", "numpy", "--force-reinstall"])
subprocess.check_call([sys.executable, "-m", "pip", "install", "pandas", "--force-reinstall"])
print("Success on pip install -e .")

from pyhealth.models.generators.halo import HALO
from pyhealth.models.generators.halo_resources.halo_model import HALOModel
from pyhealth.models.generators.halo_resources.halo_config import HALOConfig
from pyhealth.datasets.halo_mimic3 import HALO_MIMIC3Dataset
print("Success on imports")

print(f"Operating in dir: {os.getcwd()}")
halo_config = HALOConfig()
halo_dataset = HALO_MIMIC3Dataset(
    mimic3_dir="../../../../scratch_old/ethanmr3/mimic3/physionet.org/files/mimiciii/1.4/",
    pkl_data_dir="../../halo_pkl/",
    gzip=True,
)
model = HALO(dataset=halo_dataset, config=halo_config, save_dir="../../halo_save/", train_on_init=False)
print("Success on model setup")

model.train()
print("Success on model train")

model.test(testing_results_dir="../../halo_results/")
print("Success on model test")

model.synthesize_dataset(pkl_save_dir="../../halo_results/")
print("Success on dataset synthesis")
print("END: Testing success!!!")
```
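After `synthesize_dataset` finishes, the synthetic records should be readable back from `pkl_save_dir` with `pickle`. A minimal round-trip sketch of that pattern (the nested list-of-visits schema and the filename here are assumptions for illustration, not the actual format HALO writes):

```python
import os
import pickle
import tempfile

# Hypothetical record shape: a list of patients, each a list of visits,
# each visit a list of medical codes. The real schema written by
# HALO.synthesize_dataset may differ; inspect the actual pickle.
synthetic = [
    [["401.9", "250.00"], ["414.01"]],  # patient 1: two visits
    [["038.9"]],                        # patient 2: one visit
]

path = os.path.join(tempfile.mkdtemp(), "synthetic_dataset.pkl")
with open(path, "wb") as f:
    pickle.dump(synthetic, f)

# Loading the synthesized dataset back is a plain pickle read:
with open(path, "rb") as f:
    loaded = pickle.load(f)

assert loaded == synthetic
print(f"Loaded {len(loaded)} synthetic patients")
```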
Slurm job (`test_halo_model.slurm`):

```bash
#!/bin/bash
#SBATCH --account=ethanmr3-ic
#SBATCH --job-name=pyhealth-halo-testing
#SBATCH --output=halo-testing-logs/halo_test_%j.out
#SBATCH --error=halo-testing-logs/halo_test_%j.err
#SBATCH --partition=IllinoisComputes-GPU # Change to appropriate partition
#SBATCH --gres=gpu:1 # Request 1 GPU
#SBATCH --cpus-per-task=4
#SBATCH --mem=64G
#SBATCH --time=48:00:00

# Change to the directory where you submitted the job
cd "$SLURM_SUBMIT_DIR"

# Print useful Slurm environment variables for debugging
echo "SLURM_JOB_ID: $SLURM_JOB_ID"
echo "SLURM_JOB_NODELIST: $SLURM_JOB_NODELIST"
echo "SLURM_NTASKS: $SLURM_NTASKS"
echo "SLURM_CPUS_ON_NODE: $SLURM_CPUS_ON_NODE"
echo "SLURM_GPUS_ON_NODE: $SLURM_GPUS_ON_NODE"
echo "SLURM_GPUS: $SLURM_GPUS"
echo "CUDA_VISIBLE_DEVICES: $CUDA_VISIBLE_DEVICES"

# Optional: check which GPU(s) are actually visible
echo "Running nvidia-smi to confirm GPU availability:"
nvidia-smi

# Load modules or activate environment
#module load python/3.10
#module load cuda/11.7
#conda activate your-env

# Run the Python testing script
python /u/ethanmr3/halo/PyHealth/halo_testing_script.py
```
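One submission detail worth noting: Slurm opens the `--output`/`--error` files itself and does not create missing parent directories, so `halo-testing-logs/` has to exist before the job is submitted:

```shell
# Create the log directory before submitting; Slurm will not create it,
# and the job's stdout/stderr would otherwise be lost.
mkdir -p halo-testing-logs
# Then submit from the PyHealth/ root:
# sbatch test_halo_model.slurm
```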