Now the full data set seems to be 120GB, and I want to test it on my own PC first before moving on to large-scale training.😊