I tried training on CIFAR-10 with the margin set to 2 and the learning rate set to 0.1, but got NaN. Any idea why?