I feel that the code is executed on GPU. Could you tell me how the project use and balance the compute capabilities of multiple GPUs and CPUs?