Can BERT be used in OpenNMT-py for fine-tuning, specifically to improve translation models through transfer learning?