Common question: I'm fine-tuning for an agent. What split of data should I prioritize collecting, and in what mixture?
- Fewer long trajectories?
- More short trajectories / single-step function calls?
- If there is conversation in the mix, how much should I include? And then do i include the full trajectory flattened out? or remove for later calls?