Accurate load balancing accelerates Lagrangian simulation of water ages on distributed, multi-GPU platforms