Miscanthus is one of the most promising perennial crops for bioenergy production, with high yield potential and low environmental footprint. The increasing interest in this crop requires accelerated selection and the development of new screening techniques. The development of analytical methods for improved estimation and reduced manual inspection are needed to better characterize the effects of genetics and the environment in key traits under field conditions. We used persistent multispectral and photogrammetric UAV time-series imagery collected 10 times in the season, together with ground-truth data over thousands miscanthus accessions to determine flowering time, culm height, and biomass yield traits. We compared the performance of Convolutional Neural Network (CNN) architectures that used image data from single dates (2D-spatial) versus the integration of multiple dates (3D-spatio-temporal) architectures to evaluate the value of persistent monitoring and the type of features to predict the traits. The ability of UAV-based remote sensing to rapidly and non-destructively assess large-scale genetic variation in flowering time, height and biomass production was improved through use of 3D-spatio-temporal CNN architectures versus 2D-spatial CNN architectures. The performance gains of the best 3D-spatio-temporal analyses compared to the best 2D-spatial architectures manifested in up to: 23 % improvements in R 2 and 20 % reduction in mean absolute error (MAE). The integration of