Point cloud video streaming over networks is challenging because of the extremely high data rate of uncompressed point cloud data. Adaptive point cloud video streaming has been proposed to deal with this challenge. However, temporal quality variation and stalling might occur under unstable network conditions, potentially degrading users' Quality of Experience (QoE). This paper aims to evaluate and model the impacts of temporal quality variations and stalling on users' QoE in adaptive point cloud video streaming. We first conduct a largescale subjective study to construct a QoE database. Then, based on the constructed database, the effects of individual factors are analyzed, and two novel QoE prediction models are presented. Experiment results show that the proposed QoE models achieve high prediction performance in terms of PLCC, SROCC, and RMSE across various point cloud videos.