Landslides are a significant global geological hazard, with adverse and far for human life, the economy and the natural environment on an annual basis worldwide. Accurately estimating the spatial and temporal distribution of landslide probability is crucial for reducing these losses. Existing landslide warning systems may fail to consider the selection of non-landslide samples and the dynamic process of landslides, potentially compromising the accuracy of landslide warning systems. This study explores the impact of different selections of non-landslide samples and satellite rainfall datasets on the early warning model for landslides in the Guangdong-Hong Kong-Macao Greater Bay Area (GBA). Through Pearson correlation analysis, critical factors associated with landslide occurrences were identified, including elevation, slope, aspect, distance to roads and rivers, soil type, plan curvature, profile curvature, Topographic Wetness Index (TWI), and Normalized Difference Vegetation Index (NDVI). In this study, a semi-supervised random forest (SSRF) model incorporating frequency ratios (FR) to evaluate landslide susceptibility in the GBA. The susceptibility and rainfall threshold model were subsequently combined into a dynamic landslide hazard warning system through a matrix approach. The findings revealed that the maximum area under the curve (AUC) value for a landslide to non-landslide ratio of 1:4 is 0.973. The very high susceptibility zone is typically located between 125 and 250 meters away from roads, which is a common characteristic of the highly urbanized areas. Moreover, the validation phase yielded successful predictions for 67 out of 96 landslide events, providing effective early warning and a reference point for disaster mitigation and prevention.