%0 Dataset %T A High-Resolution daily CO₂ Dataset for China (2016–2020) %J National Cryosphere Desert Data Center %I National Cryosphere Desert Data Center(www.ncdc.ac.cn) %U http://www.ncdc.ac.cn/portal/metadata/626fdfb5-61ef-4ef4-8563-305b4078911b %W NCDC %R 10.12072/ncdc.atmosphere.db7467.2026 %A YANG Aixia %K CO₂;XCO₂;XGBoost model;SHAP %X High-resolution column-averaged dry-air CO2 mole fraction (XCO2) data are essential for characterizing carbon sources and sinks, advancing carbon cycle research, and supporting climate policy goals such as carbon peaking and carbon neutrality. However, current satellite retrievals are often spatially fragmented and temporally discontinuous due to cloud cover and aerosol interference. To address these limitations, this study utilizes an XGBoost model optimized via Bayesian optimization (XGBoost-BO) to construct a robust mapping relationship between atmospheric XCO2 concentrations and multi-source auxiliary parameters. Crucially, the incorporation of the SHAP (SHapley Additive exPlanations) methodology enhances model interpretability, ensuring that the reconstruction captures physically meaningful spatiotemporal dynamics across China. The reconstructed XCO2 dataset exhibits high consistency with OCO-2 satellite observations, achieving a coefficient of determination (R²) of 0.98, a Root Mean Square Error (RMSE) of 0.58 ppm, and a Mean Absolute Percentage Error (MAPE) of 0.07%. The model’s reliability is further validated against ground-based TCCON measurements in China, achieving an R2 of 0.92 (RMSE = 1.16 ppm, MAPE = 0.2%) at the Hefei site and an R2 of 0.70 (RMSE = 2.00 ppm, MAPE = 0.4%) at the Xianghe site.For detailed information, please refer to the associated data paper.