%0 Dataset
%T OpenSWI: A Massive-Scale Benchmark Dataset for Surface Wave Dispersion Curve Inversion (2013,2016)
%J National Cryosphere Desert Data Center
%I National Cryosphere Desert Data Center (www.ncdc.ac.cn)
%U http://www.ncdc.ac.cn/portal/metadata/70160163-73d4-4508-9416-b462b1b2ca10
%W NCDC
%R 10.5281/zenodo.16874111
%A Li Yaxing
%K Geophysical inversion;benchmark dataset;OpenSWI
%X In recent years, inspired by their success in computer vision and natural language processing, data-driven deep learning methods have shown great potential for overcoming the challenges of surface wave dispersion curve inversion. However, the lack of large-scale, diverse benchmark datasets remains a major obstacle to developing and evaluating such methods. To fill this gap, we present OpenSWI, a comprehensive benchmark dataset generated with the Surface Wave Inversion Dataset Preprocessing (SWIDP) pipeline. OpenSWI includes two synthetic datasets for different research scales and application scenarios, OpenSWI-shallow and OpenSWI-deep, as well as an AI-ready real-world dataset, OpenSWI-real, for generalization evaluation. OpenSWI-real is compiled from an open-source project and comprises two sets of observed dispersion curves with their corresponding one-dimensional reference models; it serves as a benchmark for evaluating the generalization ability of deep learning models.