The dataset contains 73 sets of sample data from the Yangxin dry dike section of the Yangtze River, which were analyzed by the gray correlation method to extract five key influencing factors: water level altitude difference, cover layer thickness, permeability coefficient, pore ratio, and compression coefficient. The data were analyzed to record the main factors affecting seepage of embankment, which provides an important basis for embankment safety assessment, flood control model construction, and training and validation of machine learning algorithms. The dataset can be widely used in the field of water conservancy engineering to promote the development of related research and practical applications.
| collect place | the lower reaches of the yangtze river |
|---|---|
| data size | 13.0 KiB |
| data format | *.xlxs |
| Coordinate system |
This dataset is derived from the master's thesis, “Predictive Analysis of Pipe Surge Risks in Yangtze River Binary Embankment Based on Gray Correlation and GA-DBN”. In that study, the deep belief network (GA-DBN) method optimized by genetic algorithm was used to apply and validate the levee leakage prediction model by importing five levee leakage factors of Yangxin Dry Dike of the Yangtze River, which were collected as practical application cases, into the model for analysis.
(1) A Deep Belief Network (DBN) model and a Genetic Algorithm optimized DBN (GA-DBN) model are used for the prediction of embankment leakage, which can effectively capture the complex features in the data through the multilayer nonlinear structure. The DBN model can effectively capture the complex features in the data, while the GA-DBN model further combines the genetic algorithm to optimize the network structure and parameters, which improves the prediction accuracy and generalization ability of the model.
(2) The data are input into the support vector machine (SVM) model and compared and analyzed with the training results of the GA-DBN model.
The data quality evaluation based on the training results of the GA-DBN model indicates that the present dataset is of extremely high quality and reliability. During the validation process, the GA-DBN model achieved 100% accuracy, recall, precision, and F1 score, which further validated the high quality of the dataset and the excellent predictive ability of the model, which was significantly better than the SVM model. These excellent performance indicators show that the integrity and accuracy of the data are fully guaranteed during the selection of influence factors, data collection and processing, and can effectively support the prediction and analysis of embankment seepage risk.
| # | number | name | type |
| 1 | 2021YFC3000100 | Lower Yangtze River Flood Disaster Integration and Control and Emergency De-risking Technology and Equipment | National key R & D plan |
This work is licensed under a
Creative
Commons Attribution 4.0 International License.
| # | title | file size |
|---|---|---|
| 1 | _ncdc_meta_.json | 5.0 KiB |
| 2 | 长江阳新干堤水位高度差、覆盖层厚度、渗透系数、孔隙比及压缩系数5个影响因子数据集.xlsx | 13.0 KiB |
©Copyright 2005-. Northwest Institute of Eco-Environment and Resources, CAS.
Donggang West Road 320, Lanzhou, Gansu, China (730000)

