Enhanced Predictive Ensemble Learning Models for Compressive Strength of Concrete Modified with Bentonite Clay

Samer  Dessouky; Ali Reza  Ghanizadeh; Amir Tavana  Amlashi

Open Access |

Enhanced Predictive Ensemble Learning Models for Compressive Strength of Concrete Modified with Bentonite Clay

Amir Tavana Amlashi1*, Ali Reza Ghanizadeh2, Samer Dessouky1

1School of Civil and Environmental Engineering and Construction Management, University of Texas at San Antonio, San Antonio, Texas, USA

2Department of Civil Engineering, Sirjan University of Technology, Sirjan, Iran

*Correspondence to: Amir Tavana Amlashi, School of Civil and Environmental Engineering and Construction Management, University of Texas at San Antonio, San Antonio, Texas, 78282, USA; Email: Amir.tavanaamlashi@my.utsa.edu

DOI: 10.53964/id.2024033

Abstract

This study addresses the logistical challenges of testing bentonite plastic concrete in the field by developing predictive models for compressive strength (CS) using Python-based machine learning techniques. The models, including random forest, adaptive boosting, extreme gradient boosting, and gradient boosting regression tree, were enhanced with forensic-based investigation optimization. A dataset of 285 CS records was used to train and validate these models. SHAP analysis highlighted the impact of various inputs like gravel, bentonite, curing time, and cement on CS. Results show that the GBRT-FBIO model outperformed the others in CS prediction, indicating its superior effectiveness for material science applications.

Keywords: bentonite plastic concrete, random forest, adaptive boosting, extreme gradient boosting, gradient boosting regression tree, shap analysis, forensic-based investigation optimization

1 INTRODUCTION

Waste management is a critical global challenge as industries and societies push toward sustainability. One of the key issues in this domain is heavy metal contamination in wastewater which arises from various industrial processes, including mining, electroplating, and metal refining[1]. Heavy metals such as chromium (Cr), mercury (Hg), copper (Cu), lead (Pb), cadmium (Cd), zinc (Zn), and nickel (Ni) pose significant ecological hazards because they do not degrade and tend to bioaccumulate in organisms, causing severe health and environmental issues[2,3].Traditional methods for treating wastewater, such as chemical precipitation and ion exchange, can be costly and sometimes ineffective at removing trace levels of heavy metals. Adsorption, especially using clay minerals, has emerged as an efficient and economical method for removing these pollutants[4].

Bentonite, a clay composed primarily of montmorillonite, has proven to be particularly effective for heavy metal adsorption due to its large surface area, high cation exchange capacity, low cost, and non-toxic nature[5,6]. Several studies have demonstrated the potential of bentonite for removing heavy metals from water[7,8]. The combination of bentonite with concrete, forming what is known as bentonite plastic concrete (BPC), enhances the material’s properties for environmental engineering applications. Specifically, BPC is valued for its low permeability, which makes it ideal for use in structures like dam cutoff walls to prevent water seepage[9]. Research has shown that BPC can absorb water and expand, contributing to its effective performance in limiting seepage and increasing the longevity of engineering structures[10].

BPC has seen widespread use in projects where controlling seepage is crucial, such as dam construction, where it reduces the risk of structural failure caused by internal water pressure and soil movement[11]. Despite its advantages, the preparation, testing, and optimization of BPC mixtures are labor-intensive and costly, often requiring specialized equipment and skilled personnel. These factors drive the need for computational models that can predict the material’s behavior, particularly its compressive strength (CS), without relying on extensive physical testing[12].

Recent advances in machine learning (ML) and artificial intelligence (AI) have opened up new possibilities for material property prediction. ML techniques such as artificial neural networks (ANN), support vector machines (SVM), and ensemble learning methods like gradient boosting regression tree (GBRT) and random forest (RF) have demonstrated superior accuracy in predicting concrete properties compared to traditional regression-based methods[13-15]. These methods can model complex, non-linear relationships between input variables, providing precise predictions without requiring large-scale experimental data[16,17]. For BPC, previous studies have applied various ML techniques to predict key properties such as slump, elastic modulus, and CS. For instance, Tavana Amlashi et al.[18] found that ANN models were more accurate than conventional methods for predicting the mechanical properties of BPC, while Ghanizadeh et al.[14] demonstrated that SVM and ANN could effectively predict CS, with cement content and silty clay showing the greatest and least influence, respectively, on the material’s performance.

In more recent work, hybrid AI models, such as Adaptive Neuro-Fuzzy Inference Systems (ANFIS) optimized with Particle Swarm Optimization, have been shown to outperform standalone models, achieving high accuracy in predicting the CS and tensile strength of plastic concretes[19]. Similarly, other computational techniques, including response surface methodology (RSM), multigene genetic programming (MGGP), and the group method of data handling (GMDH), have also been employed to model the mechanical properties of BPC using input parameters such as water content, bentonite, cement, sand, gravel, and curing time[20]. Ensemble learning models, which combine the predictions of multiple base learners to improve performance, are particularly promising for this task. Algorithms such as GBRT, extreme gradient boosting (XGB), and RF have shown notable success in predicting the mechanical properties of various concrete types, including high-performance concrete (HPC), recycled aggregate concrete (RAC), and geopolymer concrete (GPC)[21,22].

One of the limitations in existing studies is the lack of robustness in these ML models when applied to diverse datasets, as well as insufficient attention to meta-parameter optimization, which is crucial for improving model performance[23]. Tuning meta-parameters in ensemble learning (EL) models, including the number of estimators, learning rate, and tree depth, can significantly impact the accuracy and generalizability of predictions. Recognizing this gap, recent work has proposed various metaheuristic optimization techniques to address the challenges of manual parameter selection[24,25]. Among these, the forensic-based investigation optimization (FBIO) algorithm has emerged as an effective tool for fine-tuning the meta-parameters of ML models, ensuring that ensemble methods such as boosting and bagging achieve their full predictive potential[26]. FBIO mimics the process of forensic investigations, iteratively narrowing down the search space to find optimal solutions, thus improving model accuracy while reducing the computational effort required for parameter tuning[27].

This study seeks to fill the gaps in current research by developing a robust ensemble learning framework, integrated with FBIO, to predict the compressive strength of BPC. By utilizing a comprehensive dataset of 285 CS test records and employing advanced machine learning techniques such as RF, ADB, GBRT, and XGB, this research aims to provide more accurate and generalizable models for BPC strength prediction. Additionally, SHapley Additive exPlanations (SHAP) are used to interpret the influence of input variables on model predictions, ensuring transparency and interpretability of the results. This approach offers a significant improvement over existing methods by reducing the reliance on costly experimental testing while enhancing the understanding of how various factors-such as gravel content, curing time, and cement-affect BPC’s performance.

2 METHODOLOGY

This study employs several advanced machine learning algorithms to predict the compressive strength of BPC. ADB, introduced by Schapire[28], is a boosting method that combines weak learners iteratively, improving model accuracy over time by focusing on misclassified samples[29]. Decision trees (DT) and ANN are commonly used within this framework due to their versatility in handling both regression and classification tasks[30,31]. GBRT, based on the classification and regression tree (CART) method[32], improves predictions by incorporating poor learners in each iteration, reducing forecasting errors[33]. This technique is highly effective in improving the accuracy of weak learners, particularly in regression applications, making it a suitable candidate for predicting the compressive strength of BPC. XGB, developed by Chen et al.[34], builds upon the principles of GBRT and offers enhanced speed and performance in both classification and regression tasks. XGB is especially effective due to its regularization capabilities, which help prevent overfitting while optimizing the ensemble tree structure. RF, introduced by Svetnik et al[35], is another popular ensemble method that constructs multiple decision trees using bootstrap samples and averages their outcomes. It is valued for its simplicity, speed, and capacity to handle large datasets[36]. RF’s out-of-bag (OOB) sampling method allows for internal cross-validation, improving model accuracy and reducing bias. To further optimize the performance of these machine learning models, this study integrates FBIO, a metaheuristic approach that adjusts iteration values and population sizes without requiring predefined internal parameters. FBIO, introduced by Chou and Nguyen[37], simulates forensic investigation techniques to narrow the search space and find optimal solutions. This method is advantageous in fine-tuning the global meta-parameters of machine learning models, enhancing their predictive accuracy in complex tasks such as BPC strength prediction[27].

3 RESULTS AND DISCUSSION

3.1 Specifics of Data Gathering

The database employed in this study is made up of 285 datasets for CS[18,35,38-45]. This comprehensive database was collected from 29 published studies. Elwell and Fu[46] proposed UNESCO conversion factors to homogenize cylindrical and cubic CS values. As effective variables for BPC properties, this research examined the following variables: contents of gravel, bentonite, silty clay, curing time, sand, cement, and water. A lower correlation is observed between negative and positive values in the overall model variables, as shown in Figure 1. In addition, according to the correlation heat map analysis, water, cement, gravel, and curing time have a greater positive impact on the CS of BPC. In addition, the intended graph distribution is not uniform, so the developed models are applicable to a wide range of target data[47].

Figure 1. Coefficients of Pearson Correlation for CS.

Before modeling, the data were randomly split into testing (30%) and training (70%) parts. Table 1 displays the statistical characteristics of the output and input variables for the testing and training data for CS-BPC. A logical and technical range can be developed for each of the input variables by considering the minimum and maximum for each of the four data sets. In particular, these ranges are 0 to 1,060kg/m3 for gravel; 509 to 1,499kg/m3 for sand; 0 to 380kg/m3 for silty clay; 65 to 300kg/m3 for cement; 16 to 320kg/m3 for bentonite; 152 to 520kg/m3 for water; and 7 to 540 days for curing time. Tables 1 allows you to identify extreme data points (maximum and minimum), data centers (mean and median), data spread (standard deviation and variance), and distribution shapes (skewness and kurtosis)[47]. Moreover, the diversity among databases and the ability of models developed on them to generalize are illustrated by the diverse alterations observed in each of the outputs[12].

Table 1. Comprehensive Statistics for Both Testing and Training Data Related to CS-BPC

Compressive strength	Curing time	Water	Bentonite	Cement	Silty clay	Sand	Gravel	Statistic
							Training data: 199
0.6	7	152.1	16	65	0	509	0	Minimum
21.7	540	520	320	300	380	1499	1060	Maximum
4.2	66.2	330.8	52.6	158.8	71	813.2	686.1	Mean
3.2	28	340.4	40	150	0	750	750	Median
3.3	94	76.6	36.9	51.2	103.5	222.4	220.3	Standard deviation
11.1	8,854.4	5,873.1	1,363.1	2,626.5	10,712.5	4,9486.8	48,560.1	Variance
2.2	3.7	-0.3	2.8	0.8	0.9	1.5	-1.3	Skewness
6.8	15.2	-0.3	14	0.2	-0.6	1.3	1.4	Kurtosis
							Testing data: 86
0.8	7	162	16	50	0	524	0	Minimum
19.1	540	520	168	289	310	1,305	1,060	Maximum
4	102.7	341.8	55.6	156.3	91	805.3	673.5	Mean
3	28	340.2	49	142	0	730	750	Median
3.2	150.7	78.5	32.9	52.9	106.5	221.1	219.4	Standard deviation
10.8	22,729.7	6,177.9	1084	2,801.8	11,358.8	48,922	48,141.3	Variance
2	2.2	-0.1	1.8	0.6	0.4	1.4	-1.1	Skewness
5.2	3.6	-0.2	3.6	-0.1	-1.4	0.9	0.9	Kurtosis

3.2 Model efficiency Assessment Specifications

Several error metrics were employed to assess the accuracy of the models. Among these variables are R2, MAE, RMSE, MAPE, a20-index, and OBJ[13]. These statistical metrics are summarized as follows:

$${{R}}^{2}={\left[\frac{{\sum }_{{i}=1}^{{N}}({{Y}}_{{o}{b}{s}}-{\overline{{Y}}}_{{o}{b}{s}})({{Y}}_{{p}{r}{e}}-{\overline{{Y}}}_{{p}{r}{e}})}{\sqrt{{\sum }_{{i}=1}^{{N}}({{Y}}_{{o}{b}{s}}-{\overline{{Y}}}_{{o}{b}{s}}{)}^{2}{\sum }_{{i}=1}^{{N}}({{Y}}_{{p}{r}{e}}-{\overline{{Y}}}_{{p}{r}{e}}{)}^{2}}}\right]}^{2} \left(1\right)$$

$${M}{A}{E}=\frac{{\sum }_{{i}=1}^{{N}}\left|{{Y}}_{{p}{r}{e}}-{{Y}}_{{o}{b}{s}}\right|}{{N}} \left(2\right)$$

$${M}{A}{P}{E}=\frac{{\sum }_{{i}=1}^{{N}}\left|{{Y}}_{{p}{r}{e}}-{{Y}}_{{o}{b}{s}}\right|}{{\sum }_{{i}=1}^{{N}}{{Y}}_{{o}{b}{s}}}\times 100 \left(3\right)$$

$${R}{M}{S}{E}=\sqrt{\frac{1}{{N}}{\sum }_{{i}=1}^{{N}}({{Y}}_{{p}{r}{e}}-{{Y}}_{{o}{b}{s}}{)}^{2}} \left(4\right)$$

$${a}20-{i}{n}{d}{e}{x}=\frac{{m}20}{{N}} \left(5\right)$$

when N is the number of records, Ypre and Yobs show the predicted and actual values, and the bar items over the parameters indicate the average rate; The variable m20 shows the records quantity where the Yobs/Ypre ratio ranges from 0.80 to 1.20;

3.3 Algorithms for Hybrid Ensemble Learners

The FBIO was used in this study to determine the ideal values using the given criteria to set the first random values (Table 2). After these statistics were entered into EL approaches and the EL algorithms were trained using the training dataset, the objective function was determined to be the average RMSE of both data (test and train). Figure 2 provides a summary of the various EL-FBIO approaches. Table 3 presents the multiple meta-parameters values optimized for CS-PC models.

Figure 2. An Overview of The Modeling Process in This Study.

Table 2. Different Meta-Parameter Ranges Used in The Optimization Process.

Parameters	Considered Range	Parameters	Considered Range
Number of estimators	[5, 200]	Max_samples	[0.1, 1]
Min_samples_split	RF: [1, 10] Other methods: [1e-10, 1]	lu_ns	[2, 150]
Min_samples _leaf	RF: [1, 10] Other methods: [1e-10, 1]	lu_max_d	[2, 100]
Max_depth	[2, 500]	lu_max_mlf	[2, 100]
Max_features	[1, maximum number of variables]	lu_lr	[0.0001, 1]
Max_ leaf_nodes	[2, 500]	lu_gamma	[0, 10]
Ccp_alpha	[0, 1]	lu_gamma	[0, 10]
Min_weight_fraction_leaf	[0, 0.5]	lu_min_cw	[0, 1]
Learning rate	[0.001, 3]	lu_subsample	[0.5, 1]
Alpha	[0.001, 0.99]	lu_subsample_bt	[0.5, 1]
Subsample	[1e-6, 1]	reg_lambda	[0.01, 2]

Table 3. Optimized Parameter Values for CS-BPC

Parameters	Models
Parameters	ADB-FBIO	GBRT- FBIO	XGB- FBIO	RF- FBIO
Number of estimators	198	89	40	10
Min_samples_split	0.004	0.027	-	2
Min_samples _leaf	0.0003	0.008	-	1
Max_depth	500	87	137	395
Max_features	7	6	-	6
Max_ leaf_nodes	465	7	98	118
Ccp_alpha	3.57	4.17e-06	-	0
Min_weight_fraction_leaf	0.0029	0.017	-	0
Learning rate	0.23	0.49	0.47	-
Alpha	-	0.66	-	-
Subsample	-	0.76	0.53	-
Max_samples	-	-	-	0.95
Gamma	-	-	0.006	-
Min_child_weight	-	-	0.16	-
Reg_lambda	-	-	0.01	-
Colsample_bytree	-	-	0.57	-

3.4 Model Prediction Accuracy Study

Because of a higher R2-value and fewer scattered spots, the GBRT-FBIO approach outperforms conventional CS-BPC models in both phases of test and train, as shown in Figure 3. A20-index is a new significant engineering parameter that determines how many specimens have expected values that are at most 20% off from observed values[48]. Furthermore, with an a20-index of 0.5 and 0.88 throughout the testing and training step, the RF- FBIO had the lowest desire to perform well for CS-BPC.

Figure 3. Measured Versus Expected Scattering Dots in the CS-BPC Phases.

Different statistical indicators were evaluated for training and testing datasets in order to assess the precision of the proposed forecasting models. Table 4 shows what the results are for different statistical parameters. In the CS-BPC training phase, ADB is 0.14, 0.23, and 0.57MPa lower than GBRT, XGB and RF in terms of RMSE, while in the testing phase, GBRT outperformed ADB, XGB, and RF by 0.46, 0.04, and 0.43MPa of difference in RMSE, respectively. Despite the relative superiority of the ADB model during training, the GBRT model with MAE and MPAE of 0.43 and 0.16 respectively, is the most accurate in testing phase.

Table 4. The Precision and Effectiveness of Each CS-FBIO Model

Testing				Training				Models
MAPE	MAE	RMSE	R2	MAPE	MAE	RMSE	R2	Models
0.215	0.694	1.034	0.921	0.018	0.034	0.068	0.999	ADB- FBIO
0.155	0.43	0.572	0.970	0.047	0.148	0.206	0.996	GBRT- FBIO
0.167	0.464	0.611	0.965	0.075	0.239	0.296	0.992	XGB- FBIO
0.255	0.768	1	0.922	0.096	0.369	0.640	0.966	RF- FBIO

The effectiveness of each design was evaluated using the diagram of Taylor presented in Figure 4. To compare the anticipated outcomes with the actual values, three statistical measures of RMSE, STD, and R2) were used. The standard deviation is shown through a circle connecting the plot's axes of horizontal and vertical; RMSE is indicated by the horizontal green dots and the blue line shows the values of R2. As a result, among all techniques for CS-BPC, the GBRT-FBIO and XGB methods have the top performance.

Figure 4. Taylor Graphs of Several CS-FBIO Models and Traditional Approaches.

ML and EL models can accurately predict BPC strength and workability properties, as reported by several studies[14,18-21]. According to R2 compare, GBRT methods for CS outperformed all existing models during the testing and training levels. Based on RMSE and MAE values for CS, some existing models were superior, potentially due to the dataset’s quality. As a result, EL methods are more practical and generalize more effectively to BPC characteristics, thus saving time and resources.

Table 5. Comparing Proposed El Models to Those in the Literature

Model	Reference	Inputs No.	No. of Data	MAE		RMSE		R2
Model	Reference	Inputs No.	No. of Data	Test	Train	Test	Train	Test	Train
CS (MPa)
ANN	Tavana Amlashi et al.[18]	7	169	0.989	0.972	0.393	0.574	0.301	0.412
MARS	Tavana Amlashi et al.[18]	7	169	0.978	0.965	0.573	0.656	0.435	0.497
M5Tree	Tavana Amlashi et al.[18]	7	169	0.893	0.904	1.293	0.963	0.895	0.71
SVM	Ghanizadeh et al.[14]	7	72	0.993	0.991	0.432	0.451	0.191	0.17
ANN	Ghanizadeh et al.[14]	7	72	0.992	0.967	0.488	0.844	-	-
SVM	Tavana Amlashi et al.[19]	8	387	0.989	0.956	0.207	0.427	-	-
ANN	Tavana Amlashi et al.[19]	8	387	0.996	0.921	0.124	0.577	-	-
ANFIS	Tavana Amlashi et al.[19]	8	387	0.967	0.899	0.354	0.669	-	-
SVM	Tavana Amlashi et al.[20]	7	169	0.992	0.963	0.365	0.629	0.296	0.494
MGGP	Tavana Amlashi et al.[20]	7	169	0.956	0.944	0.803	0.987	0.605	0.616
GMDH	Tavana Amlashi et al.[20]	7	169	0.889	0.861	1.221	1.147	0.941	0.892
RSM	Tavana Amlashi et al.[20]	7	169	0.946	0.829	0.894	1.523	0.635	0.999
DT	Alishvandi et al.[21]	7	645	Overall: 0.87		Overall: 0.26		Overall: 0.121
RF	Alishvandi et al.[21]	7	645	Overall: 0.83		Overall: 0.297		Overall: 0.196
GB	Alishvandi et al.[21]	7	645	Overall: 0.67		Overall: 0.422		Overall: 0.308
XGB	Alishvandi et al.[21]	7	645	Overall: 0.86		Overall: 0.263		Overall: 0.152
SVM	Alishvandi et al.[21]	7	645	Overall: 0.52		Overall: 0.513		Overall: 0.357
KNN	Alishvandi et al.[21]	7	645	Overall: 0.79		Overall: 0.337		Overall: 0.121
ADB-FBIO	This Study	7	285	0.694	0.034	1.034	0.068	0.921	0.999
GBRT-FBIO	This Study	7	285	0.43	0.148	0.572	0.206	0.97	0.996
XGB-FBIO	This Study	7	285	0.464	0.239	0.611	0.296	0.965	0.992
RF-FBIO	This Study	7	285	0.768	0.369	1	0.64	0.922	0.966

3.5 SHAP

SHAP is a game-theoretic approach designed to describe the result of machine-learning methods[49]. SHAP presents the contribution of the feature to the mode’s output, offering a more interpretable and transparent understanding of the model’s decision-making process. In the ensuing sections, we thoroughly analyze the outcomes in the proposed predicting structure, which is designed to interpret and comprehend the results of the probabilistic predicting model. Our initial focus is on point forecasts, specifically examining how the developed model utilizes various features to make predictions. The SHAP method is employed for explanations, covering CS. Figure 5 illustrates the average contribution of each feature, with each bar plot representing the importance of a specific property. Cement and water play significant roles in CS model, contributing more substantially. Also, Silt exhibits minimal impact on the outputs.

Figure 5. Feature Significance of the Input Variables.

Each dot in Figure 6 represents a distinct forecasting, and its location along the x-axis signifies the impact of that attribute on the output of the model. Furthermore, each dot's color corresponds to a feature value (varies from blue to red) emphasizing the relative contributions of different feature values to the final result. The long tails show characteristics that are highly significant. The dots’ vertical distribution suggests that there are more findings with comparable effects. These SHAP summary graphs in such a setting include details on the number of reports that have those qualities as well as the size and direction of each feature's effect. As an example, elevated cement values in CS model tend to elevate the model output, while values closer to zero for cement lead to a decrease in the model output. Therefore, the impact on the model output becomes more substantial with higher cement values.

Figure 6. Summary Plot of the Point Predicting Model.

A heatmap of SHAP values across all input variables is shown in Figure 7. CS was shown on top as functions of all variables. The ranges of SHAP values which show the impact on each model target are depicted by various colors ranging from blue to red. For this heatmap, arbitrary sample pools equal to training data sets were chosen. Considering the intense 11 on the left side of the CS-BPC, water and cement appear to be two key input parameters.

Figure 7. SHAP Values and Function Variations of Each Target.

3.6 Online Application of Proposed BPC Models

Models developed using EL-FBIO methods differ from classical regression methods as they do not simply establish direct relationships between inputs and outputs[50]. To address this, implementing an online application would allow researchers and practicing engineers, who are the end users of the proposed BPC models, to easily estimate mechanical and workability properties. Several previous studies have developed software using MATLAB Graphical User Interface to predict the properties of different types of concrete[51,52]. The proposed online application offers several key advantages: (i) faster access to results, enabling more thorough investigation of mix designs; (ii) reduced production costs, while ensuring the safety and quality of concrete designs, by allowing users to determine whether a mix design is reasonable; and (iii) ease of use, which minimizes human error in calculations[37]. Free online access is provided as well (https://colab.research.google.com/drive/1a3pjScLsripH-756yYk2nyol5g7HM9U1), allowing engineers and researchers to obtain accurate predictions of BPC strength and workability parameters directly at their project site in just a few simple steps.

3.7 Limitations

The dataset of 285 compressive strength records used in this study is relatively small, which could limit the robustness and generalizability of the machine learning models developed. Small datasets increase the risk of overfitting, where models perform well on the training data but struggle to generalize to new, unseen data. Additionally, the dataset, compiled from various published sources, may contain inherent biases due to variations in material preparation, testing environments, and input parameters. These inconsistencies could affect the model’s ability to accurately predict compressive strength in broader applications. While ensemble models like RF, GBRT, and XGB are generally robust, they are sensitive to the quality and diversity of the input data, and with a limited dataset, they may not capture the full range of variability needed for real-world applications. Cross-validation and regularization techniques were applied to mitigate some of these concerns, but the findings should still be interpreted cautiously. Future work should focus on expanding the dataset through additional experiments or synthetic data generation, and employing uncertainty quantification methods to enhance the reliability and generalizability of the predictions in practical engineering contexts.

4 CONCLUSION

This research introduces an advanced machine learning approach for predicting the CS of BPC using Python-based ensemble techniques, including RF, ADB, GBRT, and XGB, all optimized through FBIO algorithms. The challenge of accurately assessing BPC’s compressive strength through experimental or field evaluation lies in the complexity, high costs, and need for professional expertise. To overcome these barriers, this study proposes an improved and accessible computational method, allowing for more efficient and reliable predictions of BPC properties. To build the predictive models, 285 data records of CS tests were collected from a variety of published literature sources. The selected input variables-gravel, bentonite, silty clay, curing time, sand, cement, and water-were used to train the models and assess their performance. A comprehensive evaluation of each model was performed using key performance metrics, including the R², MAE, RMSE, MAPE, and the a20-index, for both the training and test datasets. Among the models, GBRT emerged as the most accurate for predicting BPC compressive strength, consistently outperforming other ensemble methods based on the evaluation metrics. In addition to superior predictive accuracy, the study utilized SHAP to interpret the model’s predictions and provide insights into the relative importance of each input variable. The SHAP analysis demonstrated that cement was the most significant factor influencing BPC’s compressive strength, with a mean SHAP value of 0.17, indicating its substantial impact on model predictions. This insight emphasizes the importance of cement variations in the overall performance of BPC and highlights the practical value of the model in optimizing BPC mix designs. Overall, the findings of this research underscore the potential of ensemble learning methods, particularly GBRT with FBIO optimization, in offering a robust and efficient alternative to traditional testing methods for BPC. The developed models not only reduce the need for expensive and time-consuming experimental evaluations but also provide a practical tool for researchers and engineers to predict BPC properties with high accuracy.

As a suggestion for future work, we recommend exploring the application of advanced machine learning techniques, such as deep neural networks (DNNs) and transfer learning, which may enhance model performance, particularly for small datasets. To address scalability concerns, future studies should also investigate the feasibility of applying FBIO to larger datasets or real-time applications by integrating more efficient optimization algorithms, such as Bayesian optimization or genetic algorithms, which could improve hyperparameter tuning while reducing computational costs. Expanding the dataset through additional experiments or synthetic data generation could further improve model generalizability. Additionally, exploring parallel processing or cloud-based computing could help optimize FBIO’s scalability for real-time use. Finally, incorporating uncertainty quantification methods would offer valuable insights into the reliability and robustness of predictions in practical engineering applications.

Acknowledgements

Thanks to Dr. Ali Reza Ghanizadeh and Dr. Samer Dessouky, for their invaluable guidance and support.

Conflicts of Interest

The authors declared no conflict of interest.

Data Availability

All data generated or analyzed during this study are included in this published article.

Copyright Permissions

Copyright © 2024 The Author(s). Published by Innovation Forever Publishing Group Limited. This open-access article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, sharing, adaptation, distribution, and reproduction in any medium, provided the original work is properly cited.

Author Contribution

Amlashi AT was responsible for conceptualization, conducting the investigation, providing resources, drafting the original manuscript, and project administration. Ghanizadeh AR contributed to the methodology, validation, software development, visualization, and provided supervision. Dessouky S oversaw the supervision and contributed to the review and editing of the manuscript.

Abbreviation List

ADB, Adaptive boosting

AI, Artificial intelligence

ANN, Artificial neural network

BPC, Bentonite plastic concrete

CS, Compressive strength

DT, Decision trees

EL, Ensemble learning

FBIO, Forensic-based investigation optimization

GBRT, Gradient boosting regression tree

ML, Machine learning

OOB, Out-of-bag