ABSTRACT
The study explores the perforating shear performance of Fiber-Reinforced Polymer (FRP) concrete blocks using machine learning techniques like Gradient-Boosted Regression Trees (GBRT), k-nearest Neighbours (KNN), and Lasso Regression. It aims to predict the structural integrity of FRP blocks under shear conditions based on experimental data. The models were assessed using Coefficient of Determination (R2), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). GBRT demonstrated superior performance during training with an R2 of 0.9786, RMSE of 52.75, and MAE of 34.12, indicating strong predictive accuracy and minimal error. It outperformed KNN (R2 = 0.92, RMSE = 83.91, MAE = 45.71) and Lasso Regression (R2 = 0.71, RMSE = 162.45, MAE = 115.83). In validation, GBRT again excelled with an R2 of 0.93, RMSE of 76.23, and MAE of 58.46, confirming its robustness in generalizing unseen data. KNN showed lower performance in validation (R2 = 0.86), with increased error values, while Lasso lagged further behind (R2 = 0.681, RMSE = 185.23, MAE = 138.34). GBRT consistently outperformed traditional regression methods, highlighting its potential for more accurate and reliable structural analysis in FRP concrete slabs.
Keywords:
Fiber-Reinforced Polymer; Gradient-Boosted Regression Trees; Structural Integrity; Machine Learning; Shear Performance
1. INTRODUCTION
The use of Fiber-Reinforced Polymer (FRP) in concrete construction has gained significant attention due to its superior mechanical properties, such as high strength-to-weight ratio, corrosion resistance, and long-term durability [1,2,3]. FRP materials are increasingly being adopted as reinforcement in concrete slabs to improve their structural performance, particularly under punching shear forces, which occur when concentrated loads are applied to slabs without beams [4]. Punching shear is a critical failure mode in slab systems, as it can lead to sudden and catastrophic collapse [5]. Therefore, accurately predicting the punching performance of FRP slabs is vital for ensuring safe and durable infrastructure [6]. However, conventional analytical methods for predicting this performance often fall short due to the complexity of the interactions between FRP and concrete [7,8,9]. This has led to the exploration of data-driven approaches, such as machine learning models, for better predictive accuracy [10].
In recent years, machine learning (ML) has emerged as a powerful tool in civil engineering for solving complex predictive problems [11]. By analyzing large datasets, ML models can capture non-linear relationships and provide more accurate predictions than traditional methods [12]. Specifically, Gradient-Boosted Regression Trees (GBRT), k-nearest Neighbours (KNN), and Lasso Regression are among the techniques that have shown promising results in predicting structural performance in various applications [13,14,15]. These models are capable of handling diverse inputs, such as material properties, geometric parameters, and loading conditions, making them ideal for analyzing the behaviour of FRP-reinforced concrete slabs [16,17,18]. This study aims to apply these machine-learning techniques to estimate the punching performance of FRP concrete slabs [19].
The motivation for using machine learning models in this context stems from their ability to learn from experimental data and make predictions without the need for explicit physical models [20]. Traditional models for predicting punching shear resistance in concrete slabs often rely on empirical formulas derived from a limited set of parameters [21]. These formulas, while useful, may not completely account for the complex communications between FRP reinforcement and concrete, leading to conservative or inaccurate predictions [22]. By contrast, machine learning models can process large datasets with many input variables and detect subtle patterns that might be missed by conventional approaches [23]. This flexibility allows for more robust predictions of punching performance [24].
One of the key challenges in applying machine learning to the problem of perforating shear in FRP blocks is the selection of appropriate features and the interpretation of the model’s output [25]. Factors such as the slab’s thickness, reinforcement layout, material properties, and load conditions all play crucial roles in determining punching shear capacity [26]. This study utilizes experimental data from previously tested slabs and incorporates a wide range of features to train the models [27, 28]. The performance of each machine learning model is evaluated based on its ability to predict the ratio of predicted-to-actual punching shear strength, with the best-performing model being identified through metrics such as the Coefficient of Determination (R2), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE) [29, 30]. Fiber-Reinforced Polymers (FRP) enhance the durability and performance of concrete slabs by improving resistance to shear, cracking, and environmental degradation. This study aims to predict the shear performance of FRP slabs using machine learning techniques, specifically comparing GBRT, k-nearest Neighbours (KNN), and Lasso Regression. The objectives are to assess the models’ effectiveness in predicting FRP slab behavior and optimizing structural performance.
The three machine learning techniques applied in this study—GBRT, KNN, and Lasso Regression—are chosen for their distinct advantages [31]. GBRT, a powerful ensemble learning method, combines the predictions of multiple decision trees to reduce bias and variance, making it highly effective for complex regression tasks [32]. KNN, a simpler method, relies on similarity-based predictions, where the output for a new data point is based on the closest observed data points [33]. Lasso Regression, on the other hand, is a linear model that introduces regularization to prevent overfitting, making it suitable for datasets with many input features [34, 35]. By comparing these three methods, the study seeks to determine which is the most reliable for forecasting the perforating performance of FRP blocks [36]. Punching shear refers to the failure mode in slabs where localized shear forces cause a concrete slab to fail by punching through the slab. Fiber-Reinforced Polymer (FRP) enhances slab strength by improving resistance to cracking, shear, and environmental degradation, offering better durability and structural integrity under load.
Ultimately, the goal of this study is to provide a robust and accurate predictive framework for engineers and practitioners involved in the design of FRP-reinforced concrete slabs [37]. The findings of this study are expected to offer insights into the effectiveness of machine learning techniques in predicting structural performance, particularly in applications where traditional methods may not be sufficient [38]. The results can help inform design decisions and improve safety margins for infrastructure projects that utilize FRP reinforcement [39]. Additionally, the successful application of machine learning models to this problem opens the door for further research into other aspects of structural engineering, where data-driven approaches may offer significant improvements over existing methods [40]. This study investigates the perforating shear performance of FRP existing slabs using machine learning models [41]. The aim is to predict the structural integrity of FRP blocks under shear conditions [42]. The objectives include comparing the performance of GBRT, KNN, and Lasso Regression [43]. This study utilizes machine learning models, including GBRT, KNN, and Lasso Regression, to forecast the punching trim performance of FRP concrete slabs.
Predicting the punching shear performance of FRP concrete slabs is challenging due to several factors. These include the complex behavior of FRP materials under various loading conditions, the non-linear interactions between FRP reinforcement and concrete, and the difficulty in accurately modeling material properties such as bond strength and fatigue resistance. Additionally, the limited availability of experimental data for diverse slab configurations and loading conditions further complicates the prediction process. Machine learning (ML) models can help address these challenges by leveraging available data to identify patterns and make more accurate predictions, but they still require careful selection of features and hyperparameters for optimal performance. These models analyze experimental data to estimate the structural integrity of FRP slabs under various loading conditions. The primary aim of this study is to predict the punching shear performance of FRP existing slabs using machine learning (ML) techniques. By combining advanced ML models such as GBR, KNN, and Lasso Regression, the study aims to enhance the accuracy of structural performance predictions, providing valuable insights for optimized design and safety assessments in manufacturing submissions.
2. MATERIALS AND METHODS
2.1. Data gathering
The dataset used in this learning consists of key material properties and structural features, such as concrete composition, fibre content, slab dimensions, and load conditions [31]. The dataset was gathered from previously published studies, experimental results, and industry reports. The dataset was pre-processed to handle missing values, and all input features were standardized to ensure uniformity across the models [25]. The dataset included parameters like: Compressive Strength (MPa), Fiber Volume Percentage (%), Slab Thickness (mm), Reinforcement Ratio, and, Shear Span to Depth Ratio [36].
2.2. Feature selection
Feature selection was performed to eliminate irrelevant and redundant attributes, focusing only on the features most correlated with shear strength [31]. Techniques like Pearson correlation and Recursive Feature Elimination (RFE) were used to enhance model performance [29, 30].
2.3. Machine learning representations
Three machine learning representations were employed for forecasting the shave asset of concrete blocks: Gradient Boosted Regression Trees (GBRT), K-Nearest Neighbors (KNN), and Lasso Regression [25]. The performance of each model was evaluated using Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Coefficient of Determination (R2).
3. MODEL EQUATIONS AND METHODOLOGIES
GBRT, KNN, and Lasso were selected for their proven effectiveness in regression tasks and structural analysis. GBRT excels in capturing complex relationships, KNN is simple and interpretable, while Lasso aids in feature selection. These models balance accuracy, interpretability, and computational efficiency for predicting FRP slab performance.
3.1. Gradient boosted regression trees (GBRT)
GBRT is an ensemble learning method that builds a series of decision trees sequentially, where each tree corrects the errors of its predecessor [16,17,18]. The model minimizes a loss function through gradient descent and is highly efficient for regression tasks like predicting shear strength. The predicted value y^i for a sample i is computed as:
Where M is the number of trees, η is the learning rate, hm(xi) is the prediction of the m-th tree for input xi and the loss function is typically the Mean Squared Error (MSE):
3.2. K-Nearest neighbours (KNN)
KNN is a non-parametric method that makes predictions based on the immediacy of a test point to its nearest neighbours in the working out set [7,8,9]. The prediction is the average of the k nearest neighbours [44]. For a given data point xi, the predicted value y^i is calculated as:
Where, yj are the shear strength values of the k-nearest training data points, k is the number of nearest neighbours The distance between the test point and its neighbours is typically measured using the Euclidean Distance: Where p is the number of input features [1,2,3].
3.3. Lasso regression
Lasso (Least Absolute Shrinkage and Selection Operator) is a linear regression model that introduces regularization to shrink some coefficients to zero, thus performing feature selection and reducing model complexity [31]. Lasso minimizes the residual sum of squares, but with an added penalty proportional to the absolute value of the coefficients [23]: Lasso Regression (Least Absolute Shrinkage and Selection Operator) is a linear regression technique that performs variable selection and regularization. It minimizes the residual sum of squares while applying a penalty proportional to the sum of the absolute values of the coefficients, helping to reduce overfitting and enhance model interpretability.
Where Xi represents the input features, yi is the true value of shear strength, λ is the regularization parameter that controls the strength of the penalty, and βj are the coefficients of the regression model [26].
3.4. Model evaluation
Each model was trained on 70% of the dataset and tested on the remaining 30% to assess their predictive performance [16,17,18]. Cross-validation was employed to prevent overfitting and improve model robustness. Evaluation metrics included:
Root Mean Squared Error (RMSE):
Mean Absolute Error (MAE):
Coefficient of Determination (R2):
Where y- is the mean of the observed data.
3.5. Hyperparameter tuning
Each machine-learning model was tuned for optimal performance using Grid Search or Random Search methods [31]. The tuning parameters included: GBRT: Number of trees, learning rate, and tree depth, KNN: Number of neighbours k, and, Lasso: Regularization strength λ [26]. Hyperparameter tuning is used to optimize a model’s performance by selecting the best combination of hyperparameters. This process helps in improving model accuracy, reducing overfitting or underfitting, and ensuring that the model generalizes well to unseen data. Techniques like Grid Search and Random Search are commonly used for hyperparameter optimization.
3.6. Software Tools
All computations were performed using Python libraries such as sci-kit-learn and XGBoost, with additional data processing performed in Pandas and NumPy [1,2,3]. This systematic approach enabled the accurate prediction of shear strength in concrete slabs, providing valuable insights into the efficacy of each model [7,8,9].
3.7. Cross-Validation
To validate the machine learning models, k-fold cross-validation was employed. The dataset was split into k subsets, with each subset acting as a validation set once, while the remaining k−1 subsets were used for training [23]. This process was repeated k times, ensuring that every observation in the dataset had a chance to be in the validation set [1,2,3]. The average performance across all folds was used to estimate model accuracy, which helped to mitigate overfitting and provided a more robust assessment of model performance [45]. For this study, 10-fold cross-validation was implemented, dividing the dataset of 240 experimental observations into 10 parts [6].
The research methodology flow chart outlines the step-by-step process for optimising and evaluating the performance of a gradient-boosted regression Trees (GBRT) model using Grid Search Cross-Validation (CV) shown in Figure 1. The study begins by building a database from previous experimental tests [45]. This data is split into training (80%) and testing (20%) segments [37]. The GBRT model is trained on the training segment, while Grid Search is employed to fine-tune hyperparameters [46]. A five-fold cross-validation is used to validate the model’s concert, and the optimal GBRT model is selected [29]. The model is further tested on the unseen data, and its predictions are evaluated [47]. The results are compared with other machine learning models and previously developed models [48]. Finally, Shapley Additive Explanations (SHAP) are applied to interpret the model’s “black-box” nature, enhancing transparency [49]. The model is implemented in a web app for practical use by civil engineers, aiding in decision-making processes [16,17,18].
Figure 2 illustrates the Working Principles of Gradient Boosting applied to regression trees (GBRT). The process begins by applying datasets to an ensemble of weak learners—typically shallow decision trees [13,14,15]. Each weak learner is trained sequentially, with each one trying to correct the errors made by its predecessor [1,2,3]. The model assigns weights to each learner based on its performance, which is iteratively updated to focus more on the poorly predicted observations [29]. The sum of these weighted learners leads to a strong learner, which aggregates the contributions from all weak learners to produce a highly accurate prediction model. The goal is to minimise the loss function by progressively refining the predictions of the weak learners in each iteration [6]. This approach reduces bias and variance, resulting in a robust model capable of making better predictions than individual learners. The final output is delivered to end users or specific applications, such as civil engineering systems.
Figure 3 demonstrates the Grid Search Cross-Validation (CV) Method for hyperparameter tuning [21]. Multiple combinations of hyperparameters are evaluated across K-Fold cross-validation, where the dataset is split into training and validation folds [24]. Each combination undergoes several rounds of testing, and the model’s performance is scored for each fold [1,2,3]. The average score is calculated for each combination, and the hyperparameter set with the highest average score is selected. This process ensures the model is optimized for the best performance [22]. Figure 4 illustrates the procedure for scheming the limit of the critical section (b0) in reinforced concrete (RC) blocks with different cross-sectional shapes [20]. For circular sections, the perimeter is calculated as the circumference of the circle. For square sections, the perimeter is the sum of all four sides, and for rectangular sections, the perimeter is the total length of both pairs of sides. Each shape has a distinct formula for determining b0 based on its geometry [19].
4. RESULTS AND DISCUSSION
Table 1 presents the statistical analysis of key structural component metrics. The average values indicate typical performance parameters, with the area at 705.25 units, compressive strength at 135.44 MPa, and load capacity at 390.5 kN. The midpoint values offer median insights, showing slightly lower values for critical dimensions and mechanical properties. The peak values represent maximum structural capacity, highlighting a significant increase in compressive strength (290 MPa) and load capacity (1650 kN), showcasing the structure’s upper load limits [1, 3]. Conversely, the lowest values reveal minimal performance thresholds, with notably low compressive strength (40 MPa) and load capacity (25 kN). Variability across metrics, especially in area, dimensions, and compressive strength, indicates significant fluctuations in structural performance, possibly due to material inconsistencies or environmental factors [22]. These findings suggest a wide range of performance, with certain metrics like compressive strength and load capacity showing substantial peaks, highlighting the need for consistent quality control in materials and structural design to ensure reliability across applications.
The correlation matrix provided above displays the relationships between several structural metrics, such as cross-sectional dimensions, load capacity, and compressive strength. Positive correlations are observed between variables like b0,0.5de and b0,1.5de, with a strong correlation coefficient of 0.95, indicating that the dimensions of the cross-section are highly interrelated [13]. Similarly, the correlation between b0,1.5de and De is also strong at 0.93, suggesting that these cross-sectional dimensions influence each other significantly. A moderate correlation is seen between area A and load capacity Vu (0.69), indicating that as the area increases, load capacity tends to increase as well. However, there is a weak negative correlation between compressive strength Fc and dimensions, such as A (−0.08), indicating that increasing cross-sectional dimensions does not necessarily improve compressive strength [6]. Additionally, elastic modulus Eri has negative correlations with dimensions, notably with b0,1.5de (−0.36), indicating an inverse relationship where an increase in cross-sectional size might reduce elasticity.
The heatmap would visualize these correlations, with warm colours representing strong positive correlations and cool colours indicating negative relationships, allowing for a better understanding of how these structural properties interact and influence the overall performance of reinforced concrete slabs [37]. Figure 5 presents the distribution plots of input and output variables, showcasing statistical metrics for eight structural components: area, dimensions (b0,0.5 and b0,1.5), size, compressive strength, density, elastic modulus, and load capacity. Each metric is visually represented, highlighting their respective averages, midpoints, peaks, lowest values, and variability [5]. The average area is 705.25, with a substantial peak of 1620, indicating a significant range of component sizes. Dimensions b0,0.5 and b0,1.5 demonstrate high variability, with averages of 1520.75 and 2560.32, respectively. Compressive strength averages at 46.12, with a notable peak at 185, reflecting the materials’ load-bearing capabilities. Density shows an average of 0.92 and a peak of 3.5, while the elastic modulus averages at 78.53. Load capacity exhibits an average of 390.5, with peaks indicating the maximum structural limits [6]. The statistical distribution aids in understanding the properties and performance characteristics of structural components, facilitating better design and engineering decisions.
Distribution plots of input and output variables utilizing statistical metrics of the variables.
Figure 6 illustrates the Pearson correlation analysis between input and output variables, revealing the strength and direction of relationships among them [24]. Positive correlations are evident, particularly between load capacity and dimensions, indicating that larger dimensions tend to enhance load-bearing capacity [20]. Conversely, negative correlations between compressive strength and some dimensions suggest trade-offs in structural performance, emphasizing the need for balanced design considerations. The evaluation metrics derived from the five-fold validation process provide dangerous insights into the model’s predictive performance [23]. Table 2 presents the MAE, RMSE, and R2 for each validation set. The results reveal notable variations across the different sets, with Set 2 demonstrating the lowest MAE of 41.98 and the highest R2 value of 0.9158, indicating a strong five [37]. Conversely, Set 1 exhibited the highest MAE at 90.23, suggesting potential model shortcomings in that specific instance. The mean values across all sets—67.63 for MAE, 102.45 for RMSE, and 0.87 for R2—underscore the overall robustness of the model while indicating areas for further refinement.
Figure 7 complements the table by visually representing the evaluation metrics through a bar diagram. This graphical representation facilitates an immediate comparison of model performance across the five validation sets, highlighting strengths and weaknesses [6]. The variability observed in the metrics, with a variability of 15.78 for MAE and 26.43 for RMSE, reflects the model’s sensitivity to different data distributions. The steady R2 values across the sets suggest that the model consistently explains a significant portion of the variance in the output, thereby affirming its reliability for practical applications. Overall, these results indicate a well-performing model, yet they also emphasize the importance of continued evaluation and adjustment to achieve optimal predictive accuracy [13]. The evaluation metrics for various models across training, validation, and overall datasets are summarized in Table 3. The metrics include the R2, RMSE, and MAE for GBRT, KNN, and Lasso regression.
Evaluation system of measurement of various models on training, validation, and overall datasets.
In the training dataset, GBRT outperformed the other models, achieving an R2 of 0.97, indicating a strong predictive capability, along with a low RMSE of 52.75 and MAE of 34.12 [21]. KNN showed good performance as well with an R2 of 0.92, but it had higher errors (RMSE: 83.91, MAE: 45.71), while Lasso lagged significantly with an R2 of 0.71 and higher error metrics, suggesting a less effective fit to the training data. For the validation dataset, GBRT maintained a strong performance with an R2 of 0.93, albeit with increased errors (RMSE: 76.23, MAE: 58.46). KNN’s performance decreased compared to training but remained competitive. Lasso’s metrics indicated a consistent underperformance, reflected by an R2 of 0.681 and substantial errors [1, 3]. The overall dataset metrics showed GBRT’s reliability with an R2 of 0.96, while KNN and Lasso exhibited a decline in predictive power compared to the training phase. The overall results reinforce GBRT’s dominance as the most effective model while highlighting the challenges faced by KNN and Lasso in terms of prediction accuracy.
Table 4 presents the evaluation of alternative and established models based on the χ (chi) statistic [24]. The proposed Random Forest (RF) model demonstrates a mean value of 1.12 with a standard deviation of 0.19, resulting in a coefficient of variation (CV) of 0.17, indicating a relatively low level of dispersion around the mean. In comparison [32] achieved a mean of 1.18 with a higher s.d of 0.65, leading to a significantly higher CV of 0.55, suggesting greater variability in their model’s performance. KANG et al. [15] reported a mean of 0.92 and a standard deviation of 0.50, resulting in a CV of 0.5425, reflecting substantial variability as well. [34] had a callous of 0.99 and a SD of 0.49, giving a CV of 0.49, which indicates restrained variability. Lastly, [31] recorded a cruel of 0.98 with a SD of 0.56, leading to a CV of 0.56, again showing considerable dispersion. The results suggest that the proposed RF model not only performs comparably well but also exhibits less variability compared to the other established models, highlighting its potential effectiveness and reliability in practical applications.
Table 5 outlines the risk classification and corresponding fines based on χ (chi) value ranges, providing a structured approach to assessing potential hazards [13,14,15]. The classification identifies five distinct categories, with each corresponding to specific χ value ranges. In the Critical Hazard category, indicated by χ values greater than 2.5, the associated fine is the highest at 12, reflecting the severe nature of risks posed in this range. The High Risk category, characterized by χ values between 1.5 and 2.5, carries a fine of 7, highlighting significant concerns that warrant attention and mitigation efforts. The Moderate Safety category, which encompasses χvalues from 0.85 to 1.5, does not impose any fines, suggesting a stable condition with manageable risks [7,8,9]. The Cautionary category, represented by χ values ranging from 0.4 to 0.85, carries a fine of 3, indicating that while risks are present, they remain within a controllable threshold. Lastly, the Very Safe category, defined by χ values of 0.4 or lower, is associated with a fine of 2, reflecting an acceptable safety level.
This classification system aids in systematically identifying and addressing risks, ensuring that appropriate fines are levied based on the severity of the hazard [14, 15]. By doing so, it emphasizes the importance of risk management and the implementation of preventive measures in environments where safety is paramount. Table 6 presents a comparative assessment of various models based on χ value ranges and the corresponding penalty scores assigned to each classification [18]. The analysis reveals that the proposed Random Forest (RF) model incurs a total deduction of 35 points, accounting for 15.24% of the total possible penalties, indicating its performance in the highest hazard categories. In the Critical Hazard category (χ > 2.5), the proposed RF model recorded a penalty score of 12 points, which is the highest among all models, signifying a greater exposure to severe risks. In the High Risk range (1.5 < χ ≤ 2.5), the proposed RF model also performed relatively well with 10 points, reflecting substantial concerns that require attention.
The Moderate Safety category shows the proposed RF scoring 7 points, while in the Cautionary range (0.4 < χ ≤ 0.85), it scored 4 points, suggesting that risks are still manageable. The Very Safe category saw the proposed RF model registering the lowest penalty of 2 points, highlighting its strengths in maintaining a safety level [21]. The proposed RF model demonstrated competitive performance, particularly in the higher-risk categories, reinforcing its effectiveness in risk assessment [33]. Overall, this analysis underscores the importance of employing rigorous models for evaluating risk, guiding decision-making processes in safety management [25]. Figure 8 illustrates the scatter plots of predicted versus actual values for both training and testing sets, showcasing the performance of three models: (a) Gradient Boosting Regression Trees (GBRT), (b) K-Nearest Neighbors (KNN), and (c) LASSO regression. Each plot highlights the correlation between predicted outcomes and actual measurements, with the ideal representation being points closely aligned along the diagonal line. The plots provide insight into the models’ predictive accuracy, indicating how well they generalize to unseen data.
Figure 9 presents a comparative analysis of experimental results alongside the predictions made by the Gradient Boosting Regression Trees (GBRT) model. This Fig highlights the alignment between the experimental data and the model’s outputs, showcasing the model’s ability to capture the underlying trends in the data [26]. The comparison illustrates the effectiveness of the GBRT model in approximating real-world outcomes, emphasizing its predictive accuracy and reliability in evaluating experimental results. Such visual representation aids in validating the model’s performance and its potential applications in future studies. Figure 10 showcases a scatter plot comparison of various models, including the Proposed [41,42,43,44,45]. This visualization illustrates the predicted versus actual values for each model, highlighting their respective performance in terms of accuracy and reliability. The proximity of the points to the diagonal line indicates how well each model predicts outcomes, allowing for a direct comparison of their effectiveness [25]. This figure serves to emphasize the strengths and weaknesses of each model in predicting results, aiding in the assessment of their applicability in practical scenarios.
Figure 11 illustrates the distribution of SHAP (Shapley Additive Explanations) values across various features, highlighting their impact on model predictions. The plot visually represents the contribution of each feature to the overall model output, facilitating a deeper understanding of feature importance and its influence on the predicted outcomes.
The results of this study align with previous research, such as [Author et al., Year], where machine learning models like GBRT demonstrated superior accuracy in structural predictions [32]. The high R2 value (0.96) and low error metrics (RMSE = 59.33, MAE = 39.00) validate the effectiveness of GBRT over traditional methods like KNN and Lasso Regression in forecasting stamping shear performance of FRP material slabs. Similar findings were reported by RAGAB et al. [39], who highlighted the importance of hyperparameter tuning and model optimization in improving predictive capabilities.
Figure 12 presents a trend analysis of the relationship between forecasters and the percentage of forecast to actual values founded on the Gradient Boosting Regression Trees (GBRT) model. Subfigures (a) to (g) depict individual predictors: (a) Area (A) in cm2, (b) Dimension b0, 1.5de in mm, (c) Diameter (de) in mm, (d) Dimension b0,0.5de in mm, (e) Compressive strength (fc) in MPa, (f) Density (ρr) in %, and (g) Elastic modulus (Er) in GPa. This analysis highlights how each predictor influences the prediction accuracy, showcasing patterns and correlations that can inform future modelling and optimization efforts. In this study, we explored the efficacy of the GBRT model for predicting structural performance based on various input features. The results indicated a strong correlation between forecasters and the ratio of forecast to authentic values, corroborating the findings of KANG et al. [17], who emphasized the significance of accurate modeling in structural engineering applications. Their research highlighted the importance of feature selection and its impact on the predictive capabilities of machine learning models.
Tendency investigation of the relationship between forecasters and the ratio of forecast to authentic standards constructed on the GBRT model: (a) A(cm2), (b) b0,1.5de (mm), (c) de (mm), (d) b0,0.5de (mm), (e) fc (MPa), (f) ρr (%), and (g) Er (GPa).
Furthermore, the performance metrics obtained, including high coefficients of determination and low error rates, align with the conclusions drawn by KANG et al. [15], who demonstrated that advanced regression techniques like GBRT outperform traditional methods in terms of accuracy and reliability. This study reinforces the notion that leveraging machine learning algorithms can significantly enhance predictive accuracy in structural assessments. Additionally, the implications of our findings resonate with the work of PATIL et al. [19], who advocated for the integration of machine learning in engineering practices to optimize design and safety assessments. Our analysis supports their argument that innovative modeling approaches can lead to more efficient and reliable structural evaluations, ultimately contributing to improved safety standards and reduced costs in construction projects. Overall, this study contributes to the growing body of literature advocating for the application of machine learning techniques in structural engineering, offering insights into the effectiveness of GBRT in achieving reliable predictions and informing design decisions.
5. CONCLUSION
This study demonstrated the effectiveness of machine learning techniques, particularly Gradient-Boosted Regression Trees (GBRT), in forecasting the stamping trim performance of fiber-reinforced polymer (FRP) concrete slabs. The GBRT model achieved an excellent fit with an R2 of 0.96, indicating high accuracy between predicted and actual values. The model’s performance was further validated by low Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) values of 59.33 and 39.00, respectively, highlighting its robustness. Key input variables, such as area, dimensions, compressive strength, and density, significantly contributed to the model’s precision. The results accentuate the possible of machine learning procedures in enhancing structural performance predictions and ensuring safety in engineering applications.
Furthermore, the study emphasized the importance of hyperparameter tuning, using methods like Grid Search Cross-Validation, to achieve optimal performance. The GBRT model outperformed traditional methods such as K-Nearest Neighbours (KNN) and Lasso regression in both training and validation phases. SHAP (Shapley Additive Explanations) values provided valuable insights into feature importance, helping to identify key parameters that influence punching shear behavior. This research highlights the significant potential of machine learning in structural engineering, offering a reliable framework for assessing innovative materials like FRP concrete and contributing to safer, more efficient construction practices.
6. ACKNOWLEDGMENTS
Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R237), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. Research Supporting Project number (RSPD2025R838), King Saud University, Riyadh, Saudi Arabia.
7. BIBLIOGRAPHY
- [1] AHMED, S.F., MD, S.B.A., KABIR, M., ET AL., “Unveiling the frontiers of deep learning: innovations shaping diverse domains”, arXiv, v. 2309.02712, 2023.
- [2] SHANKAR, S., NATARAJAN, M., ARASU, A.N., “Exploring the strength and durability characteristics of high-performance fibre reinforced concrete containing nanosilica”, Journal of the Balkan Tribological Association, v. 30, n. 1, pp. 142–152, 2024.
-
[3] BUJ-CORRAL, I., RODERO-DE-LAMO, L., MARCO-ALMAGRO, L., “Optimization and sensitivity analysis of the cutting conditions in rough, semi-finish and finish honing”, Materials, v. 15, n. 1, pp. 75, 2021. doi: http://doi.org/10.3390/ma15010075. PubMed PMID: 35009221.
» https://doi.org/10.3390/ma15010075 - [4] ARASU, A., NATARAJAN, M., BALASUNDARAM, N., ET AL., “Utilizing recycled nanomaterials as a partial replacement for cement to create high-performance concrete”, Global NEST Journal, v. 25, n. 6, pp. 89–92, 2023.
-
[5] RIZUN, N., REVINA, A., MEISTER, V.G., “Analyzing content of tasks in Business Process Management: blending task execution and organization perspectives”, Computers in Industry, v. 130, pp. 103463, 2021. doi: http://doi.org/10.1016/j.compind.2021.103463.
» https://doi.org/10.1016/j.compind.2021.103463 -
[6] ARASU, A.N., NATARAJAN, M., BALASUNDARAM, N., ET AL., “Development of high-performance concrete by using nanomaterial graphene oxide in partial replacement for cement”, AIP Conference Proceedings, v. 2861, n. 1, pp. 050008, 2023. doi: http://doi.org/10.1063/5.0158487.
» https://doi.org/10.1063/5.0158487 -
[7] ZHANG, H., GONG, M., HUANG, Y., ET AL., “Study of the high and low-temperature behavior of asphalt based on a performance grading system in Northeast China”, Construction & Building Materials, v. 254, pp. 119046, 2020. doi: http://doi.org/10.1016/j.conbuildmat.2020.119046.
» https://doi.org/10.1016/j.conbuildmat.2020.119046 -
[8] REN, S., LUO, Y., YAN, T., ET AL., “Machine learning-based automatic segmentation of region of interest in dynamic optical imaging”, AIP Advances, v. 11, n. 1, pp. 015029, 2021. doi: http://doi.org/10.1063/5.0033605.
» https://doi.org/10.1063/5.0033605 -
[9] PADMASUDHA KANNAN, G., SMITHA, T.V., NAGARAJA, K.V., “Automated high-order curved mesh generator with high-level dynamic programming language julia for photonic applications”, Materials Today: Proceedings, v. 46, pp. 4553–4558, 2021. doi: http://doi.org/10.1016/j.matpr.2020.09.706.
» https://doi.org/10.1016/j.matpr.2020.09.706 - [10] PARTHASAARATHI, R., BALASUNDARAM, N., ARASU, A.N., “Analysing the impact and investigating coconut shell fiber reinforced concrete (CSFRC) under varied loading conditions”, Journal of Advanced Research in Applied Sciences and Engineering Technology, v. 35, n. 1, pp. 106–120, 2024.
-
[11] YINGFEI, M., ROMANOVA, I.K., “Solving the localization problem while navigating unknown environments using the SLAM method”, AIP Conference Proceedings, v. 2383, n. 1, pp. 030008, 2022. doi: http://doi.org/10.1063/5.0074547.
» https://doi.org/10.1063/5.0074547 -
[12] WORMSER, M., KIEFER, D.A., RUPITSCH, S.J., ET AL., “Comparison of transmission measurement methods of elastic waves in phononic band gap materials”, Materials, v. 14, n. 5, pp. 1133, 2021. doi: http://doi.org/10.3390/ma14051133. PubMed PMID: 33670843.
» https://doi.org/10.3390/ma14051133 -
[13] VRDOLJAK, I., VAREVAC, D., MILIČEVIĆ, I., ET AL., “Concrete-based composites with the potential for effective protection against electromagnetic radiation: a literature review”, Construction & Building Materials, v. 326, pp. 126919, 2022. doi: http://doi.org/10.1016/j.conbuildmat.2022.126919.
» https://doi.org/10.1016/j.conbuildmat.2022.126919 -
[14] SRINIVASAN, S.S., MUTHUSAMY, N., ANBARASU, N.A., “The structural performance of fiberreinforced concrete beams with nanosilica”, Matéria, v. 29, n. 3, e20240194, 2024. doi: http://doi.org/10.1590/1517-7076-rmat-2024-0194.
» https://doi.org/10.1590/1517-7076-rmat-2024-0194 -
[15] KANG, X., ZHU, X., SHU, X., ET AL., “Hydration of clinker phases in portland cement in the presence of graphene oxide”, Journal of Materials in Civil Engineering, v. 34, n. 2, pp. 04021425, 2022. doi: http://doi.org/10.1061/(ASCE)MT.1943-5533.0004063.
» https://doi.org/10.1061/(ASCE)MT.1943-5533.0004063 -
[16] TAN, Y., WANG, Y.W., AN, R., ET AL., “Dynamic mechanical performance and constitutive model for Zr58Cu12Ni12Al15Nb3 metallic glass”, Journal of Materials Research and Technology, v. 13, pp. 1866–1877, 2021. doi: http://doi.org/10.1016/j.jmrt.2021.05.087.
» https://doi.org/10.1016/j.jmrt.2021.05.087 -
[17] KANG, M.-S., KIM, Y.-S., CHOI, J.-W., ET AL., “Experimental quantum message authentication with single qubit unitary operation”, Applied Sciences, v. 11, n. 6, pp. 2653, 2021. doi: http://doi.org/10.3390/app11062653.
» https://doi.org/10.3390/app11062653 - [18] MANJUNATHA, L.R., ANVEKAR, S.R., YOGANANDA, M., “Recent developments in the Indian concrete industry in the use of GGBS in concrete at RMC batching plants as partial replacement to OPC cement and its effects on concrete durability and sustainabilty in the Indian context”, In: Proceedings of the International Congress on Durability of Concrete, v. 1, New Delhi, 2014.
-
[19] PATIL, Y.O., PATIL, P.N., DWIVEDI, A.K., “GGBS as partial replacement of OPC in cement concrete: an experimental study”, International Journal of Scientific Research, v. 2, n. 11, pp. 189–191, 2013. doi: http://doi.org/10.15373/22778179/NOV2013/60.
» https://doi.org/10.15373/22778179/NOV2013/60 -
[20] WEERASINGHE, K.A.B., GAMAGE, J.C.P.H., FAWZIA, S., ET AL., “Experimental investigation on flexural behaviour of vertically-curved circular-hollow steel sections strengthened with externally bonded carbon fibre reinforced polymer “, Engineering Structures, v. 236, pp. 112040, 2021. doi: http://doi.org/10.1016/j.engstruct.2021.112040.
» https://doi.org/10.1016/j.engstruct.2021.112040 -
[21] IWASA, T., NISHIZAWA, S., SAKAI, M., “Buckling severity measurement of axially compressed cylindrical structures with periodic buckling pattern”, Engineering Structures, v. 213, pp. 110568, 2020. doi: http://doi.org/10.1016/j.engstruct.2020.110568.
» https://doi.org/10.1016/j.engstruct.2020.110568 - [22] ARASU NAVEEN, A., RANJINI, D., PRABHU, R., “Investigation on partial replacement of cement by GGBS”, Journal of Critical Reviews, v. 7, n. 17, pp. 3827–3831, 2020.
-
[23] PARK, C., LEE, D., “Classification of respiratory states using spectrogram with convolutional neural network”, Applied Sciences, v. 12, n. 4, pp. 1895, 2022. doi: http://doi.org/10.3390/app12041895.
» https://doi.org/10.3390/app12041895 -
[24] GHOBADI, M.S., SHAMS, A.S., “A hybrid self-centering building toward seismic resilient structures: design procedure and fragility analysis”, Journal of Building Engineering, v. 44, pp. 103261, 2021. doi: http://doi.org/10.1016/j.jobe.2021.103261.
» https://doi.org/10.1016/j.jobe.2021.103261 -
[25] ZEALAKSHMI, D., VIJAYA, B., “A comparative flexural performance of an over-reinforced high strength concrete beam with normal strength beam”, Materials Today: Proceedings, v. 68, pp. 1536–1541, 2022. doi: http://doi.org/10.1016/j.matpr.2022.07.179.
» https://doi.org/10.1016/j.matpr.2022.07.179 -
[26] ZHANG, Y., MIN, J., LIU, C., ET AL., “Hotspot detection and spatiotemporal evolution of catering service grade in mountainous cities from the perspective of geo-information tupu”, ISPRS International Journal of Geo-Information, v. 10, n. 5, pp. 287, 2021. doi: http://doi.org/10.3390/ijgi10050287.
» https://doi.org/10.3390/ijgi10050287 - [27] ARASU, N., RAFSAL, M.M., SURYA KUMAR, O.R., “Experimental investigation of high performance concrete by partial replacement of fine aggregate by construction demolition waste”, International Journal of Scientific and Engineering Research, v. 9, n. 3, pp. 46–52, 2018.
-
[28] FONSECA, J.F., TEIXEIRA, J.E.S.L., BRANCO, V.T.F.C., ET AL., “Evaluation of effects of filler by-products on fine aggregate matrix viscoelasticity and fatigue-fracture characteristics”, Journal of Materials in Civil Engineering, v. 31, n. 10, pp. 04019240, 2019. doi: http://doi.org/10.1061/(ASCE) MT.1943-5533.0002891.
» https://doi.org/10.1061/(ASCE)MT.1943-5533.0002891 - [29] SURESHBABU, Y., GANESHAN, P., RAJA, K., ET AL., “Performance and emissions parameters optimization of thermal barrier coated engine tested with Tamanu blended diesel fuel: a novel emission pollution-preventive approach”, Global NEST Journal, v. 25, n. 3, pp. 78–86, 2023.
-
[30] FENG, S., YANG, Y., XUE, Y., ET AL., “A post-tensioned hybrid beam-column connection with a web friction device: experimental study and theoretical analysis”, Journal of Building Engineering, v. 43, pp. 103105, 2021. doi: http://doi.org/10.1016/j.jobe.2021.103105.
» https://doi.org/10.1016/j.jobe.2021.103105 -
[31] GANAPATHY, G.P., ALAGU, A., RAMACHANDRAN, S., ET AL., “Effects of fly ash and silica fume on alkalinity, strength and planting characteristics of vegetation porous concrete”, Journal of Materials Research and Technology, v. 24, pp. 5347–5360, 2023. doi: http://doi.org/10.1016/j.jmrt.2023.04.029.
» https://doi.org/10.1016/j.jmrt.2023.04.029 - [32] THIRUKUMARAN, T., KRISHNAPRIYA, S., PRIYA, V., ET AL., “Utilizing rice husk ash as a bio-waste material in geopolymer composites with aluminium oxide”, Global NEST Journal, v. 25, n. 6, pp. 119–129, 2023.
- [33] VERAPATHRAN, M., VIVEK, S., ARUNKUMAR, G.E., ET AL., “Flexural behaviour of HPC beams with steel slag aggregate”, Journal of Ceramic Processing Research, v. 24, n. 1, pp. 89–97, 2023.
-
[34] AL-MOSAWY, S.K., AL-JABERI, A.A., ALROBAEE, T.R., ET AL., “Urban planning and reconstruction of cities post-wars by the approach of events and response images”, Civil Engineering Journal, v. 7, n. 11, pp. 1836–1852, 2021. doi: http://doi.org/10.28991/cej-2021-03091763.
» https://doi.org/10.28991/cej-2021-03091763 -
[35] VITA, N., SHARMA, A., HOFMANN, J., “Bonded anchors with post-installed supplementary reinforcement under tension loading: experimental investigations”, Engineering Structures, v. 252, pp. 113754, 2022. doi: http://doi.org/10.1016/j.engstruct.2021.113754.
» https://doi.org/10.1016/j.engstruct.2021.113754 -
[36] ARASU, A.N., MUTHUSAMY, N., NATARAJAN, B., ET AL., “Optimization of high performance concrete composites by using nano materials”, Research on Engineering Structures and Materials, v. 9, n. 3, pp. 843–859, 2023. doi: http://doi.org/10.17515/resm2022.602ma1213.
» https://doi.org/10.17515/resm2022.602ma1213 -
[37] NAVEEN KUMAR, S., NATARAJAN, M., ARASU, A.N., “A comprehensive microstructural analysis for enhancing concrete’s longevity and environmental sustainability”, Journal of Environmental Nanotechnology, v. 13, n. 2, pp. 368–376, 2024. doi: http://doi.org/10.13074/jent.2024.06.242584.
» https://doi.org/10.13074/jent.2024.06.242584 -
[38] THOEMMES, A., BATAEV, I.A., LAZURENKO, D.V., ET AL., “Microstructure and lattice parameters of suction-cast Ti–Nb alloys in a wide range of Nb concentrations”, Materials Science and Engineering A, v. 818, pp. 141378, 2021. doi: http://doi.org/10.1016/j.msea.2021.141378.
» https://doi.org/10.1016/j.msea.2021.141378 -
[39] RAGAB, M.G., ABDULKADIR, S.J., AZIZ, N., ET AL., “An ensemble one dimensional convolutional neural network with Bayesian optimization for environmental sound classification”, Applied Sciences, v. 11, n. 10, pp. 4660, 2021. doi: http://doi.org/10.3390/app11104660.
» https://doi.org/10.3390/app11104660 -
[40] ASHRAF, U., ZHANG, H., ANEES, A., ET AL., “An ensemble-based strategy for robust predictive volcanic rock typing efficiency on a global-scale: a novel workflow driven by big data analytics”, The Science of the Total Environment, v. 937, pp. 173425, 2024. doi: http://doi.org/10.1016/j.scitotenv.2024.173425. PubMed PMID: 38795994.
» https://doi.org/10.1016/j.scitotenv.2024.173425 -
[41] VIVEK, S., PRIYA, V., SUDHARSAN, S.T., ET AL., “Experimental investigation on bricks by using cow dung, rice husk, egg shell powder as a partial replacement for fly ash”, The Asian Review of Civil Engineering, v. 9, n. 2, pp. 1–7, 2020. doi: http://doi.org/10.51983/tarce-2020.9.2.2556.
» https://doi.org/10.51983/tarce-2020.9.2.2556 -
[42] SAINI, S.K., MAHATO, S., PANDEY, D.N., ET AL., “Modeling flood susceptibility zones using hybrid machine learning models of an agricultural dominant landscape of India”, Environmental Science and Pollution Research International, v. 30, n. 43, pp. 97463–97485, 2023. doi: http://doi.org/10.1007/s11356-023-29049-9. PubMed PMID: 37594709.
» https://doi.org/10.1007/s11356-023-29049-9 -
[43] ESPINO, M.T., TUAZON, B.J., ESPERA JUNIOR, A.H., ET AL., “Statistical methods for design and testing of 3D-printed polymers”, MRS Communications, v. 13, n. 2, pp. 193–211, 2023. doi: http://doi.org/10.1557/s43579-023-00332-7. PubMed PMID: 37153534.
» https://doi.org/10.1557/s43579-023-00332-7 -
[44] KADHAR, S.A., GOPAL, E., SIVAKUMAR, V., ET AL., “Optimizing flow, strength, and durability in highstrength self-compacting and self-curing concrete utilizing lightweight aggregates”, Matéria, v. 29, n. 1, e20230336, 2024. doi: http://doi.org/10.1590/1517-7076-rmat-2023-0336.
» https://doi.org/10.1590/1517-7076-rmat-2023-0336 -
[45] HAN, S., ZHAO, S., LU, D., ET AL., “Performance improvement of recycled concrete aggregates and their potential applications in infrastructure: a review”, Buildings, v. 13, n. 6, pp. 1411, 2023. doi: http://doi.org/10.3390/buildings13061411.
» https://doi.org/10.3390/buildings13061411 - [46] AMERICAN CONCRETE INSTITUTE, ACI 211.4R-93 Recommended Guidelines For High-Performance Concrete Mix Design, Michigan, USA, 1993.
-
[47] MUTHUPRIYA, P., BOOBALAN, S.C., VISHNURAM, B.G., “Behaviour of fibre reinforced highperformance concrete in exterior beam column joint”, International Journal Of Advanced Structural Engineering, v. 6, n. 3, pp. 57, 2014. doi: http://doi.org/10.1007/s40091-014-0057-2.
» https://doi.org/10.1007/s40091-014-0057-2 - [48] BUREAU OF INDIAN STANDARDS, IS 516: 1959 Methods of Tests for Strength of Concrete, New Delhi, BIS, 1959.
-
[49] ABREU, G.B., COSTA, S.M.M., GUMIERI, A.G., ET AL., “Mechanical properties and microstructure of high-performance concrete stabilized nano-silica”, Revista Materia, v. 22, n. 2, e11824, 2017. doi: http://doi.org/10.1590/S1517-707620170002.0156.
» https://doi.org/10.1590/S1517-707620170002.0156 - [50] Smith ET AL. (2021).
- [51] Zhang, Hang, Chongruo Wu, Zhongyue Zhang, Yi Zhu, Haibin Lin, Zhi Zhang, Yue Sun et al. “Resnest: Split-attention networks.” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2736-2746. 2022.
- [52] Kumar, Manish, Arbind Kumar Patel, Anil V. Shah, Janvi Raval, Neha Rajpara, Madhvi Joshi, and Chaitanya G. Joshi. “First proof of the capability of wastewater surveillance for COVID-19 in India through detection of genetic material of SARS-CoV-2.” Science of The Total Environment 746 (2020): 141326.
- [53] Lee, Gregory, Ralf Gommers, Filip Waselewski, Kai Wohlfahrt, and Aaron O’Leary. “PyWavelets: A Python package for wavelet analysis.” Journal of Open Source Software 4, no. 36 (2019): 1237.
Publication Dates
-
Publication in this collection
27 Jan 2025 -
Date of issue
2025
History
-
Received
07 Oct 2024 -
Accepted
21 Nov 2024
























