Acessibilidade / Reportar erro

Use of machine learning techniques for predicting the bearing capacity of piles

Abstract

Geotechnical engineers frequently rely on semi-empirical methods like Décourt-Quaresma and Meyehof’s to estimate the bearing capacity of piles. This paper proposes alternatives to these methods, presenting an approach using machine learning models for predicting the bearing capacity of precast concrete piles. It uses data samples including 165 load tests, each one accompanied with a SPT sounding. This study proposes two types of analysis using two separated datasets, one based on the Décourt-Quaresma method and the other based on the Meyerhof method. Six machine learning algorithms of distinct biases are trained and tested with a leave-one-out cross validation procedure and the models’ predictive performance is assessed through two metrics: root mean squared error (RMSE) and coefficient of determination (R2). The best performing technique was random forest (RF) using Décourt-Quaresma dataset, with an RMSE of 642.38. All other machine learning techniques obtained a RMSE below 710, overcoming Meyerhof's and Décourt-Quaresma's semi-empirical methods, which both obtained RMSE values close to 900. This study proposes 95% and 90% confidence intervals for the best technique employing a graphical interpretation, so that geotechnical engineers can choose which level of safety they wish to work with. Finally, the study presents a case study showing that the best performing models achieve a reasonable accuracy, surpassing the semi-empirical methods in two of the three piles considered. The representativity of the new examples within the used datasets explain the accuracy of the techniques.

Keywords
Concrete precast piles; Bearing capacity; Semi-empirical methods; Machine learning

1. Introduction

Designers need to estimate the load bearing capacity of piles and the most precise way is through static pile load tests. The Brazilian Standard (ABNT 2019Associação Brasileira de Normas Técnicas – ABNT (2019). Projeto e execução de fundações. Associação Brasileira de Normas Técnicas, Rio de Janeiro.) define procedures for this type of test, which basically consists on applying an increasing load to an executed pile and measuring its displacement. Designers can obtain the load bearing capacity by examining the load-displacement graph, using criteria defined by the standard. Nonetheless, they cannot rely only on static pile load tests because they are expensive, time consuming and usually executed when part of the piles of the project are already in place. The most popular approach to estimate pile bearing capacity beforehand is to use semi-empirical methods, like those proposed by Aoki & Velloso (1975)Aoki, N., & Velloso, D. (1975). An approximate method to estimate the bearing capacity of piles. In Proc., 5th Pan-American Conf. of Soil Mechanics and Foundation Engineering (pp. 367-376) Buenos Aires: International Society of Soil Mechanics and Geotechnical Engineering., Décourt & Quaresma (1978Décourt, L., & Quaresma, A. (1978). Capacidade de carga de estacas a partir de valores de SPT. In Congresso Brasileiro de Mecânica dos Solos e Engenharia de Fundações (pp. 45-53). São Paulo: Associação Brasileira de Mecânica dos Solos e Engenharia Geotécnica., 1998Décourt, L., & Quaresma, A. (1998). Análise e Projeto de Fundações Profundas. In: W. Hachich (Eds.), Fundações: teoria e prática (pp. 275-276). PINI.) and Meyerhof (1976)Meyerhof, G. (1976). Bearing capacity and settlements of pile foundations. Journal of Geotechnical Engineering, 102, 197-228. http://dx.doi.org/10.1061/AJGEB6.0000243.
http://dx.doi.org/10.1061/AJGEB6.0000243...
. Most semi-empirical methods propose two separate estimates: one for the shaft resistance and another for the tip resistance. The total pile bearing capacity given by the sum of them. These methods usually estimate the bearing capacity through results of in situ tests and pile geometric features. In several countries (including Brazil), contractors usually only make available the standard penetration test (SPT). The main reasons are cost and simplicity when compared to methods like the cone penetration test, making the SPT popular in those countries. Even when designers do have access to other in situ tests, they sometimes rely on correlations to convert data into SPT values.

In recent years, machine learning techniques are increasingly gaining space within a wide variety of engineering applications. Their advantages include the capability to deal with large amounts of data and to find complex and highly nonlinear relationships among different parameters. In geotechnics many works have been using these algorithms to solve different kinds of problems with good results over traditional methods. Some of these problems are: soil classification (Bhattacharya & Solomatine, 2006Bhattacharya, B., & Solomatine, D.P. (2006). Machine learning in soil classification. Neural Networks, 19(2), 186-195. http://dx.doi.org/10.1016/j.neunet.2006.01.005.
http://dx.doi.org/10.1016/j.neunet.2006....
; Kovacevic et al., 2010Kovacevic, M., Bajat, B., & Gajic, B. (2010). Soil type classification and estimation of soil properties using support vector machines. Geoderma, 154(3-4), 340-347. http://dx.doi.org/10.1016/j.geoderma.2009.11.005.
http://dx.doi.org/10.1016/j.geoderma.200...
; Bonini et al., 2017Bonini Neto, A., Bonini, C.S.B., Bisi, B.S., Reis, A.R., & Coletta, L.F.S. (2017). Artificial neural network for classification and analysis of degraded soils. IEEE Latin America Transactions, 15(3), 503-509.; Carvalho & Ribeiro 2019Carvalho, L.O., & Ribeiro, D.B. (2019). Soil classification system from cone penetration test data applying distance-based machine learning algorithms. Soils and Rocks, 42(2), 167-178. http://dx.doi.org/10.28927/SR.422167.
http://dx.doi.org/10.28927/SR.422167...
), soil profiling (Arel, 2012Arel, E. (2012). Predicting the spatial distribution of soil profile in Adapazari/Turkey by artificial neural networks using CPT data. Computers & Geosciences, 43, 90-100. http://dx.doi.org/10.1016/j.cageo.2012.01.021.
http://dx.doi.org/10.1016/j.cageo.2012.0...
), soil liquefaction (Juang & Chen, 1999Juang, C.H., & Chen, C.J. (1999). CPT-based liquefaction evaluation using artificial neural networks. Computer-Aided Civil and Infrastructure Engineering, 14(3), 221-229. http://dx.doi.org/10.1111/0885-9507.00143.
http://dx.doi.org/10.1111/0885-9507.0014...
; Hanna et al., 2007Hanna, A.M., Ural, D., & Saygili, G. (2007). Neural network model for liquefaction potential in soil deposits using Turkey and Taiwan earthquake data. Soil Dynamics and Earthquake Engineering, 27(6), 521-540. http://dx.doi.org/10.1016/j.soildyn.2006.11.001.
http://dx.doi.org/10.1016/j.soildyn.2006...
; Goh & Goh, 2007Goh, A.T.C., & Goh, S.H. (2007). Support vector machines: their use in geotechnical engineering as illustrated using seismic liquefaction data. Computers and Geotechnics, 34(5), 410-421. http://dx.doi.org/10.1016/j.compgeo.2007.06.001.
http://dx.doi.org/10.1016/j.compgeo.2007...
; Livingston et al., 2008Livingston, G., Piantedosi, M., Kurup, P., & Sitharam, T.G. (2008). An approximate method to estimate the bearing capacity of piles. In X. Zeng, M.T. Manzari, & D.R. Hiltunen (Eds.), Geotechnical Earthquake Engineering and Soil Dynamics IV Congress (pp. 1-10). Reston: American Society of Civil Engineers.; Kohestani et al., 2015Kohestani, V.R., Hassanlourad, M., & Ardakani, A. (2015). Evaluation of liquefaction potential based on CPT data using random forest. Natural Hazards, 79(2), 1079-1089. http://dx.doi.org/10.1007/s11069-015-1893-5.
http://dx.doi.org/10.1007/s11069-015-189...
), slope stability (Ferentinou & Sakellariou, 2007Ferentinou, M.D., & Sakellariou, M.G. (2007). Computational intelligence tools for the prediction of slope performance. Computers and Geotechnics, 34(5), 362-384. http://dx.doi.org/10.1016/j.compgeo.2007.06.004.
http://dx.doi.org/10.1016/j.compgeo.2007...
; Bui et al., 2019Bui, D.T., Moayedi, H., Gör, M., Jaafari, A., & Foong, L.K. (2019). Predicting slope stability failure through machine learning paradigms. ISPRS International Journal of Geo-Information, 8(9), 395. http://dx.doi.org/10.3390/ijgi8090395.
http://dx.doi.org/10.3390/ijgi8090395...
; Maxwell et al., 2020Maxwell, A.E., Sharma, M., Kite, J.S., Donaldson, K.A., Thompson, J.A., Bell, M.L., & Maynard, S.M. (2020). Slope failure prediction using random forest machine learning and lidar in an eroded folded mountain belt. Remote Sensing, 12(3), 486. http://dx.doi.org/10.3390/rs12030486.
http://dx.doi.org/10.3390/rs12030486...
) , foundation settlement (Samui, 2008Samui, P. (2008). Support vector machine applied to settlement of shallow foundations on cohesionless soils. Computers and Geotechnics, 35(3), 419-427. http://dx.doi.org/10.1016/j.compgeo.2007.06.014.
http://dx.doi.org/10.1016/j.compgeo.2007...
; Nejad et al., 2009Nejad, F.P., Kakhi, M.B., & McCabe, B.A. (2009). Prediction of pile settlement using artificial neural networks based on standard penetration test data. Computers and Geotechnics, 36(7), 1125-1133. http://dx.doi.org/10.1016/j.compgeo.2009.04.003.
http://dx.doi.org/10.1016/j.compgeo.2009...
; Nejad & Jaksa, 2017Nejad, F.P., & Jaksa, M.B. (2017). Load-settlement behavior modeling of single piles using artificial neural networks and CPT data. Computers and Geotechnics, 89, 9-21. http://dx.doi.org/10.1016/j.compgeo.2017.04.003.
http://dx.doi.org/10.1016/j.compgeo.2017...
) and bearing capacity. For the latter, authors have used well-known algorithms with different kinds of inputs, such as wave stress data (Lok & Che, 2004Lok, T.M.H., & Che, W.F. (2004). Axial capacity prediction for driven piles using ANN: model comparison. In M.K. Yegian, & E. Kavazanjian (Eds.), Geotechnical Engineering for Transportation Projects (pp. 697-704). Reston: American Society of Civil Engineers. https://doi.org/10.1061/40744(154)56.
https://doi.org/10.1061/40744(154)56...
; Pal & Deswal, 2008Pal, M., & Deswal, S. (2008). Modeling pile capacity using support vector machines and generalized regressions neural network. Journal of Geotechnical and Geoenvironmental Engineering, 134(7), 1021-1024. http://dx.doi.org/10.1061/(ASCE)1090-0241(2008)134:7(1021).
http://dx.doi.org/10.1061/(ASCE)1090-024...
; Maizir & Kassim, 2013Maizir, H., & Kassim, K.A. (2013). Neural network application in prediction of axial bearing capacity of driven piles. In O. Castillo, C. Douglas, D.D. Feng & J. Lee (Eds.), Proc of the International MultiConference of Engineers and Computer Scientists. Hong Kong: Newswood Limited.; Momeni et al., 2015Momeni, E., Nazir, R., Armaghani, D., & Maizir, H. (2015). Application of artificial neural network for predicting shaft and tip resistances of concrete piles. Earth Sciences Research Journal, 19(1), 85-93.), soil parameters (Pal & Deswal, 2010Pal, M., & Deswal, S. (2010). Modelling pile capacity using Gaussian process regression. Computers and Geotechnics, 37(7), 924-947. http://dx.doi.org/10.1016/j.compgeo.2010.07.012.
http://dx.doi.org/10.1016/j.compgeo.2010...
; Moayedi & Hayati, 2019Moayedi, H., & Hayati, S. (2019). Artificial intelligence design charts for predicting friction capacity of driven pile in clay. Neural Computing & Applications, 31(11), 7429-7445. http://dx.doi.org/10.1007/s00521-018-3555-5.
http://dx.doi.org/10.1007/s00521-018-355...
; Kardani et al., 2020Kardani, N., Zhou, A., Nazem, M., & Shen, S. (2020). Estimation of bearing capacity of piles in cohesionless soil using optimised machine learning approaches. Geotechnical and Geological Engineering, 38(2), 2271-2291. http://dx.doi.org/10.1007/s10706-019-01085-8.
http://dx.doi.org/10.1007/s10706-019-010...
), CPT results (Ardalan et al., 2009Ardalan, H., Eslami, A., & Nariman-Zadeh, N. (2009). Piles shaft capacity from CPT and CPTu data by polynomial neural networks and genetic algorithms. Computers and Geotechnics, 36(4), 616-625. http://dx.doi.org/10.1016/j.compgeo.2008.09.003.
http://dx.doi.org/10.1016/j.compgeo.2008...
; Shahin, 2010Shahin, M.A. (2010). Intelligent computing for modeling axial capacity of pile foundations. Canadian Geotechnical Journal, 47(2), 230-243. http://dx.doi.org/10.1139/T09-094.
http://dx.doi.org/10.1139/T09-094...
; Kordjazi et al., 2014Kordjazi, A., Nejad, F.P., & Jaksa, M.B. (2014). Prediction of ultimate axial load-carrying capacity of piles using a support vector machine based on CPT data. Computers and Geotechnics, 55, 91-102. http://dx.doi.org/10.1016/j.compgeo.2013.08.001.
http://dx.doi.org/10.1016/j.compgeo.2013...
; Kordjazi et al., 2013Kordjazi, A., Nejad, F.P., & Jaksa, M.B. (2013). Prediction of load-carrying capacity of piles using a support vector machine and improved data collection. In G. Cassidy (Ed.), Proceedings of the 12th Australia New Zealand conference on geomechanics: the changing face of the earth - geomechanics & human influence (pp. 1-8). Wellington: Opus.; Alkroosh et al., 2015Alkroosh, I., Bahadori, M., Nikraz, H., & Bahadori, A. (2015). Regressive approach for predicting bearing capacity of bored piles from cone penetration test data. Journal of Rock Mechanics and Geotechnical Engineering, 7(5), 584-592. http://dx.doi.org/10.1016/j.jrmge.2015.06.011.
http://dx.doi.org/10.1016/j.jrmge.2015.0...
) and SPT measures (Lee & Lee, 1996Lee, I., & Lee, J. (1996). Prediction of pile bearing capacity using artificial neural networks. Computers and Geotechnics, 18(3), 189-200. http://dx.doi.org/10.1016/0266-352X(95)00027-8.
http://dx.doi.org/10.1016/0266-352X(95)0...
; Nawari & Liang, 2000Nawari, N.O., & Liang, R. (2000). Intelligent hybrid system for the design of pile foundations. In N.D. Dennis, R. Castelli, & M.W. O'Neill (Eds.), New Technological and Design Developments in Deep Foundations (pp. 312-326). Reston: American Society of Civil Engineers. https://doi.org/10.1061/40511(288)22.
https://doi.org/10.1061/40511(288)22...
; Ismail & Jeng, 2011Ismail, A., & Jeng, D. (2011). Modelling load–settlement behavior of piles using high-order neural network (HON-PILE model). Engineering Applications of Artificial Intelligence, 24(5), 813-821. http://dx.doi.org/10.1016/j.engappai.2011.02.008.
http://dx.doi.org/10.1016/j.engappai.201...
; Benali et al., 2018Benali, A., Nechnech, A., Boukhatem, B., Hussein, M.N., & Karry, M. (2018). Neural networks and principle component analysis approaches to predict pile capacity in sand. In MATEC Web of Conference (Org.), 2nd International Congress on Materials & Structural Stability. Les Ulis: EDP Sciences. https://doi.org/10.1051/matecconf/201814902025
https://doi.org/10.1051/matecconf/201814...
; Jesswein & Liu, 2018Jesswein, M., & Liu, J. (2018). Predicting the side resistance of piles using a genetic algorithm and SPT n-values. In Canadian Geotechnical Society (Org.), Proceedings of the 71st Canadian Geotechnical Conference and the 13th Joint CGS/IAH-CNC Groundwater Conference - GeoEdmonton 2018 : moving forward (pp. 1-8). Richmond: Canadian Geotechnical Society.; Pham et al., 2020Pham, T., Ly, H., Tran, V., Giap, L., Vu, H., & Duong, H. (2020). Prediction of pile axial bearing capacity using artificial neural network and random forest. Applied Sciences, 10(5), 1871. http://dx.doi.org/10.3390/app10051871.
http://dx.doi.org/10.3390/app10051871...
). These studies achieved reasonable results for the proposed problems. Nevertheless, most of them include weaknesses like comparing few techniques or using very homogeneous datasets, with all soundings taken from the same construction site.

The main objective of this paper is to propose a new approach for the use of machine learning techniques, using classical semi-empirical methods as a basis for estimating the bearing capacity of piles. It is better than previous machine learning models from the literature concerning generality for tropical soils and ease of use. The used datasets include only static load tests (slow maintained load) of pre-cast concrete piles executed accordingly to the Brazilian standard (see ABNT 2019Associação Brasileira de Normas Técnicas – ABNT (2019). Projeto e execução de fundações. Associação Brasileira de Normas Técnicas, Rio de Janeiro.) and accompanied with SPT soundings. The investigation starts with the training of six machine learning techniques, producing two models for each one: the first using the inputs from the Décourt-Quaresma method and the second the inputs from the Meyerhof method. A multiple linear regression (LR) is also included as a baseline for performance. The authors selected the Décourt-Quaresma method because it is commonly used in Brazilian foundation projects and Meyerhof for being widely used worldwide. Both sets of inputs include pile diameter and length, the mean SPT along the shaft and the mean SPT at the pile tip. The main difference of the two sets is how mean SPT values are calculated.

It is shown in a general application that the precision of all machine learning techniques surpassed both Meyerhof (1976)Meyerhof, G. (1976). Bearing capacity and settlements of pile foundations. Journal of Geotechnical Engineering, 102, 197-228. http://dx.doi.org/10.1061/AJGEB6.0000243.
http://dx.doi.org/10.1061/AJGEB6.0000243...
and Décourt & Quaresma (1978Décourt, L., & Quaresma, A. (1978). Capacidade de carga de estacas a partir de valores de SPT. In Congresso Brasileiro de Mecânica dos Solos e Engenharia de Fundações (pp. 45-53). São Paulo: Associação Brasileira de Mecânica dos Solos e Engenharia Geotécnica., 1998Décourt, L., & Quaresma, A. (1998). Análise e Projeto de Fundações Profundas. In: W. Hachich (Eds.), Fundações: teoria e prática (pp. 275-276). PINI.) semi-empirical methods with respect to RMSE. This work proposes a graphical method to provide 90% and 95% confidence intervals for the results of the best technique. A case study applies the top two machine learning models and the two semi-empirical methods to three new examples, from one site that was not included in the training dataset. The machine learning techniques presented reasonable performance, and were better than the semi-empirical methods in two of the three piles.

2. Semi-empirical methods

Semi-empirical methods work based on empirical correlations of in situ tests data and adjustments with load test results. Results can vary for these methods due to their implicit subjectivity. For the Meyerhof method, little subjectivity was included because it uses NSPT and pile geometry as inputs, which are not sensitive to interpretation. On the other hand, the Décourt-Quaresma method relies on soil types as presented in Table 1, which are sensitive to interpretation.

Table 1
Values for K (Décourt & Quaresma, 1978Décourt, L., & Quaresma, A. (1978). Capacidade de carga de estacas a partir de valores de SPT. In Congresso Brasileiro de Mecânica dos Solos e Engenharia de Fundações (pp. 45-53). São Paulo: Associação Brasileira de Mecânica dos Solos e Engenharia Geotécnica., 1998Décourt, L., & Quaresma, A. (1998). Análise e Projeto de Fundações Profundas. In: W. Hachich (Eds.), Fundações: teoria e prática (pp. 275-276). PINI.).

In these methods, the pile load capacity Rt is usually divided into two parts: lateral friction Rl and tip resistance Rp, as shown in Equation 1. Different expressions are proposed for Rl and Rp in the literature, using information such as soil type, pile type, pile geometry and in situ test results.

R t = R l + R p (1)

The authors selected two methods for this study: the Décourt-Quaresma method for being popular in Brazil and the Meyerhof method for being widely used around the world. The next sections describe these methods.

2.1. Décourt-Quaresma

This method obtains the tip resistance using a factor related to the soil type, as presented in Table 1. It also uses the tip area Ap and the mean NSPT index around the pile tip SPTp, considering the value at the tip, the one above and the one below. It obtains the lateral resistance using pile geometry and the mean NSPT index along the pile shaft SPTl. The latter is subjected to 3<SPTp<15 and NSPT values used to obtain SPTp cannot be included. The final expression is

R t = α . K . S P T p . A p + U . β .10 S P T l 3 + 1 L (2)

where α and β refer to soil and pile type, respectively. U is the pile perimeter and L is the pile length.

2.2. Meyerhof

This method uses the NSPT index, pile length L and pile diameter D to estimate the pile bearing capacity. It calculates SPTl as the mean of the whole pile shaft and SPTp as the mean including 8D above the tip and 3D below it (Meyerhof, 1976Meyerhof, G. (1976). Bearing capacity and settlements of pile foundations. Journal of Geotechnical Engineering, 102, 197-228. http://dx.doi.org/10.1061/AJGEB6.0000243.
http://dx.doi.org/10.1061/AJGEB6.0000243...
). The expression proposed by Meyerhof is

R t = A p . q p + U . L . q s (3)

where

q p = 40. L S P T p D 400. S P T p (4)

and

q s = 2. S P T l (5)

3. Dataset

The information used in this work includes 165 precast concrete pile load tests and their respective SPT measures collected from many different construction sites in Brazil. It was obtained from the works of Lobo (2005)Lobo, B. (2005). Método de previsão de capacidade de carga de estacas: aplicação dos conceitos de energia do ensaio SPT [Doctoral thesis]. Universidade Federal do Rio Grande do Sul., Vianna (2000)Vianna, A. (2000). Análise de provas de carga estática em estacas pré-moldadas cravadas na cidade de Curitiba e região metropolitana [Masters thesis]. Universidade de São Paulo. https://doi.org/10.11606/D.18.2000.tde-18102006-101542
https://doi.org/10.11606/D.18.2000.tde-1...
and Santos (1988)Santos Junior, O. (1988). Previsão do comportamento carga-recalque de estacas pré-moldadas de concreto [Masters thesis]. Universidade de São Paulo. and all load tests were performed according to the Brazilian Standard (ABNT, 2019Associação Brasileira de Normas Técnicas – ABNT (2019). Projeto e execução de fundações. Associação Brasileira de Normas Técnicas, Rio de Janeiro.). When the maximum applied load was not achieved, the load-settlement curve was extrapolated using the Van der Veen method (Van Der Veen, 1953Van Der Veen, C. (1953). The bearing capacity of a pile. In International Conference on Soil Mechanics and Foundation Engineering (Org.), International Conference on Soil Mechanics and Foundation Engineering (pp. 367-376). Zurich: ICOSOMEF.). Interested readers can find further detail about these load tests in Lobo (2005)Lobo, B. (2005). Método de previsão de capacidade de carga de estacas: aplicação dos conceitos de energia do ensaio SPT [Doctoral thesis]. Universidade Federal do Rio Grande do Sul., Vianna (2000)Vianna, A. (2000). Análise de provas de carga estática em estacas pré-moldadas cravadas na cidade de Curitiba e região metropolitana [Masters thesis]. Universidade de São Paulo. https://doi.org/10.11606/D.18.2000.tde-18102006-101542
https://doi.org/10.11606/D.18.2000.tde-1...
and Santos (1988)Santos Junior, O. (1988). Previsão do comportamento carga-recalque de estacas pré-moldadas de concreto [Masters thesis]. Universidade de São Paulo.. In specific cases, information about pile rupture and comparisons between applied and ultimate loads is available.

Figure 1 presents the location of the soundings, most of them from the south and southeast regions of Brazil. The country presents a predominant tropical climate and high temperatures, with 65% of its territory covered by non-homogeneous lateritic soils. The clay-ferruginous soil is the most common type (Morais et al., 2020Morais, T.S.O., Tsuha, C.H.C., Bandeira Neto, L.A., & Singh, R.M. (2020). Effects of seasonal variations on the thermal response of energy piles in an unsaturated Brazilian tropical soil. Energy and Building, 216, 1-19. http://dx.doi.org/10.1016/j.enbuild.2020.109971.
http://dx.doi.org/10.1016/j.enbuild.2020...
). The authors had access to some details about the set provided by Vianna (2000)Vianna, A. (2000). Análise de provas de carga estática em estacas pré-moldadas cravadas na cidade de Curitiba e região metropolitana [Masters thesis]. Universidade de São Paulo. https://doi.org/10.11606/D.18.2000.tde-18102006-101542
https://doi.org/10.11606/D.18.2000.tde-1...
, which is composed by soundings taken from the city of Curitiba, in Paraná state. The geology of this region can be divided into three groups: a metamorphic rock complex from the Precambian; sedimentary deposits from Tertiary; and a more recent sedimentary deposit (Holocene), as a result of a partial removal of older sediments (Cenozoic). This entire sequence of Cenozoic sediments in the Curitiba Basin is named Guabirotuba Formation in the literature (Bigarella & Salamuni, 1962Bigarella, J.J., & Salamuni, R. (1962). Caracteres texturais dos sedimentos da bacia de Curitiba. Instituto de Geologia.).

Figure 1
Number of soundings at each Brazilian state.

After assembly, raw data was preprocessed into two datasets. The first, named Décourt dataset, uses SPTp, SPTl, D (in cm) and L (in m) as calculated in the Décourt-Quaresma method. The second uses the same inputs, but defined accordingly to the Meyerhof method. Notice that the difference is how each method calculates SPTp and SPTl, as presented in previous sections. The authors did not include soil type among the inputs because, based on their previous experience, these variables do not contribute to improve accuracy and include too much human error. Thus, although the authors consider the position of the water table relevant for the problem, they decided not to include it because many of the used soundings did not include this information. Figure 2 illustrates the variables used in each set. The target feature is the pile bearing load capacity Qu in kN, obtained from the load test. Tables 2 and 3 present a sample of each dataset. Unity b refers to the number of blows needed for the sampler to penetrate 30 cm into the soil (Salgado, 2008Salgado, R. (2008). The engineering of foundations. McGraw Hill.).

Figure 2
a) Décourt-Quaresma parameters. b) Meyerhof parameters.
Table 2
Sample from Décourt dataset.
Table 3
Sample from Meyerhof dataset.

Tables 4 and 5 present correlation matrices generated for each dataset. Inputs are not severely correlated, with all values within the interval 0.7,0.7. This indicates that they can be all considered informative, occurring few redundancies between them. Notice that the correlation between SPTl and SPTp is only 0.35 for the Décourt dataset, while it rises to 0.7 for the Meyerhof dataset. This can be explained by the way each method obtains these variables, with completely separated soil layers considered for the Décourt dataset and an intersection of common soil layers considered for the Meyerhof dataset (see Figure 2). D and L are the ones with stronger correlation to the output Qu, which was expected.

Table 4
Correlation matrix for Meyerhof dataset.
Table 5
Correlation matrix for Décourt dataset.

4. Model description

After pre-processing, this study uses both datasets to train a set of selected machine learning algorithms. First step is organizing each dataset as a matrix, where each column represents an input or the output and each line represents an example. In other words, each dataset becomes a 165×5 matrix. Next, it divides the examples (lines) of each dataset into two portions: the training set and the test set. This work uses the leave-one-out cross validation approach, using the full dataset for training except for one example kept apart for test. The procedure tests all examples and the final accuracy is given by the mean (Wong, 2015Wong, T. (2015). Performance evaluation of classification algorithms by k-fold and leave-one-outcross validation. Pattern Recognition, 48(9), 2839-2846. http://dx.doi.org/10.1016/j.patcog.2015.03.009.
http://dx.doi.org/10.1016/j.patcog.2015....
).

The coefficient of determination R20,1 is one metric used in this work to evaluate the performance of the algorithms. It is obtained using Equation 6, where y^i is a predicted value obtained from the model, yi is an observed value from dataset, y¯ the mean of all observed values and ne is the number of examples. In this work, yi is the pile bearing load capacity (Qu) of a specific pile i. R2 values close to 1 imply that the target variable is completely explained by the used model. 0 means no connection between predicted and observed values. The literature considers this metric a meaningful indicator of accuracy (Draper & Smith, 1998Draper, N.R., & Smith, H.K. (1998). Applied regression analysis (3rd edition). Wiley.).

R 2 = i = 1 n e y ^ i y ¯ 2 i = 1 n e y i y ¯ 2 (6)

Other performance metric adopted in this work is the root mean square error (RMSE). It is calculated for all machine learning models and is given by Equation 7.

R M S E = i = 1 n e y ^ i y ¯ 2 n e (7)

The machine learning techniques used in this work are k-nearest neighbor (KNN), kernel-KNN (KKNN), decision tree (DT), random forest (RF), artificial neural networks (ANN) and support vector machines (SVM). The following subsections present them, with a brief overview of its functionality. They were chosen considering their popularity within machine learning applications, their different biases and their reasonable results towards this work dataset. Multiple linear regression (LR) is also included as a baseline for the performance of the techniques.

4.1 KNN and KKNN

The KNN technique understands each example as a point whose coordinates are the inputs. It expects that a new example would have an output similar to those that are close in this input space. The regression problem can use Equation 8, which defines the output of the new example as the average value of its k nearest neighbors.

y ^ i = 1 k j = 1 k y j (8)

This work weights the output of each neighbor with respect to its distance to the new example, giving more weight to closer ones to improve accuracy (Dudani, 1976Dudani, S.A. (1976). The distance-weighted k-nearest-neighbor rule. IEEE Transactions on Systems, Man, and Cybernetics, SMC-6(4), 325-327.). It calculates the distance using the Minkowski metric, as presented in Equation 9. In this work p=2, which leads to the Euclidian metrics. Equation 9 gives the distance between arbitrary points represented by vectors a and b, with components a1, ,an and b1, ,bn, considering an n-dimensional space.

d a , b = i = 1 n a i b i p 1 / p (9)

KNN has the disadvantage of poor performance for some type of complex problems (Kuo et al., 2008Kuo, B., Yang, J., Sheu, T., & Yang, S. (2008). Kernel-based KNN and gaussian classifiers for hyperspectral image classification. In Geoscience and Remote Sensing Society (Org.), 2008 International Geoscience and Remote Sensing Symposium (pp. 1006-1008). Piscataway: IEEE.). The KKNN technique solves this problem by changing the distribution of samples, mapping them into a higher dimensional input space. The objective is to obtain a linear problem in this new space, equivalent to the nonlinear problem of the original space. Equation 10 presents an example of mapping a n-dimensional input space into a m-dimensional space:

a = a 1 , , a n m a p p i n g ψ a = φ 1 a , , φ m a , a S 1 , ψ a S 2 (10)

S1 is the original n-dimension space and S2 the new m-dimension space. a is a vector in S1 and ψa is its corresponding vector in S2. ψ defines the mapping from S1 to S2 and φi, i=1,,m, are input mapping functions. One problem in this approach is that finding ψ is usually impracticable. Nevertheless, the mapping does not require ψ if the internal product ψa.ψb is known for arbitrary vectors a and b. This inner product is called kernel (Yu et al., 2002Yu, K., Ji, L., & Zhang, X. (2002). Kernel nearest-neighbor algorithm. Neural Processing Letters, 15(2), 147-156. http://dx.doi.org/10.1023/A:1015244902967.
http://dx.doi.org/10.1023/A:101524490296...
).

The most commonly used kernel functions are: polynomial, radial basis and sigmoid, as shown in Equations 11, 12 and 13, respectively:

K a , b = 1 + a . b ρ (11)
K a , b = e x p a b 2 σ 2 (12)
K a , b = t a n h γ . a . b + ω (13)

where ρ, σ, γ and ω are adjustable parameters and a.b is the inner product between vectors a and b. This work uses the radial basis kernel based on preliminary tests.

4.2 DT and RF

A DT model is a flow-chart-like structure, with nodes that create ramifications dividing the dataset. It starts with a single root node that receives the complete dataset and distributes it to other nodes using a rule, which is usually an inequality applied to one of the inputs. New nodes receive portions of the dataset, subjects them to their rules and distributes them to other nodes, forming the branches of the tree. The last nodes, called leafs, assign outputs to the examples. Figure 3 presents a scheme of a DT.

Figure 3
Example of a decision tree.

One disadvantage of DTs is that they tend to become overspecialized in the dataset used for training, which prejudices performance for new examples. This behavior is called overfitting. RF is a technique based on DTs that minimizes this problem by using a collection of different DTs built randomly. The algorithm selects a subset of examples for each tree and node, ensuring that they are different. After RF creates the trees, each one make a separate prediction and the mean gives the final value (Ho, 1995Ho, T.K. (1995). Random decision forests. In International Association for Pattern Recognition. Technical Committee 11 (Org.), Proceedings of 3rd International Conference on Document Analysis and Recognition (pp. 278-282). Los Alamitos: IEEE Computer Society Press.).

4.3 ANN

The interaction of neurons in the human brain inspires the ANN algorithm. Its structure consists of a number of processing elements or nodes that are arranged in layers: an input layer, an output layer and one or more hidden layers. Each node from the first layer receives an input xi, which is multiplied by an adjustable connection weight wij. These values are inputs for the neurons of the next layer, that sum them and add a threshold value θj to obtain a combined input Ij. Then, the algorithm applies an activation function fIj to produce the output oj, as expressed in Equations 14 and 15.

I j = w i j x i + θ j (14)
o j = f I j (15)

This work uses a sigmoid function for activation, which is expressed as:

f I j = 1 1 + e λ I j (16)

where λ is a calibration parameter.

4.4 SVM

SVMs use statistical learning principles as a basis. Their main objectives are minimizing errors associated with the training dataset and maximizing the generalization of the model (Vapnik, 1999Vapnik, V.N. (1999). An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5), 988-999.). The algorithm uses a set of functions for its regression model that can have a solution as given in Equation 17:

y = w . x + ι (17)

where x=x1,y1,,xl,yl, xRm is the input of l samples and m dimensions, yRm is the output, w is the weight vector and ι is the bias. The margin is a distance from the hyperplane which is set to contain all points, as illustrated in Figure 4. This distance is the error to be minimized, included in Equation 18 as follows:

Figure 4
Graphical representation of Support Vector Machines.
y = w . x + ι ± (18)

Equation 19 presents the function to be minimized. ξi and ξi* are parameters introduced to penalize points outside the margins and parameter C controls these penalties (Smola & Scholkopf, 2004Smola, A.J., & Scholkopf, B. (2004). A tutorial on support vector regression. Statistics and Computing, 14(3), 199-222. http://dx.doi.org/10.1023/B:STCO.0000035301.49549.88.
http://dx.doi.org/10.1023/B:STCO.0000035...
). The algorithm solves this optimization problem using Lagrange multipliers (Vapnik, 1998Vapnik, V.N. (1998). Statistical learning theory. Wiley.).

1 2 w 2 + C i = 1 l ξ i + ξ i * (19)

This procedure is valid for linear problems. One can extend it to nonlinear problems using kernels to map the input data into a higher dimensional space. It is the same approach described for the KKNN. The authors chose radial basis functions after observing better accuracy in preliminary tests.

4.5 LR

A LR seeks a linear relationship between the input variables and the output. Equation 20 represents the model generated by this kind of regression:

y ^ i = β 0 + β 1 x 1 + β 2 x 2 + + β n x n (20)

where y^i is the predicted variable, βj are the coefficients determined by the model and xj are the input values for the problem.

This technique has the advantage of being simple and widely used in geotechnical engineering practice, but it cannot reproduce non-linear behavior. Although it is not expected to obtain good results from this technique, it is included in this work as one of the baselines for the performance of the machine learning techniques.

5. Results and discussion

5.1 General application

The objective of this example is to apply the six machine learning techniques to Décourt and Meyerhof datasets, using RMSE and R2 metrics to evaluate performance. The baselines for performance are the original semi-empirical methods and LR.

Tables 6 and 7 present the performance obtained using the Décourt and Meyerhof datasets, respectively. RF was the technique with best accuracy, presenting the lowest RMSE in both tables. The second best was KNN for Décourt, followed by ANN and KKNN. For Meyerhof, the second best was ANN followed by KKNN and KNN. DT presented the worst performance, which can be explained by the tendency of this technique to overfit. SVM presented poor performance as well, worse than LR which is the baseline.

Table 6
Performance metrics using Décourt dataset.
Table 7
Performance metrics using Meyerhof dataset.

Table 8 presents a comparison between the performance of the semi-empirical methods (Décourt-Quaresma and Meyerhof), the LR and the RF algorithms. The subscript Dq indicates the use of the Décourt dataset for training. One can observe that even LR surpass the semi-empirical method of Meyerhof, encouraging the use of machine learning techniques for this type of problem. This conclusion is corroborated by other studies (Lee & Lee, 1996Lee, I., & Lee, J. (1996). Prediction of pile bearing capacity using artificial neural networks. Computers and Geotechnics, 18(3), 189-200. http://dx.doi.org/10.1016/0266-352X(95)00027-8.
http://dx.doi.org/10.1016/0266-352X(95)0...
; Pham et al., 2020Pham, T., Ly, H., Tran, V., Giap, L., Vu, H., & Duong, H. (2020). Prediction of pile axial bearing capacity using artificial neural network and random forest. Applied Sciences, 10(5), 1871. http://dx.doi.org/10.3390/app10051871.
http://dx.doi.org/10.3390/app10051871...
).

Table 8
Performance results by method.

Figure 5 complements the comparisons presented in Table 8, with abscissas representing measured values and ordinates representing predicted values. This study uses logarithm scale to better represent the wide range of values. A predicted value equal to the observed one produces a point at the black line, while poor predictions tend to produce points far from it. Note that the cloud of white circles, that represents RFMey, is clearly more concentrated around the black line than black squares and triangles, which represent semi-empirical methods. It is also shown that the semi-empirical methods tend to underestimate the load bearing capacity, while points from RFMey tend to be split in half by the black line.

Figure 5
Graphical comparison between methods studied in this work.

In order to present a complementary analysis, Table 9 presents predicted values using RFDq, the corresponding measured values Qu and the ratio between them. This ratio is organized in ascending order including all 165 load tests, with a range from 0.340 to 4.62. The objective is to produce confidence intervals for the RFDq /Qu values, allowing a better understanding of the accuracy of the algorithm.

Table 9
Relation between predicted and measured values.

The authors first verify whether the RFDq /Qu results follow a normal distribution using the Shapiro-Wilk test. The procedure uses a significance level of 5% and a starting null hypothesis H0 that data follows a normal distribution. However, when calculating the p-value, the result was far below 5%, indicating that the data does not have a normal behavior. This means that it is not possible to apply the confidence level theory for this distribution.

To solve this problem, this study proposes a less rigorous approach using the concept of percentiles. The nth percentile is a value greater than n percent of all values in the list. The authors use the RFDq /Qu ordered list from Table 9 to estimate the confidence interval, considering that it must be centralized in the list with respect to the percentiles. The analysis proposes two confidence intervals: one of 90%, that must be limited by the 5th and 95th percentiles, and one of 95%, that must be limited by the 2.5th and 97.5th percentiles. This procedure resulted 0.603, 2.185 for the 90% confidence interval of RFDq /Qu and 0.559, 2.170 for its 95% confidence interval.

Figure 6 illustrates these results. Abscissa axis represents measured values, the ordinate axis represents predicted values and each point represents a RFDq /Qu value. The continuous line is the locus of points with RFDq = Qu, while the other lines represent the limits of the confidence intervals. Considering engineering practice, this graph can give to geotechnical engineers a sense of which confidence interval would suit better their specific case.

Figure 6
Confidence intervals for ratio between observed and predicted values.

5.2 Case study

This section presents a case study with new examples to validate the generated models. The analysis uses results taken from three SPT soundings and load tests of precast concrete piles located in a construction site in Monte Largo, Paraná state, Brazil. These examples came from Lobo (2005)Lobo, B. (2005). Método de previsão de capacidade de carga de estacas: aplicação dos conceitos de energia do ensaio SPT [Doctoral thesis]. Universidade Federal do Rio Grande do Sul. and were not used to train the machine learning techniques. The objective is to evaluate what would be the accuracy of the models if applied in the future to a completely new site. Figure 7 presents the SPT values and load test information.

Figure 7
SPT sounding and load test for the pile tests. Adapted from Lobo (2005)Lobo, B. (2005). Método de previsão de capacidade de carga de estacas: aplicação dos conceitos de energia do ensaio SPT [Doctoral thesis]. Universidade Federal do Rio Grande do Sul..

The study starts calculating the results obtained with the original semi-empirical methods of Décourt-Quaresma and Meyerhof, as well as for the best performing techniques for each dataset. To facilitate comparisons, Table 10 presents all relative errors. For a predicted value y^i and an observed value yi, the relative error is REi=y^iyi/yi.

Table 10
Relative errors for the load tests from Monte Largo.

Note that the first example seems to be more difficult than the other two. One possible explanation for this disparity is the soil of this examples, with the first underrepresented within the training datasets. This issue is investigated incorporating these piles to the datasets, to verify if performance changes. The objective is verifying if the inclusion of two of the load tests of this construction site helps predicting the third one, as performed in the leave-one-out methodology. Table 11 presents the result.

Table 11
Relative error for predictions made with the updated datasets.

One can observe that most machine learning techniques presented some improvement for the first example, which is still the most difficult. For the other two, although some specific values increased, the overall performance of the techniques can be considered better. This allows concluding that the inclusion of information from the same construction site helped improving performance. In other words, the performance of the techniques for new examples depends on its representativity within the training dataset.

6. Conclusions

This work applies machine learning techniques to predict the bearing capacity of concrete precast piles. It presents two examples, the first with a general application and the second with a case study. The results obtained in the first example, considering all techniques applied to both datasets, allows concluding that RF is the best algorithm for this problem, with lower RMSE values. KNN and ANN also detached from the others, presenting the second best performance for Décourt-Quaresma and Meyerhof datasets, respectively. The semi-empirical methods of Décourt-Quaresma and Meyerhof presented a relatively poor performance in this example with an RMSE close to 900, being surpassed by all other techniques including LR. These results demonstrate that machine learning algorithms are a good alternative for predicting the ultimate bearing capacity of piles. The analysis proposed an approximation of the confidence intervals using the concept of percentile. A graph presented two intervals, 90% and 95%, to give engineers choices for the desired accuracy.

The second example presented a study to evaluate the effect of the representativity of the dataset. Results confirm that performance depends on representativity and also reveal the limits of these models, which tend to present poor accuracy for examples very different from the ones contained in the used datasets.

List of symbols

a,b: Arbitrary vectors

ai,bi: Vector components

b: Number of blows needed for the sampler to penetrate 30 cm into the soil

Ap: Area of the pile tip

f: Activation function

Ij: Combined input of a neuron

k: Number of nearest neighbors

K: Soil type factor

K(a,b): Kernel

l: Number of samples for a SVM

L: Pile length

n,m: Space dimensions

ne: Number of examples

oj: Output of a neuron

p: Exponent of Minowsky equation

qp,qs: Parameters of Meyerhof´s method

Qu: Pile bearing load capacity

R2: Coefficient of determination

REi: Relative error

Rl: Lateral friction

RMSE: Root mean square error

Rp: Tip resistance

Rt: Pile load capacity

S1,S2: Spaces of dimension n and m, respectfully

SPTl: Mean NSPT index along the pile shaft

SPTp: Mean NSPT index around the pile tip

U: Pile perimeter

w: Weight vector

wij: Adjustable connection weight

x: Input of a SVM

xi: Input of a neuron

xi,yi: Components of x and y, respectfully

y: Output of a SVM

yi: Observed value

y^i: Predicted value

y¯: Mean of all observed values

ξi,ξi*,C: Parameters of a SVM

α, β: Parameters of Décourt-Quaresma method

θj: Threshold value

ρ,σ,γ,ω: Kernel parameters

φi: Component of 𝜓

𝛽j: Coefficients to be determined for a LR

𝜄: Bias of a SVM

𝜆: Calibration parameter

𝜓: Mapping vector

𝜖: Error to be minimized

Acknowledgements

To the Coordination of Improvement of Higher Education Personnel (CAPES) for granting the first author’s scholarship.

  • Discussion open until February 28, 2022.

References

  • Alkroosh, I., Bahadori, M., Nikraz, H., & Bahadori, A. (2015). Regressive approach for predicting bearing capacity of bored piles from cone penetration test data. Journal of Rock Mechanics and Geotechnical Engineering, 7(5), 584-592. http://dx.doi.org/10.1016/j.jrmge.2015.06.011
    » http://dx.doi.org/10.1016/j.jrmge.2015.06.011
  • Aoki, N., & Velloso, D. (1975). An approximate method to estimate the bearing capacity of piles. In Proc., 5th Pan-American Conf. of Soil Mechanics and Foundation Engineering (pp. 367-376) Buenos Aires: International Society of Soil Mechanics and Geotechnical Engineering.
  • Ardalan, H., Eslami, A., & Nariman-Zadeh, N. (2009). Piles shaft capacity from CPT and CPTu data by polynomial neural networks and genetic algorithms. Computers and Geotechnics, 36(4), 616-625. http://dx.doi.org/10.1016/j.compgeo.2008.09.003
    » http://dx.doi.org/10.1016/j.compgeo.2008.09.003
  • Arel, E. (2012). Predicting the spatial distribution of soil profile in Adapazari/Turkey by artificial neural networks using CPT data. Computers & Geosciences, 43, 90-100. http://dx.doi.org/10.1016/j.cageo.2012.01.021
    » http://dx.doi.org/10.1016/j.cageo.2012.01.021
  • Associação Brasileira de Normas Técnicas – ABNT (2019). Projeto e execução de fundações Associação Brasileira de Normas Técnicas, Rio de Janeiro.
  • Benali, A., Nechnech, A., Boukhatem, B., Hussein, M.N., & Karry, M. (2018). Neural networks and principle component analysis approaches to predict pile capacity in sand. In MATEC Web of Conference (Org.), 2nd International Congress on Materials & Structural Stability Les Ulis: EDP Sciences. https://doi.org/10.1051/matecconf/201814902025
    » https://doi.org/10.1051/matecconf/201814902025
  • Bhattacharya, B., & Solomatine, D.P. (2006). Machine learning in soil classification. Neural Networks, 19(2), 186-195. http://dx.doi.org/10.1016/j.neunet.2006.01.005
    » http://dx.doi.org/10.1016/j.neunet.2006.01.005
  • Bigarella, J.J., & Salamuni, R. (1962). Caracteres texturais dos sedimentos da bacia de Curitiba Instituto de Geologia.
  • Bonini Neto, A., Bonini, C.S.B., Bisi, B.S., Reis, A.R., & Coletta, L.F.S. (2017). Artificial neural network for classification and analysis of degraded soils. IEEE Latin America Transactions, 15(3), 503-509.
  • Bui, D.T., Moayedi, H., Gör, M., Jaafari, A., & Foong, L.K. (2019). Predicting slope stability failure through machine learning paradigms. ISPRS International Journal of Geo-Information, 8(9), 395. http://dx.doi.org/10.3390/ijgi8090395
    » http://dx.doi.org/10.3390/ijgi8090395
  • Carvalho, L.O., & Ribeiro, D.B. (2019). Soil classification system from cone penetration test data applying distance-based machine learning algorithms. Soils and Rocks, 42(2), 167-178. http://dx.doi.org/10.28927/SR.422167
    » http://dx.doi.org/10.28927/SR.422167
  • Décourt, L., & Quaresma, A. (1978). Capacidade de carga de estacas a partir de valores de SPT. In Congresso Brasileiro de Mecânica dos Solos e Engenharia de Fundações (pp. 45-53). São Paulo: Associação Brasileira de Mecânica dos Solos e Engenharia Geotécnica.
  • Décourt, L., & Quaresma, A. (1998). Análise e Projeto de Fundações Profundas. In: W. Hachich (Eds.), Fundações: teoria e prática (pp. 275-276). PINI.
  • Draper, N.R., & Smith, H.K. (1998). Applied regression analysis (3rd edition). Wiley.
  • Dudani, S.A. (1976). The distance-weighted k-nearest-neighbor rule. IEEE Transactions on Systems, Man, and Cybernetics, SMC-6(4), 325-327.
  • Ferentinou, M.D., & Sakellariou, M.G. (2007). Computational intelligence tools for the prediction of slope performance. Computers and Geotechnics, 34(5), 362-384. http://dx.doi.org/10.1016/j.compgeo.2007.06.004
    » http://dx.doi.org/10.1016/j.compgeo.2007.06.004
  • Goh, A.T.C., & Goh, S.H. (2007). Support vector machines: their use in geotechnical engineering as illustrated using seismic liquefaction data. Computers and Geotechnics, 34(5), 410-421. http://dx.doi.org/10.1016/j.compgeo.2007.06.001
    » http://dx.doi.org/10.1016/j.compgeo.2007.06.001
  • Hanna, A.M., Ural, D., & Saygili, G. (2007). Neural network model for liquefaction potential in soil deposits using Turkey and Taiwan earthquake data. Soil Dynamics and Earthquake Engineering, 27(6), 521-540. http://dx.doi.org/10.1016/j.soildyn.2006.11.001
    » http://dx.doi.org/10.1016/j.soildyn.2006.11.001
  • Ho, T.K. (1995). Random decision forests. In International Association for Pattern Recognition. Technical Committee 11 (Org.), Proceedings of 3rd International Conference on Document Analysis and Recognition (pp. 278-282). Los Alamitos: IEEE Computer Society Press.
  • Ismail, A., & Jeng, D. (2011). Modelling load–settlement behavior of piles using high-order neural network (HON-PILE model). Engineering Applications of Artificial Intelligence, 24(5), 813-821. http://dx.doi.org/10.1016/j.engappai.2011.02.008
    » http://dx.doi.org/10.1016/j.engappai.2011.02.008
  • Jesswein, M., & Liu, J. (2018). Predicting the side resistance of piles using a genetic algorithm and SPT n-values. In Canadian Geotechnical Society (Org.), Proceedings of the 71st Canadian Geotechnical Conference and the 13th Joint CGS/IAH-CNC Groundwater Conference - GeoEdmonton 2018 : moving forward (pp. 1-8). Richmond: Canadian Geotechnical Society.
  • Juang, C.H., & Chen, C.J. (1999). CPT-based liquefaction evaluation using artificial neural networks. Computer-Aided Civil and Infrastructure Engineering, 14(3), 221-229. http://dx.doi.org/10.1111/0885-9507.00143
    » http://dx.doi.org/10.1111/0885-9507.00143
  • Kardani, N., Zhou, A., Nazem, M., & Shen, S. (2020). Estimation of bearing capacity of piles in cohesionless soil using optimised machine learning approaches. Geotechnical and Geological Engineering, 38(2), 2271-2291. http://dx.doi.org/10.1007/s10706-019-01085-8
    » http://dx.doi.org/10.1007/s10706-019-01085-8
  • Kohestani, V.R., Hassanlourad, M., & Ardakani, A. (2015). Evaluation of liquefaction potential based on CPT data using random forest. Natural Hazards, 79(2), 1079-1089. http://dx.doi.org/10.1007/s11069-015-1893-5
    » http://dx.doi.org/10.1007/s11069-015-1893-5
  • Kordjazi, A., Nejad, F.P., & Jaksa, M.B. (2013). Prediction of load-carrying capacity of piles using a support vector machine and improved data collection. In G. Cassidy (Ed.), Proceedings of the 12th Australia New Zealand conference on geomechanics: the changing face of the earth - geomechanics & human influence (pp. 1-8). Wellington: Opus.
  • Kordjazi, A., Nejad, F.P., & Jaksa, M.B. (2014). Prediction of ultimate axial load-carrying capacity of piles using a support vector machine based on CPT data. Computers and Geotechnics, 55, 91-102. http://dx.doi.org/10.1016/j.compgeo.2013.08.001
    » http://dx.doi.org/10.1016/j.compgeo.2013.08.001
  • Kovacevic, M., Bajat, B., & Gajic, B. (2010). Soil type classification and estimation of soil properties using support vector machines. Geoderma, 154(3-4), 340-347. http://dx.doi.org/10.1016/j.geoderma.2009.11.005
    » http://dx.doi.org/10.1016/j.geoderma.2009.11.005
  • Kuo, B., Yang, J., Sheu, T., & Yang, S. (2008). Kernel-based KNN and gaussian classifiers for hyperspectral image classification. In Geoscience and Remote Sensing Society (Org.), 2008 International Geoscience and Remote Sensing Symposium (pp. 1006-1008). Piscataway: IEEE.
  • Lee, I., & Lee, J. (1996). Prediction of pile bearing capacity using artificial neural networks. Computers and Geotechnics, 18(3), 189-200. http://dx.doi.org/10.1016/0266-352X(95)00027-8
    » http://dx.doi.org/10.1016/0266-352X(95)00027-8
  • Livingston, G., Piantedosi, M., Kurup, P., & Sitharam, T.G. (2008). An approximate method to estimate the bearing capacity of piles. In X. Zeng, M.T. Manzari, & D.R. Hiltunen (Eds.), Geotechnical Earthquake Engineering and Soil Dynamics IV Congress (pp. 1-10). Reston: American Society of Civil Engineers.
  • Lobo, B. (2005). Método de previsão de capacidade de carga de estacas: aplicação dos conceitos de energia do ensaio SPT [Doctoral thesis]. Universidade Federal do Rio Grande do Sul.
  • Lok, T.M.H., & Che, W.F. (2004). Axial capacity prediction for driven piles using ANN: model comparison. In M.K. Yegian, & E. Kavazanjian (Eds.), Geotechnical Engineering for Transportation Projects (pp. 697-704). Reston: American Society of Civil Engineers. https://doi.org/10.1061/40744(154)56
    » https://doi.org/10.1061/40744(154)56
  • Maizir, H., & Kassim, K.A. (2013). Neural network application in prediction of axial bearing capacity of driven piles. In O. Castillo, C. Douglas, D.D. Feng & J. Lee (Eds.), Proc of the International MultiConference of Engineers and Computer Scientists Hong Kong: Newswood Limited.
  • Maxwell, A.E., Sharma, M., Kite, J.S., Donaldson, K.A., Thompson, J.A., Bell, M.L., & Maynard, S.M. (2020). Slope failure prediction using random forest machine learning and lidar in an eroded folded mountain belt. Remote Sensing, 12(3), 486. http://dx.doi.org/10.3390/rs12030486
    » http://dx.doi.org/10.3390/rs12030486
  • Meyerhof, G. (1976). Bearing capacity and settlements of pile foundations. Journal of Geotechnical Engineering, 102, 197-228. http://dx.doi.org/10.1061/AJGEB6.0000243
    » http://dx.doi.org/10.1061/AJGEB6.0000243
  • Moayedi, H., & Hayati, S. (2019). Artificial intelligence design charts for predicting friction capacity of driven pile in clay. Neural Computing & Applications, 31(11), 7429-7445. http://dx.doi.org/10.1007/s00521-018-3555-5
    » http://dx.doi.org/10.1007/s00521-018-3555-5
  • Momeni, E., Nazir, R., Armaghani, D., & Maizir, H. (2015). Application of artificial neural network for predicting shaft and tip resistances of concrete piles. Earth Sciences Research Journal, 19(1), 85-93.
  • Morais, T.S.O., Tsuha, C.H.C., Bandeira Neto, L.A., & Singh, R.M. (2020). Effects of seasonal variations on the thermal response of energy piles in an unsaturated Brazilian tropical soil. Energy and Building, 216, 1-19. http://dx.doi.org/10.1016/j.enbuild.2020.109971
    » http://dx.doi.org/10.1016/j.enbuild.2020.109971
  • Nawari, N.O., & Liang, R. (2000). Intelligent hybrid system for the design of pile foundations. In N.D. Dennis, R. Castelli, & M.W. O'Neill (Eds.), New Technological and Design Developments in Deep Foundations (pp. 312-326). Reston: American Society of Civil Engineers. https://doi.org/10.1061/40511(288)22
    » https://doi.org/10.1061/40511(288)22
  • Nejad, F.P., & Jaksa, M.B. (2017). Load-settlement behavior modeling of single piles using artificial neural networks and CPT data. Computers and Geotechnics, 89, 9-21. http://dx.doi.org/10.1016/j.compgeo.2017.04.003
    » http://dx.doi.org/10.1016/j.compgeo.2017.04.003
  • Nejad, F.P., Kakhi, M.B., & McCabe, B.A. (2009). Prediction of pile settlement using artificial neural networks based on standard penetration test data. Computers and Geotechnics, 36(7), 1125-1133. http://dx.doi.org/10.1016/j.compgeo.2009.04.003
    » http://dx.doi.org/10.1016/j.compgeo.2009.04.003
  • Pal, M., & Deswal, S. (2008). Modeling pile capacity using support vector machines and generalized regressions neural network. Journal of Geotechnical and Geoenvironmental Engineering, 134(7), 1021-1024. http://dx.doi.org/10.1061/(ASCE)1090-0241(2008)134:7(1021)
    » http://dx.doi.org/10.1061/(ASCE)1090-0241(2008)134:7(1021)
  • Pal, M., & Deswal, S. (2010). Modelling pile capacity using Gaussian process regression. Computers and Geotechnics, 37(7), 924-947. http://dx.doi.org/10.1016/j.compgeo.2010.07.012
    » http://dx.doi.org/10.1016/j.compgeo.2010.07.012
  • Pham, T., Ly, H., Tran, V., Giap, L., Vu, H., & Duong, H. (2020). Prediction of pile axial bearing capacity using artificial neural network and random forest. Applied Sciences, 10(5), 1871. http://dx.doi.org/10.3390/app10051871
    » http://dx.doi.org/10.3390/app10051871
  • Salgado, R. (2008). The engineering of foundations McGraw Hill.
  • Samui, P. (2008). Support vector machine applied to settlement of shallow foundations on cohesionless soils. Computers and Geotechnics, 35(3), 419-427. http://dx.doi.org/10.1016/j.compgeo.2007.06.014
    » http://dx.doi.org/10.1016/j.compgeo.2007.06.014
  • Santos Junior, O. (1988). Previsão do comportamento carga-recalque de estacas pré-moldadas de concreto [Masters thesis]. Universidade de São Paulo.
  • Shahin, M.A. (2010). Intelligent computing for modeling axial capacity of pile foundations. Canadian Geotechnical Journal, 47(2), 230-243. http://dx.doi.org/10.1139/T09-094
    » http://dx.doi.org/10.1139/T09-094
  • Smola, A.J., & Scholkopf, B. (2004). A tutorial on support vector regression. Statistics and Computing, 14(3), 199-222. http://dx.doi.org/10.1023/B:STCO.0000035301.49549.88
    » http://dx.doi.org/10.1023/B:STCO.0000035301.49549.88
  • Van Der Veen, C. (1953). The bearing capacity of a pile. In International Conference on Soil Mechanics and Foundation Engineering (Org.), International Conference on Soil Mechanics and Foundation Engineering (pp. 367-376). Zurich: ICOSOMEF.
  • Vapnik, V.N. (1998). Statistical learning theory. Wiley.
  • Vapnik, V.N. (1999). An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5), 988-999.
  • Vianna, A. (2000). Análise de provas de carga estática em estacas pré-moldadas cravadas na cidade de Curitiba e região metropolitana [Masters thesis]. Universidade de São Paulo. https://doi.org/10.11606/D.18.2000.tde-18102006-101542
    » https://doi.org/10.11606/D.18.2000.tde-18102006-101542
  • Wong, T. (2015). Performance evaluation of classification algorithms by k-fold and leave-one-outcross validation. Pattern Recognition, 48(9), 2839-2846. http://dx.doi.org/10.1016/j.patcog.2015.03.009
    » http://dx.doi.org/10.1016/j.patcog.2015.03.009
  • Yu, K., Ji, L., & Zhang, X. (2002). Kernel nearest-neighbor algorithm. Neural Processing Letters, 15(2), 147-156. http://dx.doi.org/10.1023/A:1015244902967
    » http://dx.doi.org/10.1023/A:1015244902967

Publication Dates

  • Publication in this collection
    26 Nov 2021
  • Date of issue
    2021

History

  • Received
    06 Sept 2021
  • Accepted
    27 Oct 2021
Associação Brasileira de Mecânica dos Solos Av. Queiroz Filho, 1700 - Torre A, Sala 106, Cep: 05319-000, São Paulo - SP - Brasil, Tel: (11) 3833-0023 - São Paulo - SP - Brazil
E-mail: secretariat@soilsandrocks.com