Acessibilidade / Reportar erro

Using Self-Organizing Maps to find spatial relationships between wildlife-vehicle crashes and land use classes

Abstract

The construction and expansion of roads cause significant impacts on the environment. The main potential impacts to biotic environment are vegetation suppression, reduction of the amount and composition of animal distribution due to forest fragmentation and increasing risks of animal (domestic and wildlife) vehicle collisions. The objective of this work was to establish a relationship between the different spatial patterns in wildlife-vehicle crash, by using spatial analysis and machine learning tools. Self-Organizing Maps (SOM), an artificial neural network (ANN), was selected to reorganize the multi-dimensional data according to the similarity between them. The results of the spatial pattern analysis were important to perceive that the point data pattern varies from an animal type to another. The events occur spatially clustered and are not uniformly distributed along the highway. SOM was able to analyze the relationship between multiple variables, linear and non-linear, such as ecological data, and established distinct spatial patterns per each animal type. In the studied area, most of the wildlife was run over very close to forest area and water bodies, and not so close to sugarcane fields, forestry and built environment. A considerable part of the wildlife-vehicle collisions occurred in areas with diverse landscape.

Key words
Accident prevention; artificial neural networks; geographic information systems; machine learning; road safety; wildlife-vehicle crash

INTRODUCTION

Globally, most terrestrial wildlife deaths are caused by wildlife-vehicle collisions (Forman 1998FORMAN RTT. 1998. Road ecology: a solution for the giant embracing us. Landsc Ecol 13(4)., Freitas et al. 2010FREITAS SR, HAWBAKER TJ & METZGER JP. 2010. Effects of roads, topography, and land cover on forest cover dynamics in the Brazilian Atlantic Forest. For Ecol Manage 259(3): 410-417., Teixeira et al. 2013TEIXEIRA FZ ET AL. 2013. Are road-kill hotspots coincident among different vertebrate groups?. Oecologia Aust 17(1): 36-47.). Aside from their direct impact on biodiversity, wildlife-vehicle collisions affect the safety and assets of road users (Huijser et al. 2013HUIJSER MP, ABRA FD & DUFFIELD JW. 2013. Mammal road mortality and cost-benefit analyses of mitigation measures aimed at reducing crashes with capybara (Hydrochoerus hydrochaeris) in São Paulo State, Brazil. Oecologia Aust 17(1): 129-146.). Therefore, it is necessary to investigate the locations and circumstances under which wildlife-vehicle collisions occur and to identify the spatial patterns connecting them. This information would inform not only the efforts of infrastructure planners, road safety specialists, and managers of wildlife populations, but also the development of stable mitigation measures aimed at improving driver and animal safety and minimizing economic losses.

Litvaitis & Tash (2008)LITVAITIS JA & TASH JP. 2008. An approach toward understanding wildlife-vehicle collisions. Environ Manage 42(4): 688-697. affirm that research to understand the factors that contribute to wildlife-vehicle collisions (WVC) can be partitioned into several major themes, including (i) characteristics associated with roadkill hot spots, (ii) identification of road-density thresholds that limit wildlife populations, and (iii) species specific models of vehicle collision rates that incorporate information on roads (e.g., proximity, width, and traffic volume) and animal movements. Results revealed important species-specific differences, with traffic volume and rate of movement by candidate species having the greatest influence on collision rates.

Intending to summarize empirical WVC findings to facilitate the application of this knowledge to the planning, and design of mitigation strategies on roads, Gunson et al. (2011)GUNSON KE, MOUNTRAKIS G & QUACKENBUSH LJ. 2011. Spatial wildlife-vehicle crash models: A review of current work and its application to transportation mitigation project. J Environ Manage 92(4): 1074-1082. conducted a review restricted to manuscripts that used generalized linear models to statistically determine the influence that numerous explanatory predictors have on the location of WVCs. They conclude that WVCs exhibit clustering on roads, which is attributed to specific landscape and road-related factors.

In a similar spatial analysis approach to this our paper, Morelle et al. (2013)MORELLE K, LEHAIRE F & LEJEUNE P. 2013. Spatio-temporal patterns of wildlife-vehicle collisions in a region with a high-density road network. Nature Conserv (5): 53-73. report data on WVC in Wallonia, southern Belgium for wild boar, roe deer, red deer, and red fox, clustering the accidents for all these species, mapping via Kernel density analysis. The Authors also performed a temporal analysis which was not developed by us given the lack of data for this type of analysis.

Sáenz-de-Santa-María & Tellería (2015)SÁENZ-DE-SANTA-MARÍA A & TELLERÍA JL. 2015. Wildlife-vehicle collisions in Spain. Eur J Wildl Res 61(3): 399-406. review official, unpublished information on WVC provided by the Spanish authorities to assess the main features and geographic distribution of this wildlife human interaction. They use the reported vehicle collisions to explore: (a) the identity, conservation status, and biological traits of species involved in most WVC; (b) the distribution of those areas within the country where these collisions are more likely to occur; and (c) the economic and human costs (injured persons) of this wildlife-human interaction.

Bíl et al. (2016)BÍL M, ANDRÁŠIK, R, SVOBODA, T & SEDONÍK J. 2016. The KDE+ software: a tool for effective identification and ranking of animal-vehicle collision hotspots along networks. Landsc Ecol 31(2): 231-237., present an objective method for hotspot identification that can be used for animal-vehicle collisions (AVC) data, using the Kernel density estimation – KDE and determining the significance level of hotspots. According to the Authors, the prioritization of hotspots allows a transportation manager to effectively allocate resources to a feasible number of identified hotspots.

In Pagany (2020)PAGANY R. 2020. Wildlife-vehicle collisions-Influencing factors, data collection and research methods. Biol Conserv 251: 108758. a review of WVC is presented and identify factors such as the proximity to forest, a gentle topography with sparsely curves, street width, and seasonal differences are common denominators for WVCs - independent of the species -, while traffic volume, the distance to urban areas, or road accompanying infrastructure are not assignable influencing or non-influencing factors.

According to Shilling et al. (2020)SHILLING F, COLLINSON W, BIL M, VERCAYIE D, HEIGL F, PERKINS SE & MACDOUGALL S. 2020. Designing wildlife-vehicle conflict observation systems to inform ecology and transportation studies. Biol Conserv 251: 108797.: “Preventing WVC begins with recording locations of conflict, such as vehicle crashes, animal carcasses (roadkill), or animal behavior around roads, such as avoidance of roads or crossing-behavior”. They developed a web-systems for reporting wildlife-vehicle-conflict reviews and provided recommendations for future WVC reporting systems and guidelines for the management of road networks.

Schwartz et al. (2020)SCHWARTZ AL, SHILLING FM & PERKINS SE. 2020. The value of monitoring wildlife roadkill. Eur J Wildl Res 66(1): 1-12. affirm “The number of WVCs has an obvious value in estimating the direct effects of roads on wildlife, i.e. mortality due to vehicle collisions”. They provide a review through a series of case studies contributing to the advancement of knowledge in species distributions, population dynamics, and animal behavior, as well as informing us about the health of the species and of the environment. Propose that monitoring roadkill facilitates five critical areas of ecological study: (1) monitoring of roadkill numbers, (2) monitoring of population trends, (3) mapping of native and invasive species distributions, (4) animal behavior, and (5) monitoring of contaminants and disease. Our paper agrees with “monitoring of roadkill numbers” with the proposal to provide guidelines for transportation planning.

Valerio et al. (2021)VALERIO F, BASILE M & BALESTRIERI R. 2021. The identification of wildlife-vehicle collision hotspots: Citizen science reveals spatial and temporal patterns. Ecol Process 10(1): 1-13. identify spatio-temporal trends of roadkill occurrence using citizen science data from one of the most urbanized and biodiversity-rich regions of Italy. Temporal trends were analyzed using generalized additive models, while landscape patterns were assessed by identifying significant thresholds over land cover gradients, related to increases in relative roadkill abundance, by employing threshold indicator taxa analysis. They develop a map of potential roadkill risk that could assist in planning the placement of mitigation measures. Citizen science contributions from highly populated areas allowed data collection over a large area and a dense road network, and also directly led to the evaluation of management decisional options.

Studies discussing the results of primary surveys of wildlife-vehicle collisions (Rowden et al. 2008ROWDEN P, STEINHARDT D & SHEEHAN M. 2008. Road crashes involving animals in Australia. Accid Anal Prev 40(6): 1865-1871.), focus on the specific species of animals involved in such collisions (Taylor et al. 2002TAYLOR SK, BUERGELT CD, ROELKE-PARKER ME, HOMER BL & ROTSTEIN DS. 2002. Causes of Mortality of Free-Ranging Florida Panthers. J Wildl Dis 38(1): 107-114., Huijser & Bergers 2000HUIJSER MP & BERGERS PJM. 2000. The effect of roads and traffic on hedgehog (Erinaceus europaeus) populations. Biol Conserv 95: 111-116.), evaluating the efficiency of wildlife crossings (Glista et al. 2009GLISTA DJ, DEVAULT TL & DEWOODY JA. 2009. A review of mitigation measures for reducing wildlife mortality on roadways. Landsc Urban Plan 91(1): 1-7.), estimating the costs generated by wildlife-vehicle collisions and implementing mitigating measures (Huijser et al. 2013HUIJSER MP, ABRA FD & DUFFIELD JW. 2013. Mammal road mortality and cost-benefit analyses of mitigation measures aimed at reducing crashes with capybara (Hydrochoerus hydrochaeris) in São Paulo State, Brazil. Oecologia Aust 17(1): 129-146., 2009HUIJSER MP, DUFFIELD JW, CLEVENGER AP, AMENT RJ & MCGOWEN PT. 2009. Cost-benefit analysis of mitigation measures aimed at reducting crashes with large ungulates in the United States and Canada, a decision support tool. Ecol. Soc 14(2): 15.), and using spatial analysis methods to understand the spatial patterns of wildlife-vehicle collisions (Barthelmess 2014BARTHELMESS EL. 2014. Spatial distribution of road-kills and factors influencing road mortality for mammals in Northern New York State. Biodivers Conserv 2(10): 2491-2514., Clevenger et al. 2003CLEVENGER AP, CHRUSZCZ B & GUNSON KE. 2003. Spatial patterns and factors influencing small vertebrate fauna road-kill aggregations. Biol Conserv 109: 15-26., De Freitas et al. 2015DE FREITAS SR, DE OLIVEIRA NA & CIOCHETI G. 2015. How landscape features influence road-kill of three species of mammals in the brazilian savanna? Oecologia Aust 18: 35-45., Gunson et al. 2011GUNSON KE, MOUNTRAKIS G & QUACKENBUSH LJ. 2011. Spatial wildlife-vehicle crash models: A review of current work and its application to transportation mitigation project. J Environ Manage 92(4): 1074-1082.), among others, have been conducted. However, these studies are relatively recent. Further research and a greater amount of data are needed to improve the identification of the species involved in collisions with vehicles, evaluate the consistency of wildlife crossings, and identify spatial patterns in wildlife-vehicle collisions.

Self-organizing maps (SOMs) are tools based on artificial neural networks (ANNs) and are used to analyze and visualize multidimensional data. They reorganize an N-dimensional dataset, where N is the number of variables involved in a two-dimensional (2D) map. Based on sequences of competition and collaboration, best matching units (BMUs) are chosen to represent sample groups according to the similarity of their relationships. A SOM is an ideal method for analyzing complex data, as it can be used to extract linear and non-linear relationships from a data set, and can be effectively applied to classification and association (Park et al. 2003PARK YS, CÉRÉGHINO R, COMPIN A & LEK S. 2003. Applications of artificial neural networks for patterning and predicting aquatic insect species richness in running waters. Ecol Modell 160(3): 265-280.), for example, Kussul et al. (2017)KUSSUL N, LAVRENIUK M, SKAKUN S & SHELESTOV. 2017. A. Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data. IEEE Geosci Remote Sens Lett 14(5): 778-782. used a SOM to restore missing values in low-resolution satellite images using neural coefficient weights.

The main objective of this study was to establish a relationship between the different spatial patterns in wildlife-vehicle collisions by using spatial analysis and machine learning tools to understand the connection between wildlife-related collisions, the animal species involved, and the variables that represent land cover and road characterization features. To this end, a SOM algorithm was used to extract the relationships between wildlife species involved in collisions and the spatial characteristics that contextualize crash events. These characteristics included proximity to and the relative area of forest formations, water bodies, silviculture, and built-up areas, as well as the maximum speed allowed on the road, traffic volume, landscape diversity, and time of the day when the crash occurred.

MATERIALS AND METHODS

The study area is a 207 km stretch of toll highway intercepting 14 municipalities that are located in the western part of the state of São Paulo, Brazil. The land cover in the study area consists of pastures and anthropogenic areas, sugarcane fields, silviculture, water bodies, and built-up areas (FBDS 2017FBDS - Fundação Brasileira para o Desenvolvimento Sustentável. 2017. Mapeamento do uso do solo dos biomas Cerrado e Mata Atlântica na escala 1:20.000, Rio de Janeiro.). The study area is located in the western central plateau of Brazil and is characterized by wide and low hills, which range from being minimally eroded to being flat, with valleys that are not deep and a low drainage density (Ross & Moroz 1997ROSS JLS & MOROZ IC. 1997. Geomorphological map of São Paulo State. São Paulo: USP/FAPESP/IPT.).

This study used secondary data collected by public agencies—the Secretaria do Meio Ambiente do Estado de São Paulo (SMA), Fundação Brasileira para o Desenvolvimento Sustentável (FBDS), Instituto Nacional de Pesquisas Espaciais (INPE), and the highway management company. Primary data were not used in this study. The data collected were related to collisions involving animals, land cover, and land cover in the areas around the highway, as shown in Figure 1, and the characteristics of the highway itself (Table I). Animal-related collision data were obtained from CETESB, public process no. 13716/2001 and no. 154/2011. Figure 2 shows a flowchart summarizing the methodology.

Figure 1
Map of the study area.
Figure 2
Flowchart summarizing the adopted methodology.
Table I
List of Secondary Spatial Daa used for the research work.

A total of 1469 wildlife-vehicle collisions involving 16 types of animals that occurred between 2011 and 2015 (Table II) were mapped. Mammals, the animal group most reported, and potentially endangered animals were studied further. This is because wildlife-vehicle collisions are usually monitored from a motor vehicle and because medium or large mammals (those over 1 kg) are easier to see from such vehicles than smaller animals, such as most amphibians, some mammals, and reptiles (Glista et al. 2008GLISTA DJ, DEVAULT TL & DEWOODY JA. 2008. Vertebrate road mortality predominantly impacts amphibians. Herpetol Conserv Biol 3(1): 77-87.). Furthermore, medium and large animals are more likely to be involved in accidents that cause material damage and present a fatal risk to highway users (Huijser et al. 2013HUIJSER MP, ABRA FD & DUFFIELD JW. 2013. Mammal road mortality and cost-benefit analyses of mitigation measures aimed at reducing crashes with capybara (Hydrochoerus hydrochaeris) in São Paulo State, Brazil. Oecologia Aust 17(1): 129-146., Bueno et al. 2013BUENO C, FAUSTINO MT & FREITAS SR. 2013. Influence of landscape characteristics on capybara road-kill on highway BR-040, southeastern Brazil. Oecologia Aust 17(2): 320-327.).

Table II
Animals (Variables) considered in the study.

In addition to the wildlife-vehicle collision locations, this study examined 32 variables (Table II) related to the characteristics of the road and the land use and land coverage of the surrounding area. The variables were chosen based on the data available at the time the research was conducted and the results of previous studies (Barthelmess 2014BARTHELMESS EL. 2014. Spatial distribution of road-kills and factors influencing road mortality for mammals in Northern New York State. Biodivers Conserv 2(10): 2491-2514., Gunson et al. 2011GUNSON KE, MOUNTRAKIS G & QUACKENBUSH LJ. 2011. Spatial wildlife-vehicle crash models: A review of current work and its application to transportation mitigation project. J Environ Manage 92(4): 1074-1082., Bueno et al. 2013BUENO C, FAUSTINO MT & FREITAS SR. 2013. Influence of landscape characteristics on capybara road-kill on highway BR-040, southeastern Brazil. Oecologia Aust 17(2): 320-327., Carneiro et al. 2012CARNEIRO CDC, FRASER SJ, CRÓSTA AP, SILVA AM & BARROS CEDM. 2012. Semiautomated geologic mapping using self-organizing maps and airborne geophysics in the Brazilian Amazon. Geophysics 77(4): K17-K24.). The decision to use the SOM algorithm was justified by the high number of variables.

The distances from each collision point to the nearest forest formation, water bodies, silviculture, sugarcane fields, and built-up area were calculated, and circular buffers were also considered around each crash point. The relative areas of the land cover classes within each of these buffers were measured as a percentage of the total area. The variables related to the relative area, landscape diversity, and total length of rivers in the surroundings were measured in three radius sizes, i.e., 500 m, 1 km, and 5 km, resulting in analysis areas of 0.78 km², 3.14 km², and 78.53 km², respectively. This was done to identify the radius within which the greatest number of collisions with animals occurred.

The Shannon–Wiener Diversity index (1949) was used to measure the number of species within a community (Spellerberg & Fedor 2003SPELLERBERG IF & FEDOR PJ. 2003. A tribute to Claude Shannon (1916 – 2001) and a plea for more rigorous use of species richness, species diversity and the ‘Shannon-Wiener’ Index Glob Ecol Biogeogr 12(3): 177-179.). In this study, the index was used to identify only the number of different land cover types in one parcel. This index does not differentiate types of land cover by biological importance; rather, it considers all types of coverage as equally important. For instance, forest and sugarcane fields received the same weight even though their significance to different animal species, which was not considered in our analysis, is different. It makes sense since we are identifying the hot spot occurrences places and then, trying to associate them with land cover, not the opposite.

The species involved in the collisions were originally codified as categorical variables; however, for SOM to be applied, the variables must be converted into numerical values. Binary code, which transforms each categorical attribute into a set of binary attributes (Hsu 2006HSU CC. 2006. Generalizing self-organizing map for categorical data. IEEE Trans Neural Networks 17(2): 294-304.), was used to convert each categorical value into a binary attribute. As a result, 16 columns were created, one for each species involved in the collisions (Table II), and these were populated with a 1 if an animal of a given species was run over (involved in a crash) and a 0 if an animal of a given species was not run over. In addition to the animal columns, 32 variables related to the characteristics of the road and land cover (Table III) were added as columns. The SOM pre-training database totaled 1469 rows, each of which represented one sample (one collision event), and 48 columns, each representing a variable. To ensure that the variables contributed the same weight to the analysis, they were normalized based on their variance before the SOM analysis was performed.

Table III
Land use types and road characteristics (Variables) considered in the study.

An exploratory SOM analysis was carried out to filter the variables that, according to the presented correlation, were related to wildlife-vehicle collisions (Figure 3). The elimination of variables that had a low correlation with the latter decreased map dimensionality and allowed the main relationships between variables to gain greater prominence in the SOM. The exploratory analysis was completed in two stages.

Figure 3
Flowchart of the exploratory SOM analysis.

In the first stage, exploratory analysis 1 (EA1) only considered 21 variables that were repeated at more than one scale (radius of 500 m, 1 km, and 5 km) and 16 variables related to wildlife-vehicle collisions. Only the scale with the highest correlation to the number of wildlife vehicle collisions was selected per variable. Exploratory Analysis 2 (EA2) considered 16 animal species and 18 variables related to road characteristics and land cover, all of which were eliminated in EA1. It identified and eliminated variables with low correlations (r < 0.099) with wildlife vehicle collisions. In the second stage, a definitive analysis (DA) was developed. This included 31 variables, of which 16 were animal species and the other 15 were related to land cover and road characteristics.

In other similar studies, two common rules were followed when choosing the map size, that is, the number of map units that make up the matrix. Vesanto et al. (2000)VESANTO J, HIMBERG J, ALHONIEMI E & PARHANKANGAS J. 2000. SOM Toolbox for Matlab 5. recommended that the number of map units in the matrix be approximately 5√N, where N is the total number of samples in each variable. Kohonen (2013)KOHONEN T. 2013. Essentials of the self-organizing map. Neural Networks 37: 52-65. stated that it was not possible to estimate the exact map size beforehand and that this number should be determined based on trial and error by comparing the results obtained from each trial and then selecting the size. Our dataset consisted of 1469 samples. The map size calculated using the method indicated by Vesanto et al. (2000)VESANTO J, HIMBERG J, ALHONIEMI E & PARHANKANGAS J. 2000. SOM Toolbox for Matlab 5. was approximately 191.65 units, which resulted in a 14 × 14 map unit matrix. Several maps were created for other sizes, and it was noted that the larger maps tended to have smaller quantization errors, which is the average distance between each sample and its BMU. The topographical error, which is used to measure topology preservation, represents the proportion of all samples for which the first and second BMUs are not adjacent (Céréghino & Park 2009CÉRÉGHINO R & PARK YS. 2009. Review of the Self-Organizing Map (SOM) approach in water resources: Commentary Environ Model Softw 24(8): 945-947.). According to Compin & Céréghino (2007)COMPIN A & CÉRÉGHINO R. 2007. Spatial patterns of macroinvertebrate functional feeding groups in streams in relation to physical variables and land-cover in Southwestern France. Landsc Ecol 22(8): 1215-1225., a map that is too small might not efficiently explain some of the important differences that should be detected; however, if the map is too large, the differences observed would be very small. In other words, the larger the map, the lower the number of samples per neuron, and the fewer the samples that can be grouped based on similarity.

The size of the map was set as 14 × 14 units with toroidal projection and hexagonal cells. The algorithm training commenced with a random sample for which the closest neuron was selected. This process was repeated for all the samples, and the training was completed in two stages. The same training parameters were used for EA1, EA2, and DA. The data generated during DA were displayed using 2D maps for each of the variables. In these maps, the color scale shows the neurons that have a greater variable contribution to the analysis. Furthermore, neuron clustering was completed using a U-matrix based on k-means and the Davies–Bouldin index (DBI) (Davies & Bouldin 1979DAVIES DL & BOULDIN DW. 1979. A Cluster Separation Measure; IEEE Trans. Pattern Anal. Mach. Intell PAMI 1(2): 224-227.). It displays the results more comprehensively and quantitatively.

RESULTS

No relationship was observed between the elevation data and maximum allowable vehicle speed, and wildlife-vehicle collisions. The maximum allowable vehicle speed was the same as that provided on road signs, and it did not vary significantly over the study area. The elevation data did not change significantly, given that the terrain of the study area was predominantly flat and composed of wide and low hills. Other data that could be related more to wildlife-vehicle collisions include the longitudinal profile of the roads and the actual speed of vehicles involved in the collisions. Only data available at the time the research was being conducted were used. Data on the longitudinal profile of the road and actual vehicle speeds were more difficult to obtain.

The following variables were selected from EA1: percentage of forest formation within a buffer of 500 m, percentage of water bodies within a buffer of 500 m, percentage of silviculture within a buffer of 5 km, percentage of the built-up area within a buffer of 5 km, percentage of sugarcane fields within a buffer of 500 m, total length of linear rivers within a buffer of 500 m, and the Shannon–Wiener Diversity index for types of land cover within a 1 km buffer.

EA2 generated a 14 × 14 (196) unit map, and the samples were grouped into 127 BMUs. The final quantization error was 3.35. The eliminated variables were elevation (r < 0.07), maximum speed of the vehicle (r < 0.087), and time of day: afternoon (r < 0.034). The DA generated a 14 × 14 (196) unit map, and the samples were grouped into 126 BMUs. The final quantization error was 3.02.

Two-dimensional maps for each variable are shown in Figure 4. It should be noted that, although most of the studied animals are found in specific regions, a capybara can be found in all regions; the maps of the different distances are quite complementary among them (justifying their choice), and the spatial behavior of the selected variables. The averages for the variables for land cover and highway characterization by animal species are shown in Table IV. The data on the distances from various land cover types (Table IV) confirmed that, in general, the animals were run over 315 m and 311 m, on average, from forest fragments and water bodies, respectively. In comparison, wildlife-vehicle collisions occurred 3.4 km, 2.2 km, and 5.8 km from the closest sugarcane fields, silviculture, and built-up areas, respectively.

Figure 4
Component plots showing two-dimensional maps for the variables used in the SOM analysis. Each map shows the activation of neurons based on the values of the analyzed variables. In this way, warm colors show the high values of the variables, and cool colors the low values. The last map corresponds to the U-Matrix, where it is possible to observe the similarity between neighboring samples. In this case, cool colors correspond to the high similarity between neighboring samples, while warm colors denote greater dissimilarity. The red circles exemplify the common characteristics of running over the ocelot: low distance to the nearest forest formation; regions close to sugarcane and silviculture; low traffic volume; high landscape diversity; at the end of the day or the beginning of the evening; among others.
Table IV
Animals involved incrashes and the average for land cover and highway characterization variables.

A second clustering process was applied to refine the initial process that generated the 2D variable map. This second clustering was optional, and its purpose was to present the results more clearly. The DBI was used to determine the ideal number of clusters for the dataset. To reach a statistically consistent ideal number of clusters, the DBI was applied 70 times, and the ideal mode was selected. This resulted in the creation of 15 clusters (Figure 5).

Figure 5
(a) Clustering of 15 regions with high similarity obtained based on the (e) U-Matrix and three examples for (b) capybara, (c) skunk, and (d) hare, respectively. In these three examples, the red color indicates a high concentration of accidents.

The animals involved in the highest number of collisions, capybaras (41%) and armadillos (18%), were found in large numbers in multiple clusters (Table V). No unique spatial patterns were observed for these species. Hares, canids, and collared anteaters, which were also involved in several recorded collisions, were grouped into more than one cluster. However, they were predominantly in a single cluster, which indicated a more defined pattern. The other species, which were involved in 35 collisions or fewer, were grouped into single clusters. This indicated a more defined special pattern. In general, animals were run over when they were closer to forest formations, i.e., at an average distance of 315.3 m from the forest formations. Canids and maned wolves were involved in collisions further away from forests or in areas with low-density forests. Lowland tapirs, ocelots, pumas, and monkeys were involved in collisions that occurred closer to forests and in high-density forests.

Table V
Averages for distances to different types of land cover and highway characterization, per cluster, in the definitive analysis of the self-ordering map.

These results are consistent with the literature since, according to Silveira et al (2010), some species search for food and water in dry seasons, which increases the contact with roads. In the studies of Abra (2019)ABRA FD. 2019. Mammal-vehicle collisions on toll roads in São Paulo State: implications for wildlife, human safety and costs for society. Doctoral dissertation, Universidade de São Paulo. (Unpublished). and Abra et al. (2021)ABRA FD, HUIJSER MP, MAGIOLI M, BOVO AAA & DE BARROS KMPM. 2021. An estimate of wild mammal roadkill in São Paulo state, Brazil. Heliyon 7(1): e06015., road kills are improved, during dry seasons, for some species, including manned-wolf and ocelot. The maned wolf, the largest canid in South America, is subject to many threats, including road kills and inhabiting habitats such as woodland with an open canopy, mixed forest and grassland, and wet fields (https://seaworld.org/animals/facts/mammals/maned-wolf/). Lowland tapir, also known as the Brazilian tapir, lives in the rain forests of South America (https://tapirs.org/wp-content/uploads/2018/06/TAPIR-TRACKS-A-Curriculum-Guide-for-Educators.pdf). Ocelots have been recorded in a great variety of habitats, from heavily logged and fragmented forests, to early and late successional forests, the outskirts of major cities and towns, disturbed scrub/woodland Savannah, and agricultural areas (de Oliveira et al. 2010DE OLIVEIRA TG ET AL. 2010. Ocelot ecology and its effect on the small-felid guild in the lowland neotropics. In: Biology and Conservation of Wild Felids. New York, NY: Oxford University Press Inc, p. 559-580.). Results from Azevedo et al. (2021)AZEVEDO FC, LEMOS FG, FREITAS-JUNIOR MC, ARRAIS RC, MORATO RG & AZEVEDO FCC. 2021. The importance of forests for an apex predator: spatial ecology and habitat selection by pumas in an agroecosystem. Anim Conserv 24(3): 499-509. revealed that the puma habitat has an association with forest vegetation, followed by pasture with shrubs. Road kills are important factors linked to the reduction of the cougar population in several areas (Azevedo et al. 2013AZEVEDO FC ET AL. 2013. Avaliação do risco de extinção da Onça-parda Puma concolor (Linnaeus, 1771) no Brasil. Biodivers Bras 3(1): 107-121., Benatti 2021BENATTI D, DE SANTI M, WERTHER K, TEBALDI JH & HOPPE EGL. 2021. Helminthfauna of road-killed cougars (Puma concolor) from the Northeastern Region of São Paulo State, Brazil. Braz J Vet Parasitol 30(1): e024120. https://doi.org/10.1590/S1984-29612021008.
https://doi.org/10.1590/S1984-2961202100...
). Titi monkeys occur in the Brazilian states of Rio de Janeiro, São Paulo, and Minas Gerais. Nowadays their population is restricted to forest patches within a highly fragmented landscape, where local extinction is a constant threat (Trevelin et al. 2007TREVELIN LC, PORT-CARVALHO M, SILVEIRA M & MORELL E. 2007. Abundance, habitat use and diet of Callicebus nigrifrons Spix (Primates, Pitheciidae) in Cantareira State Park, São Paulo, Brazil. Rev Bras Zool 24: 1071-1077.).

In most occurrences, animals were run over when they were closer to water bodies. Lowland tapirs, South American coatis, maned wolves, and monkeys were involved in collisions that occurred very close to water bodies (Table IV). No species were run over far from the water.

The animals, in general, were run over far from silviculture, at an average distance of 2.2 km. No species were predominantly run over very close to silviculture. Raccoon collisions (cluster 1), 18.5% of the capybaras’ collisions (clusters 4 and 12), and 14% of the armadillos (cluster 4) collisions occurred farther away from silviculture in areas with low silviculture density.

Sugarcane occupies a significant amount of land in the study area. However, collisions occurred far away from the sugarcane fields, at an average distance of 3.4 km; lowland tapirs were run over farther from sugarcane fields than the average distance; South American raccoons and monkeys were run over closer to sugarcane fields than the average distance. Fourteen percent of the armadillos’ collisions (cluster 4), 47% of the capybaras’ collisions (clusters 4 and 13), and 92% of the canid collisions occurred in areas with high densities of sugarcane fields.

In most cases, collisions with animals occurred far from built-up areas, at an average distance of 5.8 km. No species was run over very close to built-up areas in general, although in highly dense regions, South American raccoons, hares, 28.6% of the capybaras, deer, skunks, and porcupines were run over. Of all the animals studied, a small proportion (2.38%, cluster 9), consisting of capybaras, canids, hares, collared anteaters, and armadillos, were run over very close to silviculture, at an average distance of 242 m. These areas have a very diverse landscape, including a high proportion of water bodies (rivers and ponds) and sugarcane fields, and are near forest fragments.

Another small proportion (4.83%, cluster 11) of the animals studied, particularly capybaras, canids, hares, porcupines, and collared anteaters, was run over in areas with a high density of water bodies, a high density of built-up areas, and a high traffic volume. In these areas, the landscape is composed of forest fragments, rivers, dams, lagoons, and urban patches. Lowland tapirs and South American coatis were run over in areas with high landscape diversity; however, hares and maned wolves were run over in areas with low landscape diversity. Furthermore, the latter, along with monkeys, was also run over in stretches with a high traffic volume, while lowland tapirs were run over in areas where traffic volume was low. Armadillos, South American raccoons, maned wolves, hares, deer, skunks, porcupines, and giant anteaters were run over predominantly at night, whereas capybaras, monkeys, lowland tapirs, and pumas were predominantly run over during the day.

CONCLUSION

The SOM analysis results revealed that the concentrations of wildlife-vehicle collisions were related to road characteristics and land cover. This allowed the relationships between multiple linear and non-linear variables, such as ecological data, to be analyzed.

The following interventions could be implemented in areas where road kill clusters occur. First, underpass tunnels for animals should be installed so that animals can cross the road safely and avoid collisions with vehicles moving at high speeds. Second, fences should be installed parallel to roads that are adjacent to wildlife crossings to funnel animals toward crossing structures (Van Der Ree 2015). Because wildlife-vehicle collisions occurred approximately 300 m from bodies of water and forest formations, the fences should have a minimum length of 300 m, on average, on either side of the wildlife crossing structure. Finally, tools such as mobile phone applications with which road users can report accidents in real-time (Tong et al. 2020TONG Y, ZHOU Z, ZENG Y, CHEN L & SHAHABI C. 2020. Spatial crowdsourcing: a survey. VLDB J 29(1): 217-250.) should be developed so that data from road users can be collected and a richer database on wildlife-vehicle collisions can be built.

ACKNOWLEDGMENTS

The Authors thank Secretaria do Meio Ambiente do Estado de São Paulo (SMA) and the agencies Fundação Brasileira para o Desenvolvimento Sustentável (FBDS) and Instituto Nacional de Pesquisas Espaciais (INPE) for providing data. The Authors would also like to thank Companhia de Saneamento do Estado de São Paulo (CETESB) for providing the necessary data of collision to complete this study and to geologist Lucas Sandré by the final’s figures and tables editions. The Author JAQ received financial support from Conselho Nacional de Desenvolvimento Científico e Tecnológico – CNPq (Process 305188/2020-8).

REFERENCES

  • ABRA FD. 2019. Mammal-vehicle collisions on toll roads in São Paulo State: implications for wildlife, human safety and costs for society. Doctoral dissertation, Universidade de São Paulo. (Unpublished).
  • ABRA FD, HUIJSER MP, MAGIOLI M, BOVO AAA & DE BARROS KMPM. 2021. An estimate of wild mammal roadkill in São Paulo state, Brazil. Heliyon 7(1): e06015.
  • AZEVEDO FC ET AL. 2013. Avaliação do risco de extinção da Onça-parda Puma concolor (Linnaeus, 1771) no Brasil. Biodivers Bras 3(1): 107-121.
  • AZEVEDO FC, LEMOS FG, FREITAS-JUNIOR MC, ARRAIS RC, MORATO RG & AZEVEDO FCC. 2021. The importance of forests for an apex predator: spatial ecology and habitat selection by pumas in an agroecosystem. Anim Conserv 24(3): 499-509.
  • BARTHELMESS EL. 2014. Spatial distribution of road-kills and factors influencing road mortality for mammals in Northern New York State. Biodivers Conserv 2(10): 2491-2514.
  • BENATTI D, DE SANTI M, WERTHER K, TEBALDI JH & HOPPE EGL. 2021. Helminthfauna of road-killed cougars (Puma concolor) from the Northeastern Region of São Paulo State, Brazil. Braz J Vet Parasitol 30(1): e024120. https://doi.org/10.1590/S1984-29612021008.
    » https://doi.org/10.1590/S1984-29612021008
  • BÍL M, ANDRÁŠIK, R, SVOBODA, T & SEDONÍK J. 2016. The KDE+ software: a tool for effective identification and ranking of animal-vehicle collision hotspots along networks. Landsc Ecol 31(2): 231-237.
  • BUENO C, FAUSTINO MT & FREITAS SR. 2013. Influence of landscape characteristics on capybara road-kill on highway BR-040, southeastern Brazil. Oecologia Aust 17(2): 320-327.
  • CARNEIRO CDC, FRASER SJ, CRÓSTA AP, SILVA AM & BARROS CEDM. 2012. Semiautomated geologic mapping using self-organizing maps and airborne geophysics in the Brazilian Amazon. Geophysics 77(4): K17-K24.
  • CÉRÉGHINO R & PARK YS. 2009. Review of the Self-Organizing Map (SOM) approach in water resources: Commentary Environ Model Softw 24(8): 945-947.
  • CLEVENGER AP, CHRUSZCZ B & GUNSON KE. 2003. Spatial patterns and factors influencing small vertebrate fauna road-kill aggregations. Biol Conserv 109: 15-26.
  • COMPIN A & CÉRÉGHINO R. 2007. Spatial patterns of macroinvertebrate functional feeding groups in streams in relation to physical variables and land-cover in Southwestern France. Landsc Ecol 22(8): 1215-1225.
  • DAVIES DL & BOULDIN DW. 1979. A Cluster Separation Measure; IEEE Trans. Pattern Anal. Mach. Intell PAMI 1(2): 224-227.
  • DE FREITAS SR, DE OLIVEIRA NA & CIOCHETI G. 2015. How landscape features influence road-kill of three species of mammals in the brazilian savanna? Oecologia Aust 18: 35-45.
  • DE OLIVEIRA TG ET AL. 2010. Ocelot ecology and its effect on the small-felid guild in the lowland neotropics. In: Biology and Conservation of Wild Felids. New York, NY: Oxford University Press Inc, p. 559-580.
  • FBDS - Fundação Brasileira para o Desenvolvimento Sustentável. 2017. Mapeamento do uso do solo dos biomas Cerrado e Mata Atlântica na escala 1:20.000, Rio de Janeiro.
  • FORMAN RTT. 1998. Road ecology: a solution for the giant embracing us. Landsc Ecol 13(4).
  • FREITAS SR, HAWBAKER TJ & METZGER JP. 2010. Effects of roads, topography, and land cover on forest cover dynamics in the Brazilian Atlantic Forest. For Ecol Manage 259(3): 410-417.
  • GLISTA DJ, DEVAULT TL & DEWOODY JA. 2008. Vertebrate road mortality predominantly impacts amphibians. Herpetol Conserv Biol 3(1): 77-87.
  • GLISTA DJ, DEVAULT TL & DEWOODY JA. 2009. A review of mitigation measures for reducing wildlife mortality on roadways. Landsc Urban Plan 91(1): 1-7.
  • GUNSON KE, MOUNTRAKIS G & QUACKENBUSH LJ. 2011. Spatial wildlife-vehicle crash models: A review of current work and its application to transportation mitigation project. J Environ Manage 92(4): 1074-1082.
  • HSU CC. 2006. Generalizing self-organizing map for categorical data. IEEE Trans Neural Networks 17(2): 294-304.
  • HUIJSER MP, ABRA FD & DUFFIELD JW. 2013. Mammal road mortality and cost-benefit analyses of mitigation measures aimed at reducing crashes with capybara (Hydrochoerus hydrochaeris) in São Paulo State, Brazil. Oecologia Aust 17(1): 129-146.
  • HUIJSER MP & BERGERS PJM. 2000. The effect of roads and traffic on hedgehog (Erinaceus europaeus) populations. Biol Conserv 95: 111-116.
  • HUIJSER MP, DUFFIELD JW, CLEVENGER AP, AMENT RJ & MCGOWEN PT. 2009. Cost-benefit analysis of mitigation measures aimed at reducting crashes with large ungulates in the United States and Canada, a decision support tool. Ecol. Soc 14(2): 15.
  • KOHONEN T. 2013. Essentials of the self-organizing map. Neural Networks 37: 52-65.
  • KUSSUL N, LAVRENIUK M, SKAKUN S & SHELESTOV. 2017. A. Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data. IEEE Geosci Remote Sens Lett 14(5): 778-782.
  • LITVAITIS JA & TASH JP. 2008. An approach toward understanding wildlife-vehicle collisions. Environ Manage 42(4): 688-697.
  • MORELLE K, LEHAIRE F & LEJEUNE P. 2013. Spatio-temporal patterns of wildlife-vehicle collisions in a region with a high-density road network. Nature Conserv (5): 53-73.
  • PAGANY R. 2020. Wildlife-vehicle collisions-Influencing factors, data collection and research methods. Biol Conserv 251: 108758.
  • PARK YS, CÉRÉGHINO R, COMPIN A & LEK S. 2003. Applications of artificial neural networks for patterning and predicting aquatic insect species richness in running waters. Ecol Modell 160(3): 265-280.
  • ROSS JLS & MOROZ IC. 1997. Geomorphological map of São Paulo State. São Paulo: USP/FAPESP/IPT.
  • ROWDEN P, STEINHARDT D & SHEEHAN M. 2008. Road crashes involving animals in Australia. Accid Anal Prev 40(6): 1865-1871.
  • SÁENZ-DE-SANTA-MARÍA A & TELLERÍA JL. 2015. Wildlife-vehicle collisions in Spain. Eur J Wildl Res 61(3): 399-406.
  • SCHWARTZ AL, SHILLING FM & PERKINS SE. 2020. The value of monitoring wildlife roadkill. Eur J Wildl Res 66(1): 1-12.
  • SHILLING F, COLLINSON W, BIL M, VERCAYIE D, HEIGL F, PERKINS SE & MACDOUGALL S. 2020. Designing wildlife-vehicle conflict observation systems to inform ecology and transportation studies. Biol Conserv 251: 108797.
  • SPELLERBERG IF & FEDOR PJ. 2003. A tribute to Claude Shannon (1916 – 2001) and a plea for more rigorous use of species richness, species diversity and the ‘Shannon-Wiener’ Index Glob Ecol Biogeogr 12(3): 177-179.
  • TAYLOR SK, BUERGELT CD, ROELKE-PARKER ME, HOMER BL & ROTSTEIN DS. 2002. Causes of Mortality of Free-Ranging Florida Panthers. J Wildl Dis 38(1): 107-114.
  • TEIXEIRA FZ ET AL. 2013. Are road-kill hotspots coincident among different vertebrate groups?. Oecologia Aust 17(1): 36-47.
  • TONG Y, ZHOU Z, ZENG Y, CHEN L & SHAHABI C. 2020. Spatial crowdsourcing: a survey. VLDB J 29(1): 217-250.
  • TREVELIN LC, PORT-CARVALHO M, SILVEIRA M & MORELL E. 2007. Abundance, habitat use and diet of Callicebus nigrifrons Spix (Primates, Pitheciidae) in Cantareira State Park, São Paulo, Brazil. Rev Bras Zool 24: 1071-1077.
  • VALERIO F, BASILE M & BALESTRIERI R. 2021. The identification of wildlife-vehicle collision hotspots: Citizen science reveals spatial and temporal patterns. Ecol Process 10(1): 1-13.
  • VAN DER REE R, GAGNON JW & SMITH DJ. 2015. Fencing: A Valuable Tool for Reducing Wildlife-Vehicle Crashes and Funnelling Fauna to Crossing Structures. Handb Road Ecol 159-171.
  • VESANTO J, HIMBERG J, ALHONIEMI E & PARHANKANGAS J. 2000. SOM Toolbox for Matlab 5.

Publication Dates

  • Publication in this collection
    28 Nov 2022
  • Date of issue
    2022

History

  • Received
    19 May 2021
  • Accepted
    10 Sept 2021
Academia Brasileira de Ciências Rua Anfilófio de Carvalho, 29, 3º andar, 20030-060 Rio de Janeiro RJ Brasil, Tel: +55 21 3907-8100 - Rio de Janeiro - RJ - Brazil
E-mail: aabc@abc.org.br