A new world natural vegetation map for global change studies

We developed a new world natural vegetation map at 1 degree horizontal resolution for use in global climate models. We used the Dorman and Sellers vegetation classification with inclusion of a new biome: tropical seasonal forest, which refers to both deciduous and semi-deciduous tropical forests. SSiB biogeophysical parameters values for this new biome type are presented. Under this new vegetation classification we obtained a consensus map between two global natural vegetation maps widely used in climate studies. We found that these two maps assign different biomes in ca. 1/3 of the continental grid points. To obtain a new global natural vegetation map, non-consensus areas were filled according to regional consensus based on more than 100 regional maps available on the internet. To minimize the risk of using poor quality information, the regional maps were obtained from reliable internet sources, and the filling procedure was based on the consensus among several regional maps obtained from independent sources. The new map was designed to reproduce accurately both the large-scale distribution of the main vegetation types (as it builds on two reliable global natural vegetation maps) and the regional details (as it is based on the consensus of regional maps).


INTRODUCTION
The vegetation cover of the Earth surface plays a recognized important role in global climate regulation (e.g. Charney 1975, Avissar et al. 2002. Thus, to perform more realistic climate simulations, it is essential to use global vegetation maps which are regionally reliable, as well as representing the broad spectrum of natural biomes. Atmospheric general circulation models (AGCM) used in weather and climate prediction or Earth System models require correct representation of the biophysical properties of the land surface, such as those which control the fluxes of radiation, momentum, sensible and latent heat, and trace gases. Since vegeta-tion mediates all these land surface-atmosphere interactions, its accurate representation is mandatory in such models. Natural vegetation maps consider vegetation in its natural distribution without human action, while actual vegetation maps also consider anthropogenic land cover changes in ecosystems. The Centro de Previsão de Tempo e Estudos Climáticos/ Center for Ocean-Land-Atmosphere Studies AGCM (CPTEC/COLA AGCM; Cavalcanti et al. 2002), for example, uses the actual vegetation map and vegetation classification of Dorman and Sellers (1989, DS89) for representation of surface processes. The DS89 classification considers 11 natural biomes and 1 agroecosystem (winter wheat cultivation). In some global change studies, such as prediction of global climate change impacts on Earth's biomes, or large scale paleoclimatic or paleoecological reconstruc-2 DAVID M. LAPOLA et al. tion, it is recommended to use natural vegetation maps (instead of actual vegetations maps), since the past extension of agroecosystems and the factors regulating their future extension are not fully known. Furthermore, in land use change studies (e.g. Ramankutty and Foley 1999), it is useful to compare the distribution of modern croplands to global natural vegetation maps (natural vegetation maps are also referred to as potential vegetation maps, but here we use the term "natural" instead of "potential"; the latter is regarded as the vegetation type diagnosed only from climate), in order to access the consequences of human land use (Foley et al. 2005). However, there are only a few global natural vegetation maps available in the literature, and among them there are marked disagreements in several regions, as we will show here.
In this paper we report the development of a new global natural vegetation map, for use initially in climate studies with the CPTEC/COLA AGCM and with the CPTEC Potential Vegetation Model (Oyama and Nobre 2004). This new map builds on two global natural vegetation maps, which have been widely used by the scientific community, and in more than 100 regional land cover maps. To better represent the tropical biomes, we add a new vegetation type to DS89 classification and present its biogeophysical parameters in the context of SSiB land surface model (Xue et al. 1991). This new classification is hereafter called LONS08. A detailed account of the new map development is given in the following sections.

IMPROVEMENT OF DS89 CLASSIFICATION
The Küchler classification, on which DS89 is based, considers primarily physiognomic features of the vegetation. In the Küchler classification, vegetation types are described by a combination of letters and numbers that resembles the Köppen climate classification (Küchler 1988). For instance, D7 refers to broadleaf deciduous forest (D) with a 20-35 m tall canopy (7). The DS89 vegetation classification simply grouped the 32 main Küchler surface types into 12 major classes (see DS89, p. 838, for further details).
The DS89 classification considers two forest types for the tropics: broadleaf-evergreen trees (type 1) and broadleaf-deciduous trees (type 2). From the DS89 map of natural vegetation (DS89, p. 839), it is clear that type 2 refers to tropical deciduous and semi-deciduous forests (such as in India) as well as to temperate deciduous forest (such as in North America and Europe). It is known that there are many differences between these two forest types, ranging from climate characteristics to the identity of the plant community per se. For example, the oak (Quercus sp.) is a typical tree of temperate deciduous forests, but it never occurs in tropical forests. Moreover, in the winter season of temperate regions, there is a marked reduction of sunlight, while in the tropics the incidence of light is practically the same throughout the year, which illustrates the difference in the very nature of deciduousness in tropical and temperate forests. While in middle latitudes all deciduous trees lose their leaves in a particular season, in the tropics some trees lose in an irregular basis, which is not closely related to temperature or light incidence rhythms, but mostly to water availability. For this reason, tropical trees presenting some degree of deciduousness have been named as "tropophyllous" plants (Küchler 1988). Thus, considering DS89 type 2 as referring strictly to deciduous temperate forests, it would be reasonable to include a new vegetation type in DS89 classification that would represent forests different from the evergreen type (type 1 in DS89) in tropical regions.
According to the dry season length and the amount of trees which lose their leaves, tropical forest formations can be classified in: (i) evergreen, (ii) seasonal semi-deciduous and (iii) seasonal deciduous (Eyre 1968, Barnes et al. 1998). In evergreen forests there is no occurrence of dry season (monthly precipitation always exceeds 100 mm) and, due to this, phytocoenoses as a whole practically do not present deciduousness, standing permanently green (Eyre 1968, Odum 1983, Barnes et al. 1998, Schultz 2005. This kind of forest is found in Amazonia, South America Atlantic Forest, Equatorial Africa, west coast of India, Sri Lanka and Indo-Malaysia (where both evergreen and seasonal forests are found). This category also encompasses mixed evergreen forests, which occur in mountain regions, where gymnosperm and angiosperms elements are found within the same area (Barnes et al. 1998, Schultz 2005. On the other hand, seasonal forests have a dry period which ranges from 2-3 months in semi-deciduous to 5-6 months in A NEW WORLD NATURAL VEGETATION MAP 3 deciduous forest. Within the dry season, 20 to 50% of trees in semi-deciduous forest, and up to 100% of trees in deciduous forest, lose their leaves. Therefore, seasonal tropical forests (semi-deciduous and deciduous) have a close relation with seasonal pulses of precipitation (Eyre 1968, Odum 1983, Barnes et al. 1998, Schultz 2005. This sort of forest occurs generally in the transition between evergreen forest and savannas: southern limit of Amazonia, inland Atlantic Forest, boundaries of the African rainforest, almost in entire India and in Southeastern Asia (mixed with evergreen forest).
Although semi-deciduous and deciduous tropical forests have some distinct characteristics, they could be conveniently considered as belonging to just a single category. This is justified because, in the tropics, the distribution of deciduous forests is small (Eyre 1968, Barnes et al. 1998, Ramankutty and Foley 1999, and the great majority of seasonal forests fits better, due to climatological and biological reasons, into the semi-deciduous or semi-evergreen category (except in India, where very dry forests with pronounced deciduousness are found) (Eyre 1968, Barnes et al. 1998, Schultz 2005. Furthermore, for the purpose of AGCM simulations, it is desirable to classify vegetation with a minimum possible number of categories, since it is necessary to prescribe a large amount of physical and physiological parameters to each vegetation type (cf. Sellers et al. 1986, Xue et al. 1991. Then, while from now on referring to type 1 (tropical forest) exclusively as tropical evergreen forest, we suggest that the following biome be added to the DS89 classification: Tropical seasonal forest: encompassing semi-deciduous and deciduous tropical forests. It has close resemblance with tropical evergreen forests, but, contrastingly, the plant community phenological pattern is clearly and strongly influenced by precipitation seasonality.
The 11 natural biome types of DS89 plus the new type (tropical seasonal forest, hereafter considered as type 13) compounds LONS08 classification (Table I).

SSiB PARAMETERS FOR TROPICAL SEASONAL FOREST
In the SSiB model (Xue et al. 1991), a set of values representing a large number of biogeophysical parameters is assigned to each biome. For the proposed biome, tropical seasonal forest, the SSiB parameters values are shown in Tables II and III on a  where z 0 is the roughness length (m). These relations were adjusted using the annual average values of c b , c d and z 0 for 5 biomes: tropical evergreen forest (biome 1), temperate forest (2), savanna (6), shrubland ("caatinga", 8) and semi-desert vegetation (9).

CONSENSUS MAP
We used the natural vegetation maps of Ramankutty and Foley (1999, RF99) and Matthews (1983, M83), both at 1 degree horizontal resolution, to obtain a consensus map between them. These two maps (hereafter referred to as base-maps) classify vegetation in very distinct forms: RF99 map classification has 15 categories, while M83 has 32. The classification used in RF99 map is derived from the Olson Global Ecosystems (OGE) framework, which originally has 94 vegetation types (Olson 1994 (UNESCO 1988). In order to obtain these maps under the LONS08 classification, it was necessary to relate RF99 and M83 map classifications to LONS08. It was accomplished in two steps: firstly, the M83 map categories were related to RF99 map categories; and then the RF99 map categories were related to LONS08 types. To find the relation between the M83 and RF99 map classifications, we followed a semi-objective procedure. Consider the biome i of M83 map classification, and N i j as the number of grid points classified as i in M83 map and j in RF99 map. The fraction of grid points classified as biome i in M83 map and j in RF99 map is given by Let J 1 and J 2 be the biomes of RF99 associated respectively with the first and second major fraction value. Then the biome i of M83 map classification was associated to biome j of RF99 when one the following objective criteria was met: F i J 1 ≥ 0, 7 (greater than 70%) [represented by * in Table IV]; or 0, 4 ≤ F i J 1 ≤ 0, 7 (between 40 and 70%) and F i J 2 ≤ 0, 2 (lower than 20%) [represented by ** in Table IV]. If these criteria were not satisfied, biome i of M83 map classification was associated (subjectively) to a RF99 biome (represented by *** in Table IV) based on the biomes definition and/or their geographic distribution. With the procedure here described we grouped the 32 M83 types into the 15 of RF99 (Table IV). The relation between the categories of RF99 map classification and LONS08 types was obtained subjectively, based only on the biomes definition and/or their geographic distribution (Table V). Both base-maps are shown under LONS08 classification in Figures 1a and 1b. These maps show only a "fair" agreement according to the kappa statistics (Monserud and Leemans 1992) (κ = 0.49). Our indirect relation between vegetation types of M83 map classification and LONS08 is similar to that obtained by Sellers et al. (1996).
With the base-maps maps under the same classification (LONS08), we elaborated the consensus map, i.e. a map that shows only the grid points for which both base-maps assign the same biome (Fig. 1c). The consensus areas were mainly found in tropical forest and Relation between M83 and RF99 classifications. A * denotes conversion of M83 biome to one of RF99 when F i J 1 ≥ 0.7; ** when 0.4 ≤ F i J 1 ≤ 0.7 and F i J 2 ≤ 0.2; and *** when it did not satisfied both of the two above criteria, then M83 biome i was subjectively associated with one of RF99 (see text for detailed explanation).
RF99 vegetation type names are found in Table V.

NON-CONSENSUS AREAS
In order to fill the non-consensus areas between RF89 and M83 maps, we consulted more than 100 regional natural vegetation maps available on the internet. Although actual land cover data (e.g. from VGT GLC2000, MODIS Modland, AVHRR DisCover) have a great  potential for supporting studies on natural and anthropogenic vegetation distribution, they would not be of great advantage in the current study. That is because most of the non-consensus areas showed in Figure 1c are found in regions no longer covered by natural vegetation (e.g. United States, Europe and Central Asia). Then, in order to obtain the present new map, we had to rely just on maps accounting for the natural vegetation which once existed in these regions. Regional maps were used at this part of the work (instead of additional global maps), because they provide a more accurate vegetation distribution and a more detailed vegetation description on a regional scale. Most of the maps belong to the Perry Castañeda Library Map Collection at University of Texas (http://www.lib.utexas.edu/maps) where original maps were scanned and made available on the World Wide Web. Other digital maps, not present in that collection, such as the natural vegetation map of Brazil (IBGE 1993) and Alaska (Küchler 1966), were also used. We filled the non-consensus areas separately to lowland (continental areas with elevations < 1000 m) and upland areas (elevations >1000 m). This procedure was adopted to allow for a detailed focus on high elevation areas, due to the difficulty in selecting a representative biome in very steep regions (because of vegetation altitudinal zonation) and the fact that in elevated regions extra-tropical biomes may occur within tropical latitudes. The use of digital maps available on the internet is a low-cost alternative to the use of comprehensive collections of scientific paper based maps. However, it is widely recognized that information available on the internet cannot always be regarded as reliable (e.g., Eysenbach and Diepgen 1998). To minimize the risks of using poor quality information, (i) most of the maps came from reliable internet sites (e.g., University of Texas, USDA, IBGE) and (ii) the filling procedure was based on the consensus among several regional maps obtained from independent sources. The constraint of regional consensus not only filters out poor quality maps, but also assures that the new map is not biased towards a particular regional (or global) map.
For a given non-consensus region, we obtained the consensus among at least two regional maps of recognized different sources. Then, we related the LONS08 categories to the vegetation types shown in the regional maps. Since regional maps usually consider more vegetation categories than LONS08, and the categories are often referred to by using local names, we also examined photographs of the vegetation types available on the internet to facilitate the subjective grouping of the regional maps categories. Regions of non-consensus were filled (grid) point by point, according to the consensus among the regional maps.
Table VI presents the main regions of non-consensus (lowlands and uplands), what the base-maps allocates in each region, which base-map resembles better the regional maps consensus, and some remarks on how these areas were filled.

MINOR CONSENSUS MODIFICATION
We verified from the regional maps consensus that there were consensus areas between the base-maps which did not correspond to the best representation (i.e. the consensus between RF99 and M83 did not correspond to the regional consensus). These areas were changed according to the regional consensus.
South America Atlantic Forest: some grid points with tropical seasonal forests were changed to savanna and vice versa.
Eastern North America and Eastern Europe: some temperate forest areas were changed to mixed forest (see the region on Table VI).
Central Russia: some mixed forest areas were changed to boreal seasonal forest (see the region on Table VI).

DISCUSSION
The new global natural vegetation map is shown in Figure 2. Unlike the RF99, this new map must be closer to represent the world's natural vegetation in preagricultural times, since the majority of the regional maps used display natural primary (instead of potential secondary) vegetation. Both base-maps presented a similar level of agreement with the new map (RF99 × new map: κ = 0.60; M83 × new map: κ = 0.58) meaning that no base-map could be regarded as the most representative of the regional consensus.
The biomes distribution in some regions could be a subject of concern, in view of the fact that LONS08 is restricted only to 13 phytophysiognomic types. For instance, shrubland (caatinga, type 8) was found in extratropical regions, such as in Spain and Australia; savanna (type 6), in Southern Africa; and tropical seasonal forest (type 13), in Northeast Argentina. Thus, from a climatic point of view, an inconsistency could be noticed: the occurrence of "tropical" types (caatinga-type shrubland, savanna and tropical seasonal forest) in extratropical latitudes. This climatic inconsistency is due to deficiencies in the LONS08 vegetation classification. For the tropics, LONS08 has biomes of arboreal (tropical forests), arboreal-grassy (savanna) and arbustivegrassy (caatinga-type shrubland) physiognomies. For extra-tropical regions, there are only biomes with arboreal (temperate forests) and grassy (grasslands) physiognomies, what reveals the lack of an "extra-tropical savanna or shrubland". This could be a new vegetation type to be included in LONS08 (in Spain, this new type would be more appropriate than tropical caatinga-type shrubland or grasslands). This classification inconsistency and thus the incorrect representation of land surface properties have the potential to generate errors in the output of climate models, although the new type cited above would cover a relatively small area (ca. 200 grid points).
The new global natural vegetation map developed in this study may be regarded as an advancement for the representation of tropical vegetation in comparison to DS89 classification, after the inclusion of the tropical seasonal forest type. The lack of consensus among the basemaps revealed that these maps, which have been widely used in climate studies, diverge considerably (more than 30% of the continental grid points were non-consensus areas).
Recent trends in climate modeling and land surface processes studies point to future models running at higher resolutions and relying on more than a dozen vegetation cover types. However, presently in most global climate change studies, for instance in IPCC AR4 (IPCC 2007), the majority of GCMs used run at coarse horizontal resolution (about 2.5 degrees; the highest resolution is 1.4 degrees in the CCSM3-NCAR GCM). Additionally, the inclusion of more vegetation categories demands more effort on field measurement of vegetation properties in order to feed GCMs with calibrated values of biophysical parameters. These considerations justify the construction of the present map on 1 degree horizontal resolution and containing a restricted set of vegetation categories, although we recognize the trend of increasing model resolution in global climate models, which will drive the need of defining more vegetation classes. The simple methodology described in here can be used to The main non-consensus regions between M83 and RF99 maps. In the first column, L refers to lowland regions (elevation < 1000 m), and U to upland regions (elevation > 1000 m). Caatinga refers to tropicl shrubland (type 8 of LONS08). part of the region produce vegetation maps with higher spatial resolution and with more vegetation classes. Currently, a 0.5 degree resolution natural vegetation map (partially based on the 1 degree map constructed here) has been developed to be used in regional climate model simulations (L.F. Salazar, personal communication). The use of this new vegetation map, just like the base-maps, is not restricted to be used as reference map for AGCM, but can also be useful (as reference map) for global land cover change studies [e.g. evaluation of historical land use changes (similar to RF99)] and for the assessment of global climate change impacts on vegetation distribution (e.g. Salazar et al. 2007). The improved representation of vegetation distribution may provide a more accurate rendition of large-scale biome-climate interactions.

ACKNOWLEDGMENTS
This work is part of the first author's MSc thesis under the guidance of the second and third authors. We wish to thank Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) for the financial support (DML Proc. no. 04/12235-3) and three anonymous reviewers for helpful comments on this manuscript.