DNA barcodes for soil animal taxonomy

The biodiversity of soil communities remains very poorly known and understood. Soil biological sciences are strongly affected by the taxonomic crisis, and most groups of animals in that biota suffer from a strong taxonomic impediment. The objective of this work was to investigate how DNA barcoding – a novel method using a microgenomic tag for species identifi cation and discrimination – permits better evaluation of the taxonomy of soil biota. A total of 1,152 barcode sequences were analyzed for two major groups of animals, collembolans and earthworms, which presented broad taxonomic and geographic sampling. Besides strongly refl ecting the taxonomic impediment for both groups, with a large number of species-level divergent lineages remaining unnamed so far, the results also highlight a high level (15%) of cryptic diversity within known species of both earthworms and collembolans. These results are supportive of recent local studies using a similar approach. Within an impeded taxonomic system for soil animals, DNA-assisted identifi cation tools can facilitate and improve biodiversity exploration and description. DNA-barcoding campaigns are rapidly developing in soil animals and the community of soil biologists is urged to embrace these methods.


Introduction
Soil animal communities comprise one of the largest biodiversity reservoirs in terrestrial biota (Decaëns et al., 2006).They are part of highly complex ecosystems, the organization and composition of which remain far from being understood or even approximated from fi nest to broadest scales.While that state of the art has been repeatedly stressed in the literature during the past two decades (André et al., 1994(André et al., , 2001)), Dance (2008) reports that the situation in the fi eld is quite unchanged.Soil is populated by probably millions of species whose discrimination remains an unfaced challenge, whether dealing with microbial diversity (Roesch et al., 2007;Fulthorpe et al., 2008) or with organisms of the micro to megafauna (Decaëns et al., 2008), with likely some degree of correlation between knowledge and animal size (Decaëns et al., 2006;Stork et al., 2008).
In this paper, a synthetic review of DNA barcoding in the context of the taxonomic crisis is provided, and preliminary results that assess the effi ciency and reliability of the DNA barcoding approach are presented to document and explore biodiversity within two groups of soil animals: earthworms, as part of the soil megafauna, broadly used models and supposedly relatively well known taxonomically; and collembolans, as part of the mesofauna, an omnipresent group of organisms suffering a strong taxonomic impediment.Although these are early days for large-scale DNA-barcoding campaigns, the aim of targeting these two groups of soil animals is to estimate the global applicability of the method and assess its potential for transcending the fi nal frontier of soil biodiversity.

The taxonomy crisis
Taxonomy is a science in crisis (Godfray, 2002;Hopkins & Freckleton, 2002;Wilson, 2003;Agnarsson & Kuntner, 2007).Despite its status of "fundamental discipline" (Wilson, 2004) with a cascade of derivative functions, taxonomy is one of the weakest and most underfunded biological disciplines (Wilson, 2003).The few encouraging major initiatives (e.g. the Global Taxonomy Initiative launched after the adoption of the Convention on Biological Diversity -CBD; the National Science Foundation's Partnerships for Enhancing Expertise in Taxonomy -NSF-PEET, USA; the European Distributed Institute of Taxonomy -EDIT; and a number of other biodiversity-oriented projects) are facing a dramatic, so far irreversible, general trend of ever-growing defi cit of taxonomists coupled with a strong imbalance between the species diversity of large taxonomic groups and the effective taxonomists for these particular groups (Brown & Lomolino, 1998).Although the distribution of taxonomic work between the professional and amateur communities is certainly disparate among groups and remains diffi cult to evaluate, Hopkins (2002) showed that the effectiveness of both communities tends to decrease more dramatically for professional scientists, whose proportion as authors of taxonomic papers also diminishes.An even more critical concern is the legacy of the knowledge and expertise of taxonomists which, after what is frequently a life-time work devoted to a group of organisms, is never fully transmitted to the next generation of either taxonomists or other users of taxonomy.At best, a number of publications, taxonomic descriptions, revisions and identifi cation keys are available -more or less accessible to non-expert users throughout the world -but, as a general rule, it is a large part of a yet unpublished expertise and unique ability to identify species that simply vanish.In addition, the general consideration and understanding of taxonomists' work by the public and policy makers is part of the current crisis: biologists still have to convince and educate about the interest and value of documenting biodiversity, of becoming "bioliterate" (Janzen, 2004;Janzen et al., 2005).Only by doing so can taxonomy remain on the "front seat" of environmental policies (Samper, 2004), evolve as a dynamic fi eld of biological sciences and thus be given the means for its survival and development.Different authors have stressed the need for a mutation, a "renaissance" of the fi eld (Godfray, 2002;Wheeler, 2004;Agnarsson & Kuntner, 2007).In particular, it has been urged that taxonomists embrace the use of new technologies to ease access to various sets of information relevant to taxonomy: taxonomy should move on the web, and its instances and rules should rapidly adapt to new ways of disseminating and producing taxonomic information (Godfray, 2002;Wilson, 2004;International Commission on Zoological Nomenclature, 2008).There exist here, at least in part, solutions that address the expertise legacy issue.Also, by facilitating the work of taxonomists, one could expect acceleration in the pace of biodiversity exploration and description, a crucial point at a time where we may have documented as little as 10% of our planet's biodiversity during the last 250 years.

About the taxonomic impediment
The existence of a taxonomic impediment (Godfray, 2002;Samper, 2004;Wheeler et al., 2004) or "taxonomy deficit" (Blaxter, 2004) has been widely acknowledged.It is sometimes mentioned as a characteristic of some groups whose diversity remains in a very large part unknown and as a consequence undescribed.In a broader sense, as stated on the CBD website (Global Taxonomy Initiative, 2009), the taxonomic impediment consists of "the knowledge gaps in our taxonomic system (including those associated with genetic systems), the shortage of trained taxonomists and curators, and the impact these deficiences have on our ability to conserve, use and share the benefits of our biological diversity."The extent of this impediment results in prejudices that are highly diverse and affect us at large.For instance, the choice of model organisms is dramatically unbalanced towards the selection of the best-known taxa.Conservation strategies are frequently driven by the use of umbrella or flagship species as surrogates for a more diverse set of poorly known organisms (Andelman & Fagan, 2000).According to values in the 2003 IUCN (International Union for Conservation of Nature) Red List of Threatened Species as reported by McNeely (2006), only 768 (0.08%) insect species were evaluated, of which 552 (72%) were considered as threatened, that is only 0.06% of described species.In birds and mammals, 100 and 98.9% of the described species were evaluated, and 12 and 24% were considered threatened, respectively.As palliatives to the taxonomic impediment, higher taxa have also been proposed as surrogates (Mandelik et al., 2007) using coarse taxonomic scales, as well as recourse to parataxonomy and the identification of organisms down to "recognizable taxonomic units", also known as morphospecies or morphotypes (Krell, 2004).Both methods have serious pitfalls (Krell, 2004), unpredictable error rate and hardly reproducible results.Additionally, in groups such as earthworms, which are expected to be relatively taxonomically well known, at least in temperate countries where they are much used in various kinds of soil studies (e.g.ecology, ecotoxicology, bioindicators), a striking and unexpected level of cryptic diversity was recently evidenced through molecular data (King et al., 2008;Richard, 2008) (see also the Results section below).This jeopardizes the correct identification of target organisms and thus the reliability, comparability and reproducibility of results.Bad taxonomy -"identification, classification, and nomenclature of organisms without following the appropriate procedures and rules that specialist taxonomists define" -either due to taxonomic shortcuts or overlooked cryptic species, affects soil sciences at large, with a significant and a yet unevaluated risk of error cascades (Bortolus, 2008).

DNA barcoding -a standardized identification method for animals
A 658-bp fragment of the mitochondrial gene cytochrome c oxidase I (COI) has been proposed as a standard barcode for animal species (Hebert et al., 2003).The principle is that the bimodality of intra and interspecifi c genetic variations will permit the use of this genetic marker as a tag to group conspecifi cs together and distinguish them from other taxa.Coupled with a database associating genetic tags with taxon names, this system provides a reliable and straightforward identifi cation of an unknown specimen (Figure 1).The advantages are numerous: 1, it is a testable and reproducible system, as a link between any barcode and a voucher specimen is maintained; 2, it is fast (the sequencing process from tissue sampling to sequence upload in the database currently approximates seven hours, but recent advances may reduce that time to two hours only) and accessible in virtually any place where sequencing facilities exist; 3, it is cheap (the cost per sample in a high-throughput facility approximates 5 USD and is constantly being reduced); 4, it can be used for the smallest animals for which special voucher preservation methods are used (Rowley et al., 2007); 5, it works for any life stage and 6, for any kind of organic tissue types.The simplicity of the method opens outstanding perspectives, such as the miniaturization of the process likely to become handheld and thus fi eld-accessible within the next few decades; also, direct coupling of DNA barcoding and environmental sequencing through massively parallel pyrosequencing can potentially revolutionize access to local biodiversity information.On the other hand, as one could expect for a newly emergent and fast-developing approach, several potential drawbacks and pitfalls of DNA barcoding have been heavily discussed in literature within the past fi ve years, ranging from methodological and scientifi c to societal concerns (Will & Rubinoff, 2004;Rubinoff & Holland, 2005;Will et al., 2005;Cameron et al., 2006;Meier et al., 2006;Elias et al., 2007;Larson, 2007;Wiemers & Fiedler, 2007;Song et al., 2008;Trewick, 2008).Of all criticisms, the most redundant is that the use of Figure 1.Identifi cation pathway of the query sequence from an unknown specimen by comparison to DNA barcodes present in the reference library.Records in the library are associated to specimen data, including a species name assigned by an expert taxonomist according to his conception of the species by reference to a type specimen usually outside the library.Taxonomists can be relieved from redundant standard identifi cation of unknown specimens.a single mitochondrial marker and an arbitrary level of divergence cannot be used to discriminate species because of potential false negatives (identical DNA sequences can be found in two actually different species if the divergence time was too short for the fi xation of substitutions, or because of gene introgression), and potential false positives (individuals belonging to the same species may have highly divergent DNA sequences because of ancestral polymorphism or, again, genetic introgression).To overcome these pitfalls, it has then been repeatedly claimed that DNA barcodes should be used in an integrative framework, that is, associated to other sets of data, such as morphology or additional nuclear genetic marker, as well as ecological, ethological or biogeographical features.Interestingly, such data are almost always intimately associated to DNA barcodes from the very beginning of the construction of the reference library (the Barcode of Life Data Systems -BOLD) (Ratnasingham & Hebert, 2007) for each record that associates a DNA barcode with a taxon name; indeed taxon names are generally hypotheses based on traditionally used characters and thus represent an "in-kind" integrated data complement summarizing the taxonomic expertise available for a particular specimen.As far as no confl ict exists between these two sets of information, DNA barcodes then represent reliable identifi cation tags that can be used without further combination of different sources of evidence.Nevertheless, for groups with a strong taxonomic impediment and no available expertise, DNA barcoding may represent the only available source of discriminating data.It should then be used with the necessary caution caused by the pitfalls of the use of a single marker (King et al., 2008), and Molecular Operational Taxonomic Units (MOTUs) should be preferred -at least provisionally -as surrogates to species in poorly known groups (Blaxter, 2004).Overall, the utility of DNA barcoding for species identifi cation and of DNA barcodes as an additional set of data for alpha-taxonomic works has now been revealed in a broad range of taxonomic groups of vertebrates: birds (Hebert et al., 2004), bats (Clare et al., 2007), mammals (Borisenko et al., 2007;Dalebout et al., 2007); and invertebrates: nematodes (Bhadury et al., 2006); earthworms (Huang et al., 2007;Chang et al., 2009); butterfl ies (Burns et al., 2008); moths (Decaëns et al., 2008;Vaglia et al., 2008); ants (Fisher & Smith, 2008); mayfl ies (Webb et al., 2007); caddisfl ies (Zhou et al., 2007).Outcomes of these works range from biotic surveys to re-evaluation of species complexes, unveiling of cryptic species, new synonyms, etc.; some include formal taxonomic acts (Webb et al., 2007;Burns et al., 2008;Decaëns et al., 2008;Fisher & Smith, 2008;Vaglia et al., 2008) based on combined evidence from genetic and traditional approaches.

Building the reference library
Although DNA barcoding studies fl ourish throughout the world, the results are collectively compiled in a central integrative bioinformatics platform -BOLD (Barcode of Life Data Systems, 2009) -that supports all phases of the analytical pathway, from specimen collection to tightly validated barcode library (Ratnasingham & Hebert, 2007), and can also accommodate externally produced sequences, either through direct submission or regular incorporation of GenBank sequences.The key role of taxonomists in the assemblage and curation of the reference library is strongly stressed here, and taxonomists are encouraged to participate in it.Taxonomists have a unique ability to scrutinize the results of the barcode analyses for their group of interest and, when necessary, to ensure their integration within the classical workfl ow of new species descriptions and taxonomic revisions (Fisher & Smith, 2008;Vaglia et al., 2008).Such association enables reliable identifi cation through DNA barcodes by associating this genetic tag with its homologue in the library and with the attached taxonomic hypotheses referring to a species concept and, more practically, to a name incarnated by the type specimen (Figure 1).Quite interestingly, in such framework the DNA sequences represent surrogates to the taxonomic expertise and will ensure the legacy of the integrated expertise independently of further input from taxonomists.This is viewed as a very signifi cant improvement to counterbalance the dramatic loss of expertise when taxonomists stop their activity.Moreover, this does not question the need to develop taxonomy for all these poorly known groups, and emphasizes the primary role that taxonomists should have in further documenting and describing the large unknown part of our planet's biodiversity.While the use of DNA barcodes will likely reduce the taxonomist standard-specimen identifi cation task, an often redundant one, taxonomists will also benefi t from DNA sequences for their revisionary work Pesq. agropec. bras., Brasília, v.44, n.8, p.789-801, ago. 2009 Barcodes (but also possibly other genes, since access to them will be facilitated in having at hand the DNA extracts produced and archived during the barcoding process) represent a new set of characters to address taxonomic issues.They can also be used as a screening tool to detect specimens of special interest, and focus on them for further morphological or ecological investigation.

DNA barcoding for earthworms and collembolans
A large sampling campaign was recently initiated for both collembolans and earthworms in order to further evaluate the potential of DNA barcoding in these groups.The method had already been tested for collembolans in the early time of its development for a limited set of arctic taxa (Hogg & Hebert, 2004).The results revealed unambiguous and reliable discrimination of species through barcode data.In earthworms, Huang et al. (2007) were the fi rst to envision the broader application of DNA barcoding for species identifi cation and discrimination.Their work was recently completed and further examined through broader taxonomic sampling (Chang et al., 2009), confi rming a general pattern of deep interspecifi c divergences opposed to shallow interspecifi c variation of the barcode fragment in Southeast Asian species of earthworms.King et al. (2008), though not explicitly referring to DNA barcoding, used the same gene region to uncover a striking level of intraspecifi c divergence and cryptic diversity among a set of common and well-known species of British lumbricid earthworms.In the following part of this paper, the general pattern of variation of COI barcodes in collembolans and earthworms will be further examined by enlarging both the taxonomic and geographic coverage for these groups.

Materials and Methods
All the specimens sampled and analyzed are part of the DNA-barcoding campaigns initiated for collembolans and earthworms at the Biodiversity Institute of Ontario.All samples were processed at the Canadian Centre for DNA Barcoding (CCDB).DNA was extracted from small pieces of muscular tissue for earthworms and from whole specimens for collembolans.A specifi c protocol was developed for the latter, so that voucher specimens could be recollected after the lysis step and thus be used for further morphological examination, if necessary.
DNA extraction followed a routine silica-based 96-well extraction automation protocol (Ivanova et al., 2006).The 658 bp region of COI proposed for use as a DNA barcode (Hebert et al., 2003) was amplifi ed with the M13-tailed primers LCO1490 and HCO2198 (Folmer et al., 1994).PCR amplifi cations were performed according to the standard PCR reaction protocol used in CCDB (Hajibabaei et al., 2005) and were checked on a 2% E-gel 96 Agarose (Invitrogen).Unpurifi ed PCR products were bidirectionally sequenced using M13F and M13R primers; sequencing reactions followed CCDB protocols (Hajibabaei et al., 2005), with products subsequently purifi ed using Agencourt CleanSEQ protocol (Agencourt, Beverly, MA, USA).The samples that did not yield sequences after this standard procedure were selectively re-amplifi ed with the primer pairs LepF1/MLepR1 and mLepF1/LepR1 (Hajibabaei et al., 2006) targeting shorter DNA fragments.Sequencing reactions were then conducted using LepF1 and LepR1 primers, producing a unidirectional reading except for the 55 bp overlap between the two fragments.The sequences were managed in SeqScape version 2.1.1 (Applied Biosystems, Foster City, CA, USA) and Sequencher 4.5 (Gene Code Corporation, Ann Arbor, MI, USA) and aligned using BioEdit version 7.0.5.3 (Hall, 1999) and MEGA4 (Tamura et al., 2007).Regularly updated protocols used at the CCDB can be found at: http:// www.dnabarcoding.ca/pa/ge/research/protocols.Distance analyses were performed using MEGA4.

Results and Discussion
As the objective of this paper is to provide a global overview of how DNA barcodes behave when considered in a taxonomic and geographic sampling broader than what has been reported to date, we do not address in depth the results of our analyses.More detailed studies are in preparation, addressing taxonomic questions more thoroughly and considering additional independent datasets.The 1,152 sequences obtained for both studied groups were analyzed separately; they are deposited in GenBank with the following accession numbers: GU013791 to GU014232, FJ937285, FJ937287, FJ937289, FJ937295 to FJ937300, FJ937308 and FJ937315 to FJ937319 for earthworms, and GQ373428 to GQ374122 for Collembolans.In earthworms, a total of 457 sequences were included in the analysis, representing 49 genera within 8 families (Figure 2).The geographic coverage includes North America (USA), South America (Brazil, Paraguay), the Caribbean islands, Europe (Andorra, Cyprus, France, Hungary, Romania, Spain), the Middle East (Israel), Southeast Asia (Thailand, Philippines), and Australia.The dataset encompasses a number of unidentifi ed species (mostly those samples from the Philippines and Brazil) and 87 species identifi ed using morphological characters.The total number of genetic clusters is reported (Table 1) using different threshold levels.It varies from 211 using a 4% divergence threshold to 192 with a conservative divergence value of 10%.Overall, if considering clusters separated by 10% or more sequence divergence, then 90.1% of them reveal Figure 2. Neighbour-joining tree based on K2P distances from 457 DNA barcode sequences of earthworms.Circles refer to species which are: unidentifi ed (white), identifi ed (black), or identifi ed with intraspecifi c divergence >10% (grey).Pesq. agropec. bras., Brasília, v.44, n.8, p.789-801, ago. 2009 less than 4% intra-cluster divergence.Also, at that very conservative level of interspecifi c divergence, 13 of the 87 identifi ed species do split in two or more clusters, revealing potential cases of cryptic diversity.Overall, as apparent from the tree representation of genetic distances (Figure 2), the pattern of DNA barcode variation is very similar to the one reported by Chang et al. (2009) for a more limited regional dataset; indeed, the results obtained clearly show a dominance of deep interspecifi c divergences and limited intraspecifi c variability.
In collembolans, 695 sequences belonging to 88 genera in 16 families and including samples from Europe (France, Spain, Slovenia and Romania), North Africa (Algeria), North-America (USA), Southeast Asia (Vietnam, Indonesia), Pacifi c islands (Vanuatu) and Far East Russia were analyzed.Because of the diffi cult and poorly known taxonomy of these organisms, only 44 species have been formally identifi ed using morphological characters so far.The genetic clusters are overall very clearly defi ned (Figure 3).Table 1 summarizes the number of these clusters for different distance thresholds.Interestingly, in the very conservative hypothesis that the 215 observed genetic clusters distinct by 10% or more from others do represent actual species -though such hypothesis requires additional data to be formally accepted -, then 94.4% of them have less than 4% intraspecifi c divergence.Such pattern, as clearly apparent from the topology of the distance tree (Figure 3), is typical of a bimodal distribution of intra-versus interspecifi c divergences such as the one also reported in earthworms.On the other hand, out of the 44 identifi ed species, 7 displayed an intraspecifi c split superior to 10% and are likely to represent cases of cryptic diversity.
The geographically and taxonomically extended sampling presented in this paper is strongly congruent with the patterns preliminarily documented in earthworms (Huang et al., 2007;King et al., 2008;Chang et al., 2009) and collembolans (Carapelli et al., 1995;Hogg & Hebert, 2004) at local scales.Both groups show a highly structured variation of the barcode sequences which, in the hypothesis that clusters represent distinct species, indicates a high interspecifi c barcode divergence opposed to shallow intraspecifi c variation (Figures 1 and 2).The dataset used appears quite representative of the relative taxonomic impediment of each group.Indeed, 44 collembolan species (20.4% of the total number of >10% divergent clusters) were identifi ed using only morphology, and the same is valid for 87 earthworms species (45.3%).Quite interestingly however, DNA barcode analysis revealed a similar level of potential cryptic diversity, with about one for six identifi ed species having one or more divisions exceeding 10% divergence.At that point, the lack of taxonomic data unfortunately precludes us from safely interpreting these "species-level" divergences and from linking the observed genetic clusters to actual species.Further investigation based on additional independent evidence is needed to address both the exact status of all these unnamed clusters and the striking cryptic diversity revealed within known species.In earthworms, a very close match was found between traditional taxonomic identifi cations of closely related species and barcode clusters for Taiwanese species, suggesting that these genetic data would reliably identify species and point out cases of cryptic diversity (Chang et al., 2009).Facing that striking pattern of deep "species-level" divergences within British lumbricid earthworms, King et al. (2008) found a global support of AFLP results to all but one case, suggesting that overlooked sibling species would account for a large part of the observed pattern.The use of nuclear genes was shown to be a potentially very effi cient complement to mtDNA (Smith et al., 2008), but thus far results remain inconclusive for earthworms (King et al., 2008) and collembolans where it has to be further explored and developed.Much is to be expected, however, from the re-evaluation of morphological features once DNA barcodes have unveiled putative cases of cryptic diversity.For instance, coloration and size, two sets of characters reluctantly used in these groups because of their individual variation, and problems related to specimen preservation after collecting may be far more informative than previously assumed (Chang & Chen, 2005;Chang et al., 2007;Lowe & Butt, 2008).In earthworms, breeding experiments (Lowe & Butt, 2008) also represent an excellent alternative to test for lineage isolation and complement genetic data; it may indeed be expected that large genetic divergences would be accompanied by some level of genetic incompatibility.An excellent example of how the combination of genetic data and breeding experiments could successfully complement each other is the evidence brought that Eisenia fetida and E. andrei are two distinct species (Domínguez et al., 2005;Perez-Losada et al., 2005), thus confi rming an earlier congruent conclusion by Jaenike (1982).
Overall, the observed pattern of DNA barcode variation is very favorable to the use of this marker as a species tag, providing a unique and unequaled way of identifying organisms that otherwise require a high and scarcely available level of expertise.However, the very strong taxonomic impediment of both collembolans and earthworms is obvious, and characterized both by a high level of cryptic diversity within already known species and by a high proportion of taxa that are currently undescribed or need thorough taxonomic scrutiny and investigation to be properly linked to an existing name.Actually, the latter task remains very challenging because of the way type specimens of soil animals are usually preserved.Indeed, types are often very old and preserved (or prepared) using DNA-unfriendly methods (e.g.formaldehyde) that impede their easy use for integrative studies combining genetic and traditional data.Specifi c protocols and approaches (Herniou et al., 1998;Hajibabaei et al., 2006;Skage & Schander, 2007;Zimmermann et al., 2008) may prove useful in that prospect of the use of type specimens.In collembolans, the situation is probably worse since most types have been prepared on microscopy slide after clearing preparations that defi nitely destroy DNA molecules.In any case, the use of DNA to re-evaluate diagnostic morphological characters may allow a robust assignment of recently collected specimens to an ancient type using morphology, or alternatively, in these groups where overlooked cryptic species may be very frequent, the recourse to designation of neotypes for which DNA data are available might be the safest and most effi cient way to proceed.
Interestingly, the use of DNA barcoding in soil animal studies both strongly emphasizes the measure of the taxonomic defi cit, evidencing a huge and challenging knowledge gap, and comes as a solution to address it.Becoming more and more accessible, DNA data are expected to support soil biologists in species identifi cations, stimulate accurate soil biodiversity surveys or any ecological research based on species lists, and help soil systematists to solve taxonomic as well as phylogenetic problems.The benefi ts of adopting such methods are immediate and the statement stressing that "now is the time" concurs with Janzen (2004).DNA barcoding campaigns are already running, some of them global and directly addressing soil animals; soil zoologists -taxonomists in particular -are urged to join efforts, benefi ting from these new data and contributing to build the reference libraries necessary to document soil biodiversity.While facilitating and accelerating the reduction of the taxonomic impediment in a number of groups, the fi rst benefi ts that would be gained are the repeatability and refutability of studies through objective assessment of the identity of the organisms studied, at last providing the long-claimed "taxonomic affi davit" (Por, 2007) toward reducing the "species uncertainty" (Hey et al., 2003) in soil sciences.Whereas priority should be given to develop taxonomy itself for groups suffering the most of that taxonomic impediment, the community of soil biologists is urged to systematically preserve vouchers of their study material to make them accessible for further study; these vouchers should ideally be preserved in a DNA-friendly way in order to allow the use of genetic methods for species identifi cation, phylogeography, etc. Misidentifi cations will then be traceable, and error cascades could be stopped and tracked back after discovery of the original error.The need for expert taxonomists is overwhelming in most groups of soil animals, but placing the role of as a central feature of ongoing and future DNA barcoding projects -as for instance in the rapidly developing earthworm and collembolan campaignswill eventually benefi t the community of taxonomists.This will help in raising funds, in attracting students and developing training, thus ensuring the legacy of expertise.Nevertheless, given the extent of the taxonomic impediment, it is likely that soil biologists at large will have to embrace, at least temporarily for a number of taxa, the use of MOTUs (Blaxter, 2004;Blaxter et al., 2005); see for instance the use of genetic data as proxy for phylogenetic diversity (PD) estimations (Faith, 2008).Though not ideal, this system has no alternative if to address these groups with a huge taxonomic defi cit.In nematodes for instance, 26,000 species have been described, but estimates of actual diversity range from 40,000 to 100 million species (Blaxter et al., 1998;Blaxter, 2004).Also, by being developed as collaborative and decentralized efforts involving expert taxonomists all over the world, the DNA barcoding campaigns can contribute in redistributing wealth from developed countries, where most of the funding currently originates, to developing countries, where most of the species live and where most of the zoologists need to be trained and employed.The anticipated rapid progress and enhancement of biodiversity exploration and description will offer the opportunity to develop in-depth knowledge in a much wider range of taxa, possibly demonstrating the generality of ecological theories and patterns, increasing the probability to discover novelties or to reassess some previously erroneously or incompletely addressed questions (Wilson et al., 2007).The cost-effectiveness of biodiversity surveys for soil animals will be greatly improved (Janzen et al., 2005;Smith et al., 2006Smith et al., , 2008;;Gardner et al., 2008).With little doubt, by coupling DNA barcoding and their research, soil biologists will strongly reduce the effect of the taxonomic impediment and will start transcending the last frontier of soil zoology.

Conclusions
1.The observed pattern of DNA barcode variation is very favorable to the use of this marker as a species identifi cation and discrimination tag.
2. Both collembolans and earthworms share a strong taxonomic impediment characterized both by a high level of cryptic diversity within already known species, and by a high proportion of taxa currently undescribed or needing further taxonomic scrutiny to be correctly identifi ed.
3. DNA barcoding strongly emphasizes the taxonomic defi cit in collembolans and earthworms, and comes as a promising solution to address it.
4. Adopting DNA barcoding enhances the accuracy of scientifi c studies and more generally benefi ts the community of soil biologists.

Table 1 .
Number of barcode clusters in earthworms and collembolans when considering different divergence thresholds; values in brackets report the percentage of >10% divergent clusters showing an internal divergence inferior to the given threshold.