Acessibilidade / Reportar erro

Transcriptomic markers in pediatric septic shock prognosis: an integrative analysis of gene expression profiles

Abstract

The goal of this study was to identify potential transcriptomic markers in pediatric septic shock prognosis by an integrative analysis of multiple public microarray datasets. Using the R software and bioconductor packages, we performed a statistical analysis to identify differentially expressed (DE) genes in pediatric septic shock non-survivors, and further performed functional interpretation (enrichment analysis and co-expression network construction) and classification quality evaluation of the DE genes identified. Four microarray datasets (3 training datasets and 1 testing dataset, 252 pediatric patients with septic shock in total) were collected for the integrative analysis. A total of 32 DE genes (18 upregulated genes; 14 downregulated genes) were identified in pediatric septic shock non-survivors. Enrichment analysis revealed that those DE genes were strongly associated with acute inflammatory response to antigenic stimulus, response to yeast, and defense response to bacterium. A support vector machine classifier (non-survivors vs survivors) was also trained based on DE genes. In conclusion, the DE genes identified in this study are suggested as candidate transcriptomic markers for pediatric septic shock prognosis and provide novel insights into the progression of pediatric septic shock.

Pediatric; Septic shock; Transcriptomic markers; Co-expression network; Prognosis


Introduction

Sepsis is the world's leading cause of death of children, and it represents a complex disease with dysregulated inflammatory responses and a high mortality rate (11 Kissoon N, Carapetis J. Pediatric sepsis in the developing world. J Infect 2015; 711: S21–S26, doi: 10.1016/j.jinf.2015.04.016.
https://doi.org/10.1016/j.jinf.2015.04.0...
). An immune response initiated through an invading pathogen cannot be controlled to restore homeostasis in sepsis, which generates a pathological syndrome with sustained excessive inflammation as well as immune suppression (22 van der Poll T, van de Veerdonk FL, Scicluna BP, Netea MG. The immunopathology of sepsis and potential therapeutic targets. Nat Rev Immunol 2017; 17: 407–420, doi: 10.1038/nri.2017.36.
https://doi.org/10.1038/nri.2017.36...
). Toll-like receptors, nuclear factor κB, and cytokines including tumor necrosis factor α, interleukins 1, 2, 6, 8, and neutrophils all take part in the generation of sepsis (33 Cecconi M, Evans L, Levy M, Rhodes A. Sepsis and septic shock. Lancet 2018; 392: 75–87, doi: 10.1016/S0140-6736(18)30696-2.
https://doi.org/10.1016/S0140-6736(18)30...
). Although there are common points between adult and pediatric sepsis, crucial differences do exist in pathophysiology, clinical presentation, as well as therapeutic approaches (44 Emr BM, Alcamo AM, Carcillo JA, Aneja RK, Mollen KP. Pediatric sepsis update: how are children different? Surg Infect (Larchmt) 2018; 19: 176–183, doi: 10.1089/sur.2017.316.
https://doi.org/10.1089/sur.2017.316...
).

High-throughput transcriptomic data grow rapidly, enabling gene expression profiling and identification of prognostic targets in disease. Over the last decade, several studies have explored the prognosis of pediatric septic shock by transcriptional profiling using microarrays (55 Cvijanovich N, Shanley TP, Lin R, Allen GL, Thomas NJ, Checchia P, et al. Validating the genomic signature of pediatric septic shock. Physiol Genomics 2008; 34: 127–134, doi: 10.1152/physiolgenomics.00025.2008.
https://doi.org/10.1152/physiolgenomics....
6 Wynn JL, Cvijanovich NZ, Allen GL, Thomas NJ, Freishtat RJ, Anas N, et al. The influence of developmental age on the early transcriptomic response of children with septic shock. Mol Med 2011; 17: 1146–1156, doi: 10.2119/molmed.2011.00169.
https://doi.org/10.2119/molmed.2011.0016...
7 Wong HR, Cvijanovich N, Lin R, Allen GL, Thomas NJ, Willson DF, et al. Identification of pediatric septic shock subclasses based on genome-wide expression profiling. BMC Med 2009; 7: 34, doi: 10.1186/1741-7015-7-34.
https://doi.org/10.1186/1741-7015-7-34...
88 Wong HR, Shanley TP, Sakthivel B, Cvijanovich N, Lin R, Allen GL, et al. Genome-level expression profiles in pediatric septic shock indicate a role for altered zinc homeostasis in poor outcome. Physiol Genomics 2007; 30: 146–155, doi: 10.1152/physiolgenomics.00024.2007.
https://doi.org/10.1152/physiolgenomics....
). Integrative analysis based on multiple transcriptomic datasets could help to discover robust prognostic candidates. Therefore, in this study, the gene expression patterns of pediatric septic shock patients (of survivors and non-survivors) were investigated using public microarray datasets. The differently expressed (DE) genes identified were then further interpreted by enrichment analysis, construction of co-expression networks, and receiver operating characteristic (ROC) curve analysis.

In this study, data pre-processing, identification of DE genes, ROC analysis, and support vector machine (SVM) model training were performed using R software (http://www.r-project.org/) and bioconductor packages (99 Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004; 5: R80, doi: 10.1186/gb-2004-5-10-r80.
https://doi.org/10.1186/gb-2004-5-10-r80...
). Enrichment analysis and construction of co-expression networks were also performed using Database for Annotation, Visualization and Integrated Discovery (DAVID) (1010 Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009; 4: 44–57, doi: 10.1038/nprot.2008.211.
https://doi.org/10.1038/nprot.2008.211...
,1111 Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 2009; 37: 1–13, doi: 10.1093/nar/gkn923.
https://doi.org/10.1093/nar/gkn923...
) and Cytoscape (1212 Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 2003; 13: 2498–2504, doi: 10.1101/gr.1239303.
https://doi.org/10.1101/gr.1239303...
) software, respectively.

Material and Methods

Microarray datasets search and selection

In this study, public microarray datasets were searched from the earliest dataset until Dec 3, 2018 using the keywords “sepsis”/“septic shock” in the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/) (1313 Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, et al. NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res 2013; 41: D991–D995, doi: 10.1093/nar/gks1193.
https://doi.org/10.1093/nar/gks1193...
). Further selection of the datasets was performed for subsequent analyses. The selection criteria were: i) dataset using whole blood for gene expression analysis; ii) dataset with detailed gene expression data for both survivors and non-survivors; and iii) dataset with sample size not fewer than 30. Animal studies and adult studies were excluded.

Data collection from each eligible dataset was performed by two investigators independently. The data consisted of the sample size, GEO accession, sample source, raw gene expression data, and platform used. Final data collection was determined by checking between the two investigators.

Data analysis methods

Four Affymetrix datasets were included in this study (3 datasets for training and 1 dataset for testing). Raw data (CEL files) of the 3 training datasets were downloaded from the GEO database, merged, and pre-processed using the R package “affy”, based on the Robust Multichip Average (RMA) method (1414 Gautier L, Cope L, Bolstad BM, Irizarry RA. affy--analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 2004; 20: 307–315, doi: 10.1093/bioinformatics/btg405.
https://doi.org/10.1093/bioinformatics/b...
). Hybridization probes were then mapped to genes (Entrez IDs) based on the platform table. Probes not mapping to genes and probes mapping to multiple genes were excluded. If two or more probes mapped to the same gene, we calculated the arithmetic mean of the probe values to represent the gene expression. Using the R package “limma” (1515 Smyth GK. limma: Linear Models for Microarray Data. In: Gentleman R, Carey V, Huber W, Irizarry R, Dudoit S, Bioinformatics and Computational Biology Solutions Using R and Bioconductor. New York: Springer; 2005. p 397–420.,1616 Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 2015; 43: e47, doi: 10.1093/nar/gkv007.
https://doi.org/10.1093/nar/gkv007...
), DE genes in septic shock non-survivors compared with survivors were identified based on the following criteria: i) absolute log2 fold change (LFC) greater than 0.8; and ii) false discovery rate (FDR)-adjusted P-value less than 0.05.

We further performed functional interpretation (both gene ontology (GO) analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis) of the DE genes identified by DAVID 6.8 (1010 Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009; 4: 44–57, doi: 10.1038/nprot.2008.211.
https://doi.org/10.1038/nprot.2008.211...
,1111 Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 2009; 37: 1–13, doi: 10.1093/nar/gkn923.
https://doi.org/10.1093/nar/gkn923...
). The P-value threshold was set at 0.05 in the GO analysis (1717 Falcon S, Gentleman R. Using GOstats to test gene lists for GO term association. Bioinformatics 2007; 23: 257–258, doi: 10.1093/bioinformatics/btl567.
https://doi.org/10.1093/bioinformatics/b...
). Hypergeometric testing (P-value threshold: 0.05) was used in the pathway analysis (1818 Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 2000; 28: 27–30, doi: 10.1093/nar/28.1.27.
https://doi.org/10.1093/nar/28.1.27...
). Then, calculation of Pearson's correlation coefficients was performed between the DE genes based on their expression levels. A correlation threshold of 0.8 was used, and highly correlated gene pairs were chosen to construct the co-expression network using Cytoscape software, version 3.4.0 (1212 Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 2003; 13: 2498–2504, doi: 10.1101/gr.1239303.
https://doi.org/10.1101/gr.1239303...
). Furthermore, using the R package “pROC” (1919 Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 2011; 12: 77, doi: 10.1186/1471-2105-12-77.
https://doi.org/10.1186/1471-2105-12-77...
), we evaluated the classification performance of each DE gene by ROC curve plotting and calculation of area under the ROC curve (AUC) values. Then, we performed recursive feature selection of DE genes using the R package “caret” (2020 Kuhn M. Building predictive models in r using the caret package. J Stat Softw 2008; 28: 1–26, doi: 10.18637/jss.v028.i05.
https://doi.org/10.18637/jss.v028.i05...
) and trained an SVM model for pediatric septic shock prognosis according to the selected features using the R package “kernlab” (2121 Karatzoglou A, Smola A, Hornik K. Kernlab - an S4 package for kernel methods in R. J Stat Softw 2004; 11: 1–20.) (Gaussian RBF kernel; 10-fold cross-validation). External model validation was also performed using the testing dataset.

Results

DE genes in pediatric septic shock non-survivors

A total of 8 studies were obtained through the original search, among which 4 adult studies were excluded. The remaining 4 microarray datasets (3 training datasets and 1 testing dataset all from the Affymetrix microarrays) (55 Cvijanovich N, Shanley TP, Lin R, Allen GL, Thomas NJ, Checchia P, et al. Validating the genomic signature of pediatric septic shock. Physiol Genomics 2008; 34: 127–134, doi: 10.1152/physiolgenomics.00025.2008.
https://doi.org/10.1152/physiolgenomics....
88 Wong HR, Shanley TP, Sakthivel B, Cvijanovich N, Lin R, Allen GL, et al. Genome-level expression profiles in pediatric septic shock indicate a role for altered zinc homeostasis in poor outcome. Physiol Genomics 2007; 30: 146–155, doi: 10.1152/physiolgenomics.00024.2007.
https://doi.org/10.1152/physiolgenomics....
) were used for subsequent analysis. Table 1 presents detailed information about the 4 datasets.

Table 1
Microarray datasets used in this analysis.

For the 3 training datasets, pre-processing resulted in expression data of 20,464 genes in 35 pediatric septic shock non-survivors and 175 survivors. A total of 32 genes were found to be differentially expressed between non-survivors and survivors across the microarray datasets based on the criteria (absolute LFC >0.8, FDR-adjusted P-value <0.05) (Supplementary Table S1). Figure 1 shows the heat map of expression data for the 32 DE genes in the collection of the 3 training datasets. Among the 32 DE genes, 18 genes were up-regulated and 14 genes were downregulated. Table 2 presents the top 10 most significantly upregulated and top 10 most significantly downregulated genes. The top upregulated gene was olfactomedin 4 (OLFM4) (LFC=1.72). OLFM4 is a glycoprotein that functions in innate immunity, inflammation, and cancer (2222 Liu W, Rodgers GP. Olfactomedin 4 expression and functions in innate immunity, inflammation, and cancer. Cancer Metastasis Rev 2016; 35: 201–212, doi: 10.1007/s10555-016-9624-2.
https://doi.org/10.1007/s10555-016-9624-...
). It was suggested to be a neutrophil subset marker in patients suffering from septic shock (2323 Alder MN, Opoka AM, Lahni P, Hildeman DA, Wong HR. Olfactomedin-4 is a candidate marker for a pathogenic neutrophil subset in septic shock. Crit Care Med 2017; 45: e426–e432, doi: 10.1097/CCM.0000000000002102.
https://doi.org/10.1097/CCM.000000000000...
). The most downregulated gene was membrane metalloendopeptidase (MME) (LFC=-1.11). MME is widely present in organs and tissues, contributing to the inactivation of many biological peptides (2424 Zhang D, Gu T, Forsberg E, Efendic S, Brismar K, Gu HF. Genetic and functional effects of membrane metalloendopeptidase on diabetic nephropathy development. Am J Nephrol 2011; 34: 483–490, doi: 10.1159/000333006.
https://doi.org/10.1159/000333006...
,2525 Martens A, Eppink GJ, Woittiez AJ, Eidhof H, de Leij LF. Neutrophil function capacity to express CD10 is decreased in patients with septic shock. Crit Care Med 1999; 27: 549–553, doi: 10.1097/00003246-199903000-00034.
https://doi.org/10.1097/00003246-1999030...
). Decreased MME expression capacity was detected in the neutrophils from patients with septic shock (2525 Martens A, Eppink GJ, Woittiez AJ, Eidhof H, de Leij LF. Neutrophil function capacity to express CD10 is decreased in patients with septic shock. Crit Care Med 1999; 27: 549–553, doi: 10.1097/00003246-199903000-00034.
https://doi.org/10.1097/00003246-1999030...
).

Table 2
The 10 most significantly upregulated and most significantly downregulated genes in pediatric septic shock non-survivors.

Figure 1
Heatmap of expression data for 32 differentially expressed genes in the collection of 3 training datasets.

Enrichment analysis and co-expression network construction

For further functional investigation, we performed advanced analyses (GO and pathway analysis) of the DE genes. Figure 2A and Supplementary Table S2 show the results of GO biological process analysis, which were similar to those reported in the original microarray studies (55 Cvijanovich N, Shanley TP, Lin R, Allen GL, Thomas NJ, Checchia P, et al. Validating the genomic signature of pediatric septic shock. Physiol Genomics 2008; 34: 127–134, doi: 10.1152/physiolgenomics.00025.2008.
https://doi.org/10.1152/physiolgenomics....
88 Wong HR, Shanley TP, Sakthivel B, Cvijanovich N, Lin R, Allen GL, et al. Genome-level expression profiles in pediatric septic shock indicate a role for altered zinc homeostasis in poor outcome. Physiol Genomics 2007; 30: 146–155, doi: 10.1152/physiolgenomics.00024.2007.
https://doi.org/10.1152/physiolgenomics....
). The DE genes were significantly enriched in 6 GO terms, and the top 3 were “acute inflammatory response to antigenic stimulus” (P=0.013), “response to yeast” (P=0.021), and “defense response to bacterium” (P=0.024). In the pathway analysis, when DE genes were mapped to the KEGG database, no significant pathway was identified.

We also constructed a co-expression network using significantly correlated DE gene pairs (correlation coefficients greater than 0.8). The co-expression network contained 16 nodes and 23 edges (Figure 2B). We further extracted sub-networks for the GO enrichment analysis. One GO term (“chemotaxis”) was significantly enriched in the sub-network composed of ENPP2, LOC100134822, CREB5, CXCR2, and FCGR3B. In the sub-network containing ELANE, MPO, and PRTN34, 4 significant GO terms (“response to yeast”, “negative regulation of growth of symbiont in host”, “defense response to bacterium”, and “response to lipopolysaccharide”) were identified.

Figure 2
Enrichment analysis and co-expression network construction results. A, Summary of the gene ontology analysis results for the differentially expressed (DE) genes in pediatric septic shock non-survivors. B, Co-expression network of highly correlated DE gene pairs in pediatric septic shock non-survivors (correlation coefficients >0.8). Edge widths are proportional to the correlation coefficients.

Classification performance evaluation and validation

ROC curves and AUC values suggested that the top performing DE genes were DDIT4, PRG2, ARHGEF40, MME, CD24, CEP55, CD69, UHRF1, and KIF11 (Figure 3), with the best single gene's AUC as great as 0.829. The results of recursive feature selection showed that 11 genes (Supplementary Table S3) were sufficient to achieve close to 91% prediction accuracy. Based on the selected features, we trained an SVM classifier (pediatric septic shock non-survivors vs survivors) and evaluated its performance by 10-fold cross-validation (cross-validation error: 0.133). The SVM classifier performed well in the training dataset (AUC=0.986, Figure 4A). The pre-processed testing dataset was also used for independent validation, and the SVM classifier was constructed according to the training datasets, showing modest performance in the testing dataset (AUC=0.722, Figure 4B).

Figure 3
Receiver operating characteristic curves of top performing differentially expressed genes in prognostic prediction of pediatric septic shock. A, DDIT4; B, PRG2; C, ARHGEF40; D, MME; E, CD24; F, CEP55; G, CD69; H, UHRF1; I, KIF11. AUC: area under the receiver operating characteristic curve.

Discussion

Many genes are reported to be associated with septic shock mortality (2323 Alder MN, Opoka AM, Lahni P, Hildeman DA, Wong HR. Olfactomedin-4 is a candidate marker for a pathogenic neutrophil subset in septic shock. Crit Care Med 2017; 45: e426–e432, doi: 10.1097/CCM.0000000000002102.
https://doi.org/10.1097/CCM.000000000000...
,2626 Sekino M, Funaoka H, Sato S, Okada K, Inoue H, Yano R, et al. Intestinal fatty acid-binding protein level as a predictor of 28-day mortality and bowel ischemia in patients with septic shock: a preliminary study. J Crit Care 2017; 42: 92–100, doi: 10.1016/j.jcrc.2017.07.012.
https://doi.org/10.1016/j.jcrc.2017.07.0...
,2727 Tomino A, Tsuda M, Aoki R, Kajita Y, Hashiba M, Terajima T, et al. Increased PD-1 expression and altered T cell repertoire diversity predict mortality in patients with septic shock: a preliminary study. PLoS One 2017; 12: e0169653, doi: 10.1371/journal.pone.0169653.
https://doi.org/10.1371/journal.pone.016...
). It is a challenge currently to identify the most important candidate genes and pathways in septic shock prognosis. Growth of high-throughput transcriptomic data has enabled the integrative analysis of multiple datasets to discover robust candidates for prognosis and treatment. Therefore, to identify potential transcriptomic markers in pediatric septic shock prognosis, we performed an integrative analysis of multiple public microarray datasets.

In this study, 32 DE genes were identified in pediatric septic shock non-survivors, among which 18 genes were upregulated and 14 were downregulated. The most upregulated gene was OLFM4. The most downregulated gene was MME. Among the 32 DE genes identified, OLFM4 was reported to be present in only 20-25% of peripheral blood neutrophils and identified a subset of human neutrophils (2828 Clemmensen SN, Bohr CT, Rørvig S, Glenthøj A, Mora-Jensen H, Cramer EP, et al. Olfactomedin 4 defines a subset of human neutrophils. J Leukoc Biol 2012; 91: 495–500, doi: 10.1189/jlb.0811417.
https://doi.org/10.1189/jlb.0811417...
). MME is widely present in organs and tissues, contributing to the inactivation of many biological peptides (2424 Zhang D, Gu T, Forsberg E, Efendic S, Brismar K, Gu HF. Genetic and functional effects of membrane metalloendopeptidase on diabetic nephropathy development. Am J Nephrol 2011; 34: 483–490, doi: 10.1159/000333006.
https://doi.org/10.1159/000333006...
,2525 Martens A, Eppink GJ, Woittiez AJ, Eidhof H, de Leij LF. Neutrophil function capacity to express CD10 is decreased in patients with septic shock. Crit Care Med 1999; 27: 549–553, doi: 10.1097/00003246-199903000-00034.
https://doi.org/10.1097/00003246-1999030...
). Decreased MME expression capacity was detected in the neutrophils from patients with septic shock (2525 Martens A, Eppink GJ, Woittiez AJ, Eidhof H, de Leij LF. Neutrophil function capacity to express CD10 is decreased in patients with septic shock. Crit Care Med 1999; 27: 549–553, doi: 10.1097/00003246-199903000-00034.
https://doi.org/10.1097/00003246-1999030...
). CXCR2 is a key stimulant to immune cell migration as well as recruitment, especially for neutrophils. The CXCR2 signaling pathway is suggested to be a potential target for modification of neutrophil dynamics in inflammatory disorders (2929 Zhang X, Guo R, Kambara H, Ma F, Luo HR. The role of CXCR2 in acute inflammatory responses and its antagonists as anti-inflammatory therapeutics. Curr Opin Hematol 2019; 26: 28–33, doi: 10.1097/MOH.0000000000000476.
https://doi.org/10.1097/MOH.000000000000...
). CEACAM8 belongs to CEA family, which is known to function as intercellular adhesion molecules. CEACAM8 is suggested to be involved in the regulation of the neutrophil adhesion (3030 Zhao L, Furebring M, Xu S, Venge P. Subcellular localization and mobilization of carcinoembryonic antigen-related cell adhesion molecule 8 in human neutrophils. Br J Haematol 2004; 125: 666–673, doi: 10.1111/j.1365-2141.2004.04963.x.
https://doi.org/10.1111/j.1365-2141.2004...
). ELANE encodes neutrophil elastase, which is a cytotoxic serine protease stored in azurophil granules and released after neutrophil activation. ELANE also functions in neutrophil extracellular traps (3131 Skokowa J, Dale DC, Touw IP, Zeidler C, Welte K. Severe congenital neutropenias. Nat Rev Dis Primers 2017; 3: 17032, doi: 10.1038/nrdp.2017.32.
https://doi.org/10.1038/nrdp.2017.32...
). Neutrophil dysfunction is known to promote sepsis, and the alterations of chemokines, cytokines, as well as other mediators lead to neutrophil dysfunction in sepsis. Thus, the DE genes identified in this study, such as OLFM4, MME, CXCR2, CEACAM8, and ELANE, probably take part in pediatric sepsis development by regulation of neutrophil function (3232 Zhang F, Liu AL, Gao S, Ma S, Guo SB. Neutrophil dysfunction in sepsis. Chin Med J (Engl) 2016; 129: 2741–2744.).

Furthermore, a series of DE genes identified in this study have not yet been investigated in pediatric septic shock prognosis, and the exact contributions of these genes to pediatric septic shock are not yet clear. Since these DE genes could serve as potential transcriptomic markers for pediatric septic shock, further research is required.

In the enrichment analysis of the 32 DE genes, “response to yeast” (P=0.021) was the second most enriched GO term. This is interesting because yeast infection has been reported to cause sepsis (3333 Toh J, Snowball V. Postpartum sepsis caused by an intra-abdominal yeast infection. J Obstet Gynaecol 2020; 40: 432–433, doi: 10.1080/01443615.2019.1645821.
https://doi.org/10.1080/01443615.2019.16...
).

According to the literature, machine learning methods, including SVM, usually have high prediction accuracy (3434 Steyerberg EW. Clinical prediction models: a practical approach to development, validation, and updating. NY: Springer; 2009.). Hence, in this study, we trained an SVM classifier (pediatric septic shock in non-survivors vs survivors) using gene expression data to improve the performance of prognostic prediction. The SVM classifier performed well in the training dataset with an AUC value of 0.986 (Figure 4A), compared with Pediatric Sepsis Biomarker Risk Model-II (3535 Wong HR, Cvijanovich NZ, Anas N, Allen GL, Thomas NJ, Bigham MT, et al. Pediatric sepsis biomarker risk model-II: redefining the pediatric sepsis biomarker risk model with septic shock phenotype. Crit Care Med 2016; 44: 2010–2017, doi: 10.1097/CCM.0000000000001852.
https://doi.org/10.1097/CCM.000000000000...
). However, it showed modest performance in the testing dataset (AUC=0.722, Figure 4B), suggesting that there could be an over-fitting problem. To the best of our knowledge, no gene expression-based SVM model has yet been reported for pediatric septic shock prognostic prediction. We hope the predictive model trained in this study facilitates better prognosis and treatment of pediatric septic shock.

Figure 4
Receiver operating characteristic curves of the support vector machine classifier in prognostic prediction of pediatric septic shock. A, Performance with the training dataset. B, Performance with the testing dataset. AUC: area under the receiver operating characteristic curve.

Because of the existence of several limitations, the results of our study should be considered cautiously. First, the sample size used in our study was insufficient. Second, stratified analyses according to potential influential factors (age, sex, and platform usage) could not be performed in this study due to the lack of corresponding data. Third, both the biological knowledge base and the pathway data are currently incomplete. In consideration of the reported impact of age and sex on pediatric sepsis patients (3636 Maat M, Buysse CM, Emonts M, Spanjaard L, Joosten KF, de Groot R, et al. Improved survival of children with sepsis and purpura: effects of age, gender, and era. Crit Care 2007; 11: R112, doi: 10.1186/cc6161.
https://doi.org/10.1186/cc6161...
), stratified analyses, based on factors including age, sex, and platform usage, are required in the future. Further analysis based on larger sample sizes is also necessary. Moreover, to explore the exact roles of candidate DE genes in pediatric septic shock, experimental verification and functional studies of these DE genes must also be performed in the future.

In conclusion, consistently DE genes in pediatric septic shock non-survivors were identified in this study, and they could be potential transcriptomic markers. GO analysis suggested that these candidates were strongly correlated with acute inflammatory response to antigenic stimuli, response to yeast, and defense responses to bacteria. We also trained an SVM classifier (non-survivors vs survivors) using candidate transcriptomic markers. These results provided novel insights into the prognosis of pediatric septic shock and promoted the generation of prognostic gene sets.

Acknowledgments

This work was supported by grants from National Natural Science Foundation (grant numbers 81971867, 81501840); Jiangsu Provincial Medical Youth Talent (grant number QNRC2016768); Natural Science Foundation of Jiangsu Province (grant number BK20180206); Suzhou Science and Technology project (grant numbers SYS2019082, SYS2018072); National Natural Science Foundation (grant number 31670853); and Major International (Regional) Joint Research Project (grant number 81420108022).

References

  • 1
    Kissoon N, Carapetis J. Pediatric sepsis in the developing world. J Infect 2015; 711: S21–S26, doi: 10.1016/j.jinf.2015.04.016.
    » https://doi.org/10.1016/j.jinf.2015.04.016
  • 2
    van der Poll T, van de Veerdonk FL, Scicluna BP, Netea MG. The immunopathology of sepsis and potential therapeutic targets. Nat Rev Immunol 2017; 17: 407–420, doi: 10.1038/nri.2017.36.
    » https://doi.org/10.1038/nri.2017.36
  • 3
    Cecconi M, Evans L, Levy M, Rhodes A. Sepsis and septic shock. Lancet 2018; 392: 75–87, doi: 10.1016/S0140-6736(18)30696-2.
    » https://doi.org/10.1016/S0140-6736(18)30696-2
  • 4
    Emr BM, Alcamo AM, Carcillo JA, Aneja RK, Mollen KP. Pediatric sepsis update: how are children different? Surg Infect (Larchmt) 2018; 19: 176–183, doi: 10.1089/sur.2017.316.
    » https://doi.org/10.1089/sur.2017.316
  • 5
    Cvijanovich N, Shanley TP, Lin R, Allen GL, Thomas NJ, Checchia P, et al. Validating the genomic signature of pediatric septic shock. Physiol Genomics 2008; 34: 127–134, doi: 10.1152/physiolgenomics.00025.2008.
    » https://doi.org/10.1152/physiolgenomics.00025.2008
  • 6
    Wynn JL, Cvijanovich NZ, Allen GL, Thomas NJ, Freishtat RJ, Anas N, et al. The influence of developmental age on the early transcriptomic response of children with septic shock. Mol Med 2011; 17: 1146–1156, doi: 10.2119/molmed.2011.00169.
    » https://doi.org/10.2119/molmed.2011.00169
  • 7
    Wong HR, Cvijanovich N, Lin R, Allen GL, Thomas NJ, Willson DF, et al. Identification of pediatric septic shock subclasses based on genome-wide expression profiling. BMC Med 2009; 7: 34, doi: 10.1186/1741-7015-7-34.
    » https://doi.org/10.1186/1741-7015-7-34
  • 8
    Wong HR, Shanley TP, Sakthivel B, Cvijanovich N, Lin R, Allen GL, et al. Genome-level expression profiles in pediatric septic shock indicate a role for altered zinc homeostasis in poor outcome. Physiol Genomics 2007; 30: 146–155, doi: 10.1152/physiolgenomics.00024.2007.
    » https://doi.org/10.1152/physiolgenomics.00024.2007
  • 9
    Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004; 5: R80, doi: 10.1186/gb-2004-5-10-r80.
    » https://doi.org/10.1186/gb-2004-5-10-r80
  • 10
    Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009; 4: 44–57, doi: 10.1038/nprot.2008.211.
    » https://doi.org/10.1038/nprot.2008.211
  • 11
    Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 2009; 37: 1–13, doi: 10.1093/nar/gkn923.
    » https://doi.org/10.1093/nar/gkn923
  • 12
    Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 2003; 13: 2498–2504, doi: 10.1101/gr.1239303.
    » https://doi.org/10.1101/gr.1239303
  • 13
    Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, et al. NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res 2013; 41: D991–D995, doi: 10.1093/nar/gks1193.
    » https://doi.org/10.1093/nar/gks1193
  • 14
    Gautier L, Cope L, Bolstad BM, Irizarry RA. affy--analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 2004; 20: 307–315, doi: 10.1093/bioinformatics/btg405.
    » https://doi.org/10.1093/bioinformatics/btg405
  • 15
    Smyth GK. limma: Linear Models for Microarray Data. In: Gentleman R, Carey V, Huber W, Irizarry R, Dudoit S, Bioinformatics and Computational Biology Solutions Using R and Bioconductor. New York: Springer; 2005. p 397–420.
  • 16
    Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 2015; 43: e47, doi: 10.1093/nar/gkv007.
    » https://doi.org/10.1093/nar/gkv007
  • 17
    Falcon S, Gentleman R. Using GOstats to test gene lists for GO term association. Bioinformatics 2007; 23: 257–258, doi: 10.1093/bioinformatics/btl567.
    » https://doi.org/10.1093/bioinformatics/btl567
  • 18
    Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 2000; 28: 27–30, doi: 10.1093/nar/28.1.27.
    » https://doi.org/10.1093/nar/28.1.27
  • 19
    Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 2011; 12: 77, doi: 10.1186/1471-2105-12-77.
    » https://doi.org/10.1186/1471-2105-12-77
  • 20
    Kuhn M. Building predictive models in r using the caret package. J Stat Softw 2008; 28: 1–26, doi: 10.18637/jss.v028.i05.
    » https://doi.org/10.18637/jss.v028.i05
  • 21
    Karatzoglou A, Smola A, Hornik K. Kernlab - an S4 package for kernel methods in R. J Stat Softw 2004; 11: 1–20.
  • 22
    Liu W, Rodgers GP. Olfactomedin 4 expression and functions in innate immunity, inflammation, and cancer. Cancer Metastasis Rev 2016; 35: 201–212, doi: 10.1007/s10555-016-9624-2.
    » https://doi.org/10.1007/s10555-016-9624-2
  • 23
    Alder MN, Opoka AM, Lahni P, Hildeman DA, Wong HR. Olfactomedin-4 is a candidate marker for a pathogenic neutrophil subset in septic shock. Crit Care Med 2017; 45: e426–e432, doi: 10.1097/CCM.0000000000002102.
    » https://doi.org/10.1097/CCM.0000000000002102
  • 24
    Zhang D, Gu T, Forsberg E, Efendic S, Brismar K, Gu HF. Genetic and functional effects of membrane metalloendopeptidase on diabetic nephropathy development. Am J Nephrol 2011; 34: 483–490, doi: 10.1159/000333006.
    » https://doi.org/10.1159/000333006
  • 25
    Martens A, Eppink GJ, Woittiez AJ, Eidhof H, de Leij LF. Neutrophil function capacity to express CD10 is decreased in patients with septic shock. Crit Care Med 1999; 27: 549–553, doi: 10.1097/00003246-199903000-00034.
    » https://doi.org/10.1097/00003246-199903000-00034
  • 26
    Sekino M, Funaoka H, Sato S, Okada K, Inoue H, Yano R, et al. Intestinal fatty acid-binding protein level as a predictor of 28-day mortality and bowel ischemia in patients with septic shock: a preliminary study. J Crit Care 2017; 42: 92–100, doi: 10.1016/j.jcrc.2017.07.012.
    » https://doi.org/10.1016/j.jcrc.2017.07.012
  • 27
    Tomino A, Tsuda M, Aoki R, Kajita Y, Hashiba M, Terajima T, et al. Increased PD-1 expression and altered T cell repertoire diversity predict mortality in patients with septic shock: a preliminary study. PLoS One 2017; 12: e0169653, doi: 10.1371/journal.pone.0169653.
    » https://doi.org/10.1371/journal.pone.0169653
  • 28
    Clemmensen SN, Bohr CT, Rørvig S, Glenthøj A, Mora-Jensen H, Cramer EP, et al. Olfactomedin 4 defines a subset of human neutrophils. J Leukoc Biol 2012; 91: 495–500, doi: 10.1189/jlb.0811417.
    » https://doi.org/10.1189/jlb.0811417
  • 29
    Zhang X, Guo R, Kambara H, Ma F, Luo HR. The role of CXCR2 in acute inflammatory responses and its antagonists as anti-inflammatory therapeutics. Curr Opin Hematol 2019; 26: 28–33, doi: 10.1097/MOH.0000000000000476.
    » https://doi.org/10.1097/MOH.0000000000000476
  • 30
    Zhao L, Furebring M, Xu S, Venge P. Subcellular localization and mobilization of carcinoembryonic antigen-related cell adhesion molecule 8 in human neutrophils. Br J Haematol 2004; 125: 666–673, doi: 10.1111/j.1365-2141.2004.04963.x.
    » https://doi.org/10.1111/j.1365-2141.2004.04963.x
  • 31
    Skokowa J, Dale DC, Touw IP, Zeidler C, Welte K. Severe congenital neutropenias. Nat Rev Dis Primers 2017; 3: 17032, doi: 10.1038/nrdp.2017.32.
    » https://doi.org/10.1038/nrdp.2017.32
  • 32
    Zhang F, Liu AL, Gao S, Ma S, Guo SB. Neutrophil dysfunction in sepsis. Chin Med J (Engl) 2016; 129: 2741–2744.
  • 33
    Toh J, Snowball V. Postpartum sepsis caused by an intra-abdominal yeast infection. J Obstet Gynaecol 2020; 40: 432–433, doi: 10.1080/01443615.2019.1645821.
    » https://doi.org/10.1080/01443615.2019.1645821
  • 34
    Steyerberg EW. Clinical prediction models: a practical approach to development, validation, and updating. NY: Springer; 2009.
  • 35
    Wong HR, Cvijanovich NZ, Anas N, Allen GL, Thomas NJ, Bigham MT, et al. Pediatric sepsis biomarker risk model-II: redefining the pediatric sepsis biomarker risk model with septic shock phenotype. Crit Care Med 2016; 44: 2010–2017, doi: 10.1097/CCM.0000000000001852.
    » https://doi.org/10.1097/CCM.0000000000001852
  • 36
    Maat M, Buysse CM, Emonts M, Spanjaard L, Joosten KF, de Groot R, et al. Improved survival of children with sepsis and purpura: effects of age, gender, and era. Crit Care 2007; 11: R112, doi: 10.1186/cc6161.
    » https://doi.org/10.1186/cc6161

Supplementary Material

Click here to view [pdf].

Publication Dates

  • Publication in this collection
    25 Jan 2021
  • Date of issue
    2021

History

  • Received
    12 Apr 2020
  • Accepted
    25 Sept 2020
Associação Brasileira de Divulgação Científica Av. Bandeirantes, 3900, 14049-900 Ribeirão Preto SP Brazil, Tel. / Fax: +55 16 3315-9120 - Ribeirão Preto - SP - Brazil
E-mail: bjournal@terra.com.br