Open-access Discrimination between COVID-19 Positive and Negative Blood Sera Using an Unmodified Disposable Impedimetric Sensor and Multivariate Analysis

Abstract

The present study introduces a direct approach for classifying blood serum samples as either positive or negative for coronavirus disease (COVID-19) by associating the electrochemical impedance data of the sample with multivariate analysis. The hypothesis is that the systematic alterations in blood composition resulting from a severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection give rise to a distinct impedance spectrum when infected serum is subjected to analysis. A total of 201 serum samples were analyzed using the gold standard method, reverse transcription-polymerase chain reaction (RT-PCR), which served to train and validate the classification models. Two variations of discriminant analysis (partial least squares discriminant analysis (PLS-DA) and principal component analysis-discriminant analysis (PCA-DA)) and a one-class modeling approach (soft independent modeling of class analogies (SIMCA)) were used to classify impedance data in different formats (as complex or real numbers). PCA-DA applied to imaginary impedance spectra was found to be the most effective strategy, achieving sensitivity, specificity, and precision of 94, 94, and 91%, respectively, with classification error rates as low as 6%. These findings are encouraging and could facilitate the development of an inexpensive and reliable screening method for COVID-19.

Keywords:
pattern recognition; SARS-CoV-2; electrochemical impedance; discriminant analysis; one-class modeling


Introduction

The coronavirus disease (COVID-19) pandemic, originating from the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in Wuhan, China in 2019, swiftly engulfed the globe, leaving a profound mark on humanity. Declared a pandemic by the World Health Organization (WHO) in March 2020, it has resulted in over 770 million confirmed cases and more than 7 million deaths worldwide.1 Despite hallmark symptoms such as coughing, fever, and difficulty breathing, many infected individuals experience mild or no symptoms. This asymptomatic (or minimally symptomatic) group can still transmit the virus, which facilitated silent transmission. Progress in vaccination campaigns and rising population immunity prompted the WHO to declare an end to the pandemic in May 2023.2 Despite this, COVID-19 remains a persistent threat. Conventional techniques for detecting SARS-CoV-2 include antigen, antibody, and molecular tests were essential to curb its spread and minimize infection rates. Antigen tests detect virus-specific proteins, while antibody tests assess anti-SARS-CoV-2 levels in the blood, indicating antibody development. Molecular tests analyze different regions of the ribonucleic acid (RNA) genome of the virus, with reverse transcription-polymerase chain reaction (RT-PCR) considered the gold standard and recommended by WHO since the beginning of the pandemic.3 Despite its high sensitivity and specificity, RT-PCR requires specialized laboratories and trained personnel, with results taking several hours to obtain.

Many studies on the diagnosis of COVID-19 using electrochemical biosensors have been published using potentiometry,4 amperometry,5 voltammetry6 and electrochemical impedance spectroscopy (EIS).7 EIS is particularly advantageous for biosensing due to its high sensitivity to changes in the electrical properties of the surface of the sensor as it interacts with target biomolecules.8 These interactions induce sensitive changes in impedance, mainly the increase of the charge-transfer resistance of a redox probe, that is directly correlated to the presence and concentration of the target.9 With its ability to operate with no need for labeling or extensive sample preparation, EIS makes it an appealing choice for rapid, on-site diagnostic applications.

EIS is a sensitive and robust technique but combining it with multivariate analysis and machine learning techniques - referred to as chemometrics when applied to chemical data - can be very advantageous, especially when it comes to improving the efficiency of impedimetric sensors.10,11 However, scarce examples are found in the context of COVID-19 testing. One such example is the use of a molecularly imprinted polymer for the selective recognition of the SARS-CoV-2 virus in saliva samples aided by principal component analysis (PCA).12 The developed sensor achieved 75% agreement with the reference method, loop-mediated isothermal amplification (LAMP). It is worth noting that although the test set is limited to 15 positive samples and 9 negative samples, and PCA is an exploratory technique rather than classification one, the findings are interesting. In another study, a label-free impedimetric biosensor was developed based on modifying a glassy carbon electrode with Au nanoparticles that were modified with a low-cost, easy-to-synthesize peptide consisting of five amino acids. The peptide was designed to replicate the N-terminal subunit in the S protein, allowing it to recognize the S protein antibody. Impedance data were used to differentiate between 15 prepandemic individuals and 19 convalescent patients using partial least-squares discriminant analysis (PLS-DA) and sure independence screening and sparsifying operator (SISSO), obtaining 100% accuracy in both cases.13

Combining EIS with multivariate analysis or machine learning techniques can also allow for some degree of simplification of the sensor assembly by avoiding complex modifications that use expensive reagents, or even by using unmodified electrodes. This simplification may result in a less selective sensor, which is balanced by the high efficiency of the treatment of the produced multivariate analytical signal (impedance spectrum) with multivariate analysis. Each impedance value in the spectrum is a low-selective and cross-sensitive pseudo-sensor, whose response is related to ionic, redox, or molecular interactions at the sensor/solution interface.14 The global selectivity resulting from the joint use of such sensors, combined with the chemometric treatment of the signals, produce an “electrochemical fingerprint”15 that allows both the qualitative recognition of the target biomolecule and its quantification.

There are several successful examples in quantitative and qualitative modeling including the determination of pharmaceuticals16 and environmental contaminants,17 as well as electronic tongues.18 Among the examples of the association between chemometrics and EIS data produced by unmodified electrodes in clinical analysis are the discrimination between normal and cancerous human urothelial cells using random forest19 and the discrimination between different types of blood thrombi using support vector machines,20 both with an accuracy of over 90%. This study aimed to evaluate the effectiveness of using an unmodified disposable impedimetric sensor to distinguish between blood serum samples infected with COVID-19 and non-infected samples. The samples were analyzed using the gold standard method of reverse transcription-polymerase chain reaction (RT-PCR). The impedance data obtained were used to develop classification models based on discriminant analysis and one-class modeling.

Experimental

All reagents used in this study, including potassium chloride (KCl), disodium hydrogen phosphate (Na2HPO4), monopotassium phosphate (KH2PO4), sodium chloride (NaCl), sodium hydroxide (NaOH), potassium ferrocyanide (K4Fe(CN)6), and potassium ferricyanide (K3Fe(CN)6), were of analytical grade and were purchased from LabSynth (São Paulo, Brazil). The study employed a 0.1 mol L-1 pH 7.4 phosphate buffered saline (PBS) solution, which was prepared with ultrapure water (resistivity > 18 Mil cm-1, Milli-Q, Millipore, USA).

The blood serum samples were obtained from hospitals in the metropolitan area of João Pessoa and Campina Grande, located in the state of Paraíba, Brazil. The study was approved by the Human Research Ethics Committee of the Lauro Wanderley University Hospital (CAAE:31562720.9.0000.5183). In order to guarantee that the multivariate models obtained are sufficiently robust and have an acceptable chance of correctly discriminating future samples, it is essential that the training and test sets are constructed in such a way as to include samples that represent the greatest possible variability of positive and negative case situations. Consequently, the positive samples represent situations of infection and reinfection. The set of negative samples contained occurrences of patients with symptoms similar to those of the disease, such as a cold or influenza, as well as patients at different stages of immunization (post-infection and post-vaccination), including healthcare workers. However, it is important to note that these differences were not considered in the classification of the samples, which was based solely on the results of the RT-PCR test. To prepare for analysis, all serum samples were diluted in PBS pH 7.4 at a 1:10 ratio.

Impedance measurements were conducted using a potentiostat Autolab PGSTAT 101 (Metrohm, Switzerland) connected to a screen-printed electrode boxed connector (Ref. DSC, Metrohm Dropsens, Spain) and a microcomputer running NOVA 2.1.6 software (Metrohm, Switzerland). Commercially available screen-printed electrodes DRP-220BT (Metrohm Dropsens, Spain) were used, featuring a silver reference electrode, gold working electrode (diameter 4 mm) and gold counter electrode.

For the measurements, a 100 μL aliquot of the background solution (comprising an equimolar mixture of potassium ferricyanide and potassium ferrocyanide at 0.1 mmol L-1 in PBS pH 7.4) was applied to the electrode. The open circuit potential was recorded for 60 s prior the scan and used as the base potential for the impedance spectrum collection, which was obtained from 100 KHz to 1 Hz, at an amplitude of 10 mV, with ten frequencies per decade, all conducted at ambient temperature (20 ± 1 °C). Subsequently, 10 μL of the prepared blood serum sample were added to the background solution on the electrode and incubated for 30 min to allow for the serum constituents to interact with the electrode surface. A new scan was then carried out. This scan was used in the modeling stage. Samples were analyzed in triplicate. The background solution scan was used to check for electrode reproducibility.

A total of 201 blood serum samples were analyzed using the gold standard method RT-PCR, with 121 testing positive and 80 testing negative for SARS-CoV-2. The spectral data obtained was organized into matrices of size 201 × 51 (samples × frequencies). Five different matrices were obtained considering the following representations of the impedance data as real numbers - real part (Z’), imaginary part (Z"), phase angle (cp) and absolute impedance (IZI) - and complex numbers (Z’ - jZ’’). Each dataset was individually assessed in the model development. First, the samples were divided into training and test sets in a 60:40 ratio by using the Kennard-Stone uniform sampling algorithm.21 Thus, the training set included 74 seropositive samples and 47 seronegative samples, while the test set comprised 47 seropositive samples and 33 seronegative samples.

The impedance data was used to develop classification models based on discriminant analysis and one-class modeling. One-class modeling is focused on learning the characteristics of a single class to distinguish typical instances from outliers. This approach was selected because the negative samples (non-infected blood serum) have a high degree of variability and are not well-defined, unlike the positive samples. We employed the Soft Independent Modeling of Class Analogies (SIMCA), a widely used one-class approach based on Principal Component Analysis (PCA). SIMCA builds a PCA model that captures most of the class variance and classifies new samples by measuring their distance to the class model within the reduced-dimensional space. A sample is classified as belonging to the class if its distance to the class centroid is below a predefined threshold; otherwise, it is classified as not belonging to the class.

By contrast, discriminant analysis (DA) is a multiclass technique that distinguishes between two or more predefined classes using linear or nonlinear combinations of features. It can be prone to overfitting in datasets with many variables or multicollinearity, such as EIS data. One approach to mitigate the impact of multicollinearity (and at the same time reducing the number of variables, if necessary) is to transform the original variables into a smaller set of uncorrelated variables through PCA. The obtained scores are used as classification variables in DA, and this technique is referred to as PCA-DA. Similarly, Partial Least Squares (PLS) regression can generate latent variables (LV) that best explain predictor variance and their covariance with the response (class index). The process, referred to as PLS-DA, finds the directions in the predictor variable space that maximize class separation in the response variable. Both PCA-DA and PLS-DA were evaluated in this study.

The chemometric models were developed using the Classification Toolbox 6.0 in the Matlab R2023a environment.22 Leave-one-out cross-validation was used to select the number of PCs (or latent variables, LVs) and validate the model. The models were obtained using between 1 and 20 PCs (or LVs). The number of PCs (or LVs) was chosen based on the lowest classification error rate. Column preprocessing involved data centering and autoscaling, while row preprocessing included the use of spectrum logarithmization in addition to non-preprocessed data. The discriminant function type, linear and quadratic, was also evaluated in the application of PCA-DA. The models developed were then applied to the test set, and their efficiency was assessed based on the figures of merit sensitivity (Sens), specificity (Spec), and error rate (ER), calculated as follows:23

(1) Sens = TP TP + FN

(2) Spec = TN TN + FP

(3) Prec = TP TP + FP

(4) ER = FP + FN TP + TN + FP + FN

The terms TP, FP, TN, and FN represent the number of true positives, false positives, true negatives, and false negatives, respectively. Here, true positives (TP) are positive samples correctly identified as positive, false positives (FP) are negative samples incorrectly identified as positive, true negatives (TN) are negative samples correctly identified as negative, and false negatives (FN) are positive samples incorrectly identified as negative. Sensitivity and specificity are metrics that measure the ability of the method to correctly identify positive and negative samples, respectively. Precision, also known as the positive predictive value (PPV), is the proportion of true positive identifications among all samples assigned to the positive class. This metric is also applicable to the negative class, where it is known as the negative predictive value (NPV). The error rate is defined as the ratio of the total number of misclassified samples (errors) to the total number of samples.

Results and Discussion

The hypothesis supporting this study is that COVID-19 infection can be identified indirectly by EIS, from the significant changes in the blood serum chemical composition due to COVID-19.24,25,26 These changes have been evaluated as a basis for the diagnosis of COVID-19 by applying machine learning on blood test results.27,28,29 The serum comprises biomolecules containing thiol, amine, and carboxylic moieties, as well as N-containing heterocycles and ions. These species interact with gold on the surface of the working electrode in various ways. The S-containing groups are the main functional groups that interact with gold, mainly thiols and disulfides, that form some of the strongest known chemisorptive bonds to gold, resulting in stable Au-S bonds.30,31 Thiol and disulfide groups are found in small molecules such as glutathione and thiamine, as well as in cysteine-containing peptides and proteins. Other biomolecules can also be directly adsorbed (physiosorption) onto a clean gold electrode via non-covalent interactions.30,32

It is anticipated that the imbalance resulting from COVID-19 infection will increase the interaction between the biological matrix and the working electrode, impeding the redox process of an electrochemical probe such as the redox pair Fe(CN)63-/4-, altering the impedance spectrum (Figure 1). A classification algorithm can capture the systematic changes in impedance spectra that can be utilized to discriminate COVID-19 positive from negative blood samples.

Figure 1
Hypothesis on the working principle of the sensor. The imbalance in blood composition caused by the COVID-19 infection causes a systematic increase in the impedance of the redox probe reaction, which can be modelled by a multivariate classification algorithm. In the graph on the right, the colors refer to the results provided by the reference method (RT-PCR): blue for positive and red for negative. The symbols represent the predictions obtained in the training (circles) and test (squares) stages.

Figure 2a shows the spectra of independent triplicates for three different seropositive samples (as tested by RT-PCR). The spectra with the same color/symbol in Figure 2a represent the measurements taken before (supporting electrolyte only) and after the serum sample was added to the electrode and incubated for 30 min. The impedance spectra exhibit the benchmark profile of the interaction between the Fe(CN)63-/4- redox couple and a bare electrode surface.33,34 It features a small semicircle at high to mid-frequency region, representing a low charge transfer resistance (Rct) and a fast electron exchange, followed by a linear region at lower frequencies, which is the manifestation of the diffusion-controlled process of the redox species to and from the electrode surface. It should be noted that the Rct (diameter of the semicircle) significantly increases after adding the serum and incubating for 30 min, suggesting that serum components obstruct electron transfer at the electrode surface. It is noteworthy that the spectra obtained in the absence of serum are highly similar, indicating excellent reproducibility of the screen-printed sensors. The inset plot shows an expanded view of the Nyquist plot to highlight the differences among the samples’ spectra.

Figure 2
(a) Each set of spectra with the same color/symbol represents the analysis of a different seropositive sample performed before (supporting electrolyte, smaller semicircle) and after sample incubation on the electrode (larger semicircle). (b) Impedance spectra of the 201 blood samples colored according to the diagnosis obtained by RT-PCR (blue: negative; orange: positive).

Figure 2b presents the Nyquist plot with the impedance spectra of the 201 blood samples colored according to the diagnostic obtained by RT-PCR (blue: negative; orange: positive). The spectra demonstrate the same Fe(CN)63-/4-, redox couple profile, with the positive samples showing a systematically different behavior from the obtained with the negative samples. A systematic increase in Rct and maximum Z’’ is observed for the seropositive samples (as observed in the inset plot of Figure 2b), indicating an increase in the impedance of the interaction of the redox couple with the electrode caused by the blocking of the surface by the constituents of the serum infected with the SARS-CoV-2 virus. This indirectly indicates that the composition of the COVID-19 positive serum is indeed different.

The work of Pushalkar et al.26 provides an excellent summary of the findings to date relating to the main changes in serum composition due to COVID-19. Proteomics reveal dysregulated pathways in lipid homeostasis, immunoglobulins, cytokines, chemokines, and coagulation. Proinflammatory cytokines (IL-6, IL-1β, TNF-α) trigger a ‘cytokine storm’. Regulatory proteins in coagulation (APOH, FN1, HRG, KNG1, PLG) and lipid homeostasis (APOA1, APOC1, APOC2, APOC3, PON1) increase as the disease progresses. Even asymptomatic COVID-19 patients show altered coagulation and inflammation markers like fibrinogen, von Willebrand factor (VWF), and thrombospondin-1 (TSP1). Regarding small biomolecules, Tristán et al.25 identified specific metabolites such as trimethylamine-N-oxide (TMAO), phenylalanine, N-acetylglycoproteins (NAG), tyrosine, lysine, acetone, mannose, citrate, glycerol, and fatty acids, particularly unsaturated fatty acids (UFA), as the key molecules driving the separation between COVID-19 patients and healthy controls in the serum metabolome.

It is important to note that differential pulse voltammetry (DPV) and square wave voltammetry (SWV) were considered in addition to EIS. The measurement principle remains consistent, comprising the following three steps: (i) obtaining a voltammogram of the supporting electrolyte containing the redox pair; (ii) adding the blood serum and incubating for 30 min; (iii) obtaining a new voltammogram. Figure S1 illustrates the average voltammograms obtained before (step i) and after the addition of three seropositive and three seronegative samples. The voltammogram obtained in the initial step displays a single peak at approximately 0.2 V, which is characteristic of the oxidation of Fe2+ to Fe3+. The peaks obtained in stage iii (following the addition of the sample) occur at a slightly higher potential and with a lower current intensity than that observed for the first peak.

This increase in potential and decrease in current intensity indicates the blocking of the electrode surface by the serum components, as observed for the impedance spectrum. However, as evidenced by Figure S1, the difference between the peak currents produced by the negative and positive samples is minimal. A similar observation was noted in the context of SWV (results not shown). These findings suggest that the voltammetric techniques examined in this study exhibit significantly lower sensitivity compared to EIS and are not indicated for distinguishing between positive and negative samples within the context of this method.

Figure 3 shows the Bode plots obtained from the EIS measurements of the serum samples. Figures 3a, 3b and 3c show the Bode plots of the real component (Z’), the imaginary component (Z") and the phase angle (φ), respectively. The Bode plot for the absolute impedance (IZI) is very similar to the one obtained for the real impedance and has therefore not been shown. The inset plots show the average spectra of each class according to the impedance component used. The spectral profiles in all impedance representations are similar throughout the whole set of measurements, although differences are observed in the low frequency region (<100 Hz) for the spectra of the seropositive samples. At this range, they exhibit higher real impedance values than seronegative samples (Figure 3a), as well as significantly higher imaginary impedance values and lower resonant frequencies (Figure 3b). On the other hand, the phase angle spectra are more similar to each other, although there is a tendency for the spectra from each class to have a different span of time constants (Figure 3c).

Figure 3
Bode plots of the serum samples considering the real (a) and imaginary (b) impedances, and the phase angle (c). Blue: negative samples; orange: positive samples. Inset: average spectra.

The higher Z’ values indicate the emergence of additional resistive phenomena to the main electrochemical process (redox couple reaction), possibly caused by formation of an “insulating layer” by adsorption of inhibitory species on the electrode surface after electrode incubation with serum.35,36 This reflects the increase in Rct previously discussed. The higher Z’’ values suggest an increase in the capacitive component of the system, which could be related to an increase in double-layer capacitance (Cdl) or introduction of additional capacitive elements. Adsorption of ions or molecules can increase the effective thickness of the double layer.35

Both increases in Rct and Cdl for seropositive samples produce an increase in the overall time constant of the system, shifting the resonant frequency to a lower value. A lower characteristic frequency means that the dominant electrochemical process has become slower. These distinctions, especially notable when comparing average spectra of each class, suggest that electrode surface blockage is more pronounced in samples from the positive serum class, as hypothesized. Figure S2 (SI section) shows the spectra after row preprocessing by applying the logarithm. The logarithmization produces a reduction in the measurement dispersion, which may favor the production of models based on fewer PCs (or LVs).37

Tables S1 to S15 (SI section) demonstrate the optimal classification results using Z’, Z’’, Φ, |Z| and complex impedance (Z’ - jZ’’) data in PCA-DA, PLS-DA, and SIMCA models across different row preprocessing (none and logarithmization), column preprocessing (mean-centering and autoscaling), PCA-DA discriminant function types (linear and quadratic), and the selection of 1 to 20 PCs/LVs. Results are displayed in a contingence table format, detailing the number of samples classified in each class (type I and type II errors), and the figures of merit resulting from such classifications. The best-performing models, highlighted in grey in Tables S1-S15, showcase the lowest combined error rates (validation and test sets) and highest sensitivity, specificity, and precision. These top models are summarized in Tables 1, 2, and 3 for an easier comparison of the best scenarios for PCA-DA, PLS-DA, and SIMCA, respectively.

Table 1
Best-performing PCA-DA models based on real (Z’), imaginary (Z’’), phase angle (Φ), absolute impedance (IZI), and complex (Z’ - jZ’’) impedance data
Table 2
Best-performing PLS-DA models based on real (Z’), imaginary (Z’’), phase angle (Φ), absolute impedance (|Z|), and complex (Z’ -jZ’’) impedance data
Table 3
Best-performing SIMCA models based on real (Z’), imaginary (Z’’), phase angle (Φ), absolute impedance (IZI), and complex (Z’ - jZ’’) impedance data

Table 1 reveals that excellent classification results were obtained with PCA-DA applied to imaginary impedance data, not requiring row preprocessing, and using a linear discriminant function. Sensitivity, specificity, PPV, and NPV values obtained for predicting samples from the test set are 94, 94, 96, and 91%, respectively, with error rate as low as 6%. Although real impedance, phase angle and absolute impedance data also delivered strong outcomes, they incurred slightly more errors than those with the imaginary component and required different row and column preprocessing methods to effectively distinguish between classes. The imaginary impedance was also the best choice for producing a PLS-DA model with excellent prediction power, as seen in Table 2. However, in this case, spectra preprocessing by applying the logarithm is required, as well as data autoscaling. Prediction results are remarkable, mainly for the test set, where all figures of merit also present values at least equal to 90%.

The literature describes impedimetric sensors for COVID-19 diagnostics with performance comparable to those in this study, all relying on electrode modification to detect changes in the Rct of the redox probe [Fe(CN)6]3-/4-arising from interactions between the modifier and a viral component. For example, Huang et al.38 used a nanoporous gold electrode modified with single-stranded DNA oligonucleotides (ss-DNA) to capture SARS-CoV-2 RNA (30-min incubation), achieving 100% sensitivity and 93% specificity. Torres et al.39 functionalized a carbon electrode with angiotensin-converting enzyme 2 (ACE2) to bind the SARS-CoV-2 spike protein (4-min analysis), yielding 80.6% sensitivity and 89.0% specificity. Gevaerd et al.40 developed a gold-conjugate-modified carbon screen-printed electrode for detecting the SARS-CoV-2 N protein, reporting 83.0% sensitivity and 96.2% specificity. With a similar approach to this work, Khayamian et al.41 employed a label-free graphene monolayer on copper to measure the cytokine storm, observing Rct increases of about 65% and 138% in moderate and severe cases, respectively. Applying the same principle for COVID-19 diagnosis resulted in 92% sensitivity and 50% specificity compared to RT-PCR.

The findings in Tables 1 and 2 align with those presented in Figure 3, which illustrates that the variation in imaginary impedance is more pronounced between classes than that observed for the other impedance representations, being the most sensitive property to the adsorption of serum components on the electrode surface. The application of complex impedance in both PCA-DA and PLS-DA did not result in the development of a superior classification model when compared to the optimal DA model (PCA-DA applied to imaginary impedance data). This is due to the fact that the complex number introduces non-discriminating information into the models produced, in contrast to the model based on the most discriminating information (imaginary impedance). A comparison of the best DA models revealed that the application of PLS resulted in a reduction in the number of factors, from 12 PCs in PCA-DA to 8 LVs with PLS-DA. This is attributed to the distinct approach employed in constructing the LVs, which is oriented towards maximizing covariance with the class indices (0 and 1 in this case).22,42

Figure 4 displays the prediction results for training (circles) and the test (stars) sets using the PCA-DA model based on the imaginary impedance data. One can see the seven false negatives (orange circles with probability < 0.5) and four false positive results (blue circles with probability > 0.5) in the training stage, as well as the two false positives and three false negatives in the test stage. The model exhibits excellent discrimination between classes in both the training and testing stages. The chosen set of PCs effectively captured the information underlying the impedance spectra, which provides the discrimination and adequately describes most of the seropositive samples (orange circles and stars). These samples are highly similar with a probability > 0.9, indicating a strong association with COVID-19 infection. The remaining samples in this class exhibit dispersion, which is expected for a complex matrix such as blood serum during a pandemic. Seronegative samples (blue circles and stars) display more variability, reflecting the broader range of conditions within uninfected individuals, where non-infection does not necessarily mean that the patient is healthy.

Figure 4
Predictions obtained by the PCA-DA model for positive (orange) and negative (blue) in the training (circles) and test (stars) stages. The horizontal dotted line corresponds to the class boundary.

The SIMCA models (Table 3) performed approximately the same considering the different types of data and preprocessing methods, and this performance was below that obtained with the best DA models. These models refer to modeling the seropositive class and showed a satisfactory prediction power, with sensitivity and specificity close to 90% for the test samples in the best scenario. Logarithmic transformation of spectra significantly improved SIMCA’s predictive performance for impedance data as real numbers, likely due to reduced intra-class variability. However, the best model was the one based on complex impedance data, with the lowest prediction errors both when training and analyzing the test samples.

SIMCA defines independent multivariate spaces for each class, which may even overlap, thereby creating a risk of incorrect classification. The definition of these spaces depends on the information available in the training set. In this case, using all the information contained in the complex impedance data resulted in a synergistic effect that provided a slightly better model than models based on each impedance representation as real numbers. In contrast, an alternative phenomenon was observed when employing DA, whose discriminating property was the imaginary impedance rather than its association with the real impedance in the complex number. The objective of DA is to identify a mathematical function that maximizes the separation between the classes involved, based on the combination of the measured properties. Consequently, the utilization of complex numbers, which encompass non-discriminating information in this case, hindered the success of models employing this type of data.

Of the classification models obtained, the most successful ones yielded excellent classification results, although they did not achieve maximum sensitivity and selectivity. Therefore, the use of data fusion was considered to ascertain whether the predictive capacity of the models generated could be improved. The real and imaginary impedance components, as well as the phase angle and absolute impedance representations, were utilized in the low-level fusion, as they are deemed to be representations of complementary electrochemical phenomena. Consequently, the real and imaginary impedance spectra (or the phase angle and absolute impedance spectra) were concatenated to formulate a vector of 102 variables per sample. The resulting matrices (201 × 102) were then submitted to the Kennard-Stone algorithm to generate the training and test sets, as well as to produce and validate the models. The results obtained are included in the Supplementary Information section (Tables S16, S17 and S18). In summary, the fusion did not improve the predictive capacity of the models, with figures of merit similar to those obtained with the best models using the impedance representations separately.

Conclusions

The obtained results confirm the initial hypothesis of a systematic differentiated interaction between the constituents of blood serum and the surface of an unmodified disposable gold electrode for samples that tested positive for COVID-19 with the reference diagnostic method. Supervised pattern recognition methods were applied to the electrochemical impedance data to exploit such differential interaction as a source of information for screening patients infected with SARS-CoV-2. The analysis of the spectra of the imaginary component of the impedance using discriminant analysis based on PCA or PLS was found to be more efficient, producing classification error rates as low as 6%. The sensitivity of the imaginary component to the adsorption of serum constituents on the working electrode surface and to the blocking of the interface to the redox process of the electrochemical probe makes it the most discriminating information. SIMCA-based one-class modeling showed higher error rates in the training stage (around 20%), but acceptable performance for prediction of test samples (a 10% error rate) using the complex form of EIS. The potential of low-level data fusion was investigated, encompassing the concatenation of real and imaginary impedance, or phase angle and absolute impedance. However, no enhancement in performance was observed when compared with the superior models utilizing the impedance representations individually.

The proposed method allows for the differentiation of blood serum samples from patients with and without SARS-CoV-2, making it a viable option for COVID-19 screening, identifying positive cases so that they can be directed to confirmation by RT-PCR, thus reducing the impact of a high demand for the more expensive and more time-consuming method. The assay is straightforward and can be easily administered by non-specialists in ‘black box’ mode, i.e., the analyst only needs to connect the electrode and take the measurements before and after adding the serum sample. All the settings for obtaining the impedance spectrum of a sample, as well as those related to data processing and machine learning, can be set by default. This feature enables the analysis to be performed after very simple training, which could involve switching the equipment on and off, taking the measurements and reporting the result, requiring no specific understanding of the underlying processes. Finally, the approach is cost-effective and produces a negligible amount of waste.

Supplementary Information

Supplementary information is available free of charge at http://jbcs.sbq.org.br as PDF file.

Acknowledgments

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001 and by the Conselho Nacional de Desenvolvimento Científico e Tecnológico (grant No. 313738/2018-1). This study is supported by the Fundação de Apoio à Pesquisa do Estado da Paraíba (FAPESQ), grant No. 071/20; by Public Call No. 03 Produtividade em Pesquisa PROPESQ/PRPG/UFPB PVA13304-2020; and by the National Institute of Science and Technology on Molecular Sciences (INCT-CiMol), grant CNPq 406804/2022-2.

References

  • 1 Word Health Organization (WHO); WHO COVID-19 Dashboard, https://covid19.who.int/, accessed in February 2025.
    » https://covid19.who.int/
  • 2 Word Health Organization (WHO); WHO Director-General’s Opening Remarks at the Media Briefing - 5 May 2023, https://www.who.int/director-general/speecfies/detail/wfio-director-general-s-opening-remarks-at-the-media-briefing---5-may-2023, accessed in February 2025.
    » https://www.who.int/director-general/speecfies/detail/wfio-director-general-s-opening-remarks-at-the-media-briefing---5-may-2023
  • 3 Word Health Organization (WHO); Diagnostic Testing for SARS-CoV-2; https://www.who.int/publications/i/item/ diagnostic-testing-for-sars-cov-2, accessed in January 2025.
    » https://www.who.int/publications/i/item/ diagnostic-testing-for-sars-cov-2
  • 4 Lee, W.-I.; Subramanian, A.; Mueller, S.; Levon, K.; Nam, C.-Y.; Rafailovich, M. H.; ACS Appl. Nano Mater. 2022, 5, 5045. [Crossref]
    » Crossref
  • 5 Erdem, A.; Senturk, H.; Yildiz, E.; Maral, M.; Talanta 2022, 244, 123422. [Crossref]
    » Crossref
  • 6 Fabiani, L.; Saroglia, M.; Galatà, G.; De Santis, R.; Fillo, S.; Luca, V.; Faggioni, G.; D’Amore, N.; Regalbuto, E.; Salvatori, P.; Terova, G.; Moscone, D.; Lista, F.; Arduini, F.; Biosens. Bioelectron. 2021, 171, 112686. [Crossref]
    » Crossref
  • 7 Laleh, S.; Ibarlucea, B.; Stadtmüller, M.; Cuniberti, G.; Medina-Sánchez, M.; Biosens. Bioelectron. 2023, 236, 115362. [Crossref]
    » Crossref
  • 8 Ribeiro, J. A.; Jorge, P. A. S.; Sens. Actuators Rep. 2024, 8, 100205. [Crossref]
    » Crossref
  • 9 Uygun, H. D. E.; Uygun, Z. O. In Handbook of Nanobioelectrochemistry; Springer Nature Singapore: Singapore, 2023, p. 919-932. [Crossref]
    » Crossref
  • 10 Tesfalidet, S.; Geladi, P.; Shimizu, K.; Lindholm-Sethson, B.; Anal. Chim. Acta 2016, 914, 1. [Crossref]
    » Crossref
  • 11 Rodrigues, D. R.; Fragoso, W. D.; Lemos, S. G.; J. Chemom. 2020, 34, e3271. [Crossref]
    » Crossref
  • 12 EL Sharif, H. F.; Dennison, S. R.; Tully, M.; Crossley, S.; Mwangi, W.; Bailey, D.; Graham, S. P.; Reddy, S. M.; Anal. Chim. Acta 2022, 1206, 339777. [Crossref]
    » Crossref
  • 13 Castro, A. C. H.; Bezerra, Í. R. S.; Pascon, A. M.; da Silva, G. H.; Philot, E. A.; de Oliveira, V. L.; Mancini, R. S. N.; Schleder, G. R.; Castro, C. E.; de Carvalho, L. R. S.; Fernandes, B. H. V.; Cilli, E. M.; Sanches, P. R. S.; Santhiago, M.; Charlie-Silva, I.; Martinez, D. S. T.; Scott, A. L.; Alves, W. A.; Lima, R. S.; ACS Nano 2022, 16, 14239. [Crossref]
    » Crossref
  • 14 Kumar, R.; Bhondekar, A. P.; Kaur, R.; Vig, S.; Sharma, A.; Kapur, P.; Sens. Actuators, B 2012, 171-172, 1046. [Crossref]
    » Crossref
  • 15 de Morais, T. C. B.; Rodrigues, D. R.; Souto, U. T. C. P.; Lemos, S. G.; Food Chem. 2019, 273, 31. [Crossref]
    » Crossref
  • 16 Soleimani, S.; Arkan, E.; Farshadnia, T.; Mahnam, Z.; Jalili, F.; Goicoechea, H. C.; Jalalvand, A. R.; Sens. Bio-Sensing Res. 2020, 29, 100366. [Crossref]
    » Crossref
  • 17 Rodrigues, D. R.; Olivieri, A. C.; Fragoso, W. D.; Lemos, S. G.; Anal. Chim. Acta 2019, 1080, 1. [Crossref]
    » Crossref
  • 18 Rodrigues, D. R.; Fragoso, W. D.; Lemos, S. G.; Electrochim. Acta 2021, 397, 139312. [Crossref]
    » Crossref
  • 19 Jeong, H.-J.; Kim, K.; Kim, H. W.; Park, Y.; Sensors 2022, 22, 7969. [Crossref]
    » Crossref
  • 20 Messina, P.; Garcia, C.; Rambeau, J.; Darcourt, J.; Balland, R.; Carreel, B.; Cottance, M.; Gusarova, E.; Lafaurie-Janvore, J.; Lebedev, G.; Bozsak, F.; Barakat, A. I.; Payrastre, B.; Cognard, C.; J. Neurointerv. Surg. 2023, 15, 526. [Crossref]
    » Crossref
  • 21 Kennard, R. W.; Stone, L. A.; Technometrics 1969, 11, 137. [Crossref]
    » Crossref
  • 22 Matlab, R2023a; MathWorks, USA, 2023; Ballabio, D.; Consonni, V.; Anal. Methods 2013, 5, 3790. [Crossref]
    » Crossref
  • 23 Cuadros-Rodríguez, L.; Pérez-Castaño, E.; Ruiz-Samblás, C.; TrAC, Trends Anal. Chem. 2016, 80, 612. [Crossref]
    » Crossref
  • 24 Shi, D.; Yan, R.; Lv, L.; Jiang, H.; Lu, Y.; Sheng, J.; Xie, J.; Wu, W.; Xia, J.; Xu, K.; Gu, S.; Chen, Y.; Huang, C.; Guo, J.; Du, Y.; Li, L.; Metabolism 2021, 118, 154739. [Crossref]
    » Crossref
  • 25 Tristán, A. I.; Jiménez-Luna, C.; Abreu, A. C.; Arrabal-Campos, F. M.; Salmerón, A. del M.; Rodríguez, F. I.; Maresca, M. Á. R.; García, A. B.; Melguizo, C.; Prados, J.; Fernández, I.; Sci. Rep. 2024, 14, 23713. [Crossref]
    » Crossref
  • 26 Pushalkar, S.; Wu, S.; Maity, S.; Pressler, M.; Rendleman, J.; Vitrinel, B.; Jeffery, L.; Abdelhadi, R.; Chen, M.; Ross, T.; Carlock, M.; Choi, H.; Vogel, C.; Sci. Rep. 2024, 14, 4479. [Crossref]
    » Crossref
  • 27 Kistenev, Y. V.; Vrazhnov, D. A.; Shnaider, E. E.; Zuhayri, H.; Heliyon 2022, 8, e11185. [Crossref]
    » Crossref
  • 28 Kukar, M.; Guncar, G.; Vovko, T.; Podnar, S.; Cernelc, P.; Brvar, M.; Zalaznik, M.; Notar, M.; Moskon, S.; Notar, M.; Sci. Rep. 2021, 11, 10738. [Crossref]
    » Crossref
  • 29 Babaei Rikan, S.; Sorayaie Azar, A.; Ghafari, A.; Bagherzadeh Mohasefi, J.; Pirnejad, H.; Biomed. Signal Process. Control 2022, 72, 103263. [Crossref]
    » Crossref
  • 30 Nuzzo, R. G.; Dubois, L. H.; Allara, D. L.; J. Am. Chem. Soc. 1990, 112, 558. [Crossref]
    » Crossref
  • 31 Nuzzo, R. G.; Allara, D. L.; J. Am. Chem. Soc. 1983, 105, 4481. [Crossref]
    » Crossref
  • 32 Ulman, A.; Chem. Rev. 1996, 96, 1533. [Crossref]
    » Crossref
  • 33 Zakaria, N. D.; Omar, M. H.; Ahmad Kamal, N. N.; Abdul Razak, K.; Sönmez, T.; Balakrishnan, V.; Hamzah, H. H.; ACS Omega 2021, 6, 24419. [Crossref]
    » Crossref
  • 34 Lee, S.; Kim, W. J.; Chung, M.; Analyst 2021, 146, 5236. [Crossref]
    » Crossref
  • 35 Lazanas, A. Ch.; Prodromidis, M. I.; ACS Meas. Sci. Au 2023, 3, 162. [Crossref]
    » Crossref
  • 36 Cheng, T.-J.; Hsiao, H.-Y.; Tsai, P.-C.; Chen, R. L. C.; Chemosensors 2023, 11, 343. [Crossref]
    » Crossref
  • 37 Oliveira, L. G.; Lemos, S. G.; Fragoso, W. D.; Electrochim. Acta 2020, 354, 136591. [Crossref]
    » Crossref
  • 38 Huang, Z.; Wang, W.; Wang, Y.; Wang, H.; Pang, Y.; Yuan, Q.; Tan, J.; Tan, W.; Anal. Chem. 2023, 95, 16668. [Crossref]
    » Crossref
  • 39 Torres, M. D. T.; de Lima, L. F.; Ferreira, A. L.; de Araujo, W. R.; Callahan, P.; Dávila, A.; Abella, B. S.; de la Fuente-Nunez, C.; iScience 2022, 25, 104055. [Crossref]
    » Crossref
  • 40 Gevaerd, A.; Carneiro, E. A.; Gogola, J. L.; Nicollete, D. R. P.; Santiago, E. B.; Riedi, H. P.; Timm, A.; Predebon, J. V.; Hartmann, L. F.; Ribeiro, V. H. A.; Rochitti, C.; Marques, G. L.; Loesch, M. M. O. N.; de Almeida, B. M. M.; Rogal-Junior, S.; Figueredo, M. V. M.; Sensors 2024, 24, 3772. [Crossref]
    » Crossref
  • 41 Khayamian, M. A.; Parizi, M. S.; Ghaderinia, M.; Abadijoo, H.; Vanaei, S.; Simaee, H.; Abdolhosseini, S.; Shalileh, S.; Faramarzpour, M.; Naeini, V. F.; Hoseinpour, P.; Shojaeian, F.; Abbasvandi, F.; Abdolahad, M.; RSCAdv. 2021, 11, 34503. [Crossref]
    » Crossref
  • 42 Silva, A. C.; Paz, J. E. M.; Lira Pontes, L. F. B.; Lemos, S. G.; Pontes, M. J. C.; Electrochim. Acta 2013, 111, 160. [Crossref]
    » Crossref

Edited by

  • Editor handled this article:
    Rodrigo A. A. Muñoz (Associate)

Publication Dates

  • Publication in this collection
    21 Mar 2025
  • Date of issue
    2025

History

  • Received
    22 Nov 2024
  • Accepted
    14 Feb 2025
location_on
Sociedade Brasileira de Química Instituto de Química - UNICAMP, Caixa Postal 6154, 13083-970 Campinas SP - Brazil, Tel./FAX.: +55 19 3521-3151 - São Paulo - SP - Brazil
E-mail: office@jbcs.sbq.org.br
rss_feed Stay informed of issues for this journal through your RSS reader
Accessibility / Report Error