- Research
- Published:
The use of wastewater surveillance to estimate SARS-CoV-2 fecal viral shedding pattern and identify time periods with intensified transmission
樱花视频 volume听25, Article听number:听1108 (2025)
Abstract
Background
Wastewater-based surveillance is an important tool for monitoring the COVID-19 pandemic. However, it remains challenging to translate wastewater SARS-CoV-2 viral load to infection number, due to unclear shedding patterns in wastewater and potential differences between variants.
Objectives
We utilized comprehensive wastewater surveillance data and estimates of infection prevalence (i.e., the source of the viral shedding) available for New York City (NYC) to characterize SARS-CoV-2 fecal shedding pattern over multiple COVID-19 waves.
Methods
We collected SARS-CoV-2 viral wastewater measurements in NYC during August 31, 2020 鈥 August 29, 2023 (N鈥=鈥3794 samples). Combining with estimates of infection prevalence (number of infectious individuals including those not detected as cases), we estimated the time-lag, duration, and per-infection fecal shedding rate for the ancestral/Iota, Delta, and Omicron variants, separately. We also developed a procedure to identify occasions with intensified transmission.
Results
Models suggested fecal viral shedding likely starts around the same time as and lasts slightly longer than respiratory tract shedding. Estimated fecal viral shedding rate was highest during the ancestral/Iota variant wave, at 1.44 (95% CI: 1.35 鈥 1.53) billion RNA copies in wastewater per day per infection (measured by RT-qPCR), and decreased by鈥塧round 20% and 50-60% during the Delta wave and Omicron period, respectively. We identified around 200 occasions during which the wastewater SARS-CoV-2 viral load exceeded the expected level in any of the city's听14 sewersheds. These anomalies disproportionally occurred during late January, late April鈥攅arly May, early August, and from late-November to late-December, with frequencies exceeding the expectation assuming random occurrence (P鈥<鈥0.05; bootstrapping test).
Discussion
These estimates may be useful in understanding changes in underlying infection rate and help quantify changes in COVID-19 transmission and severity over time. We have also demonstrated that wastewater surveillance data can support the identification of time periods with potentially intensified transmission.
Background
Since the early phase of the COVID-19 pandemic, studies have reported that wastewater SARS-CoV-2 viral loads often closely track or lead case and/or hospitalization trajectories and, as such, can serve as a cost-effective surveillance tool for monitoring the COVID-19 pandemic [1,2,3,4,5]. Thus, wastewater-based surveillance systems have been built worldwide on local and national scales. With decreasing clinical testing and genomic sequencing [6, 7], there has been increased interest in wastewater surveillance, given听the results are generated independently of clinical testing practice.
Though there are advantages of SARS-CoV-2 wastewater surveillance, a large US national survey of public health agencies completed in 2022 noted the results were often deemed supplementary to surveillance involving clinical laboratory tests [8]. One of the hurdles is that while the trends could indicate changes in SARS-CoV-2 community circulation, it remains challenging to directly translate wastewater SARS-CoV-2 viral loads to a specific number of infections in the population, due to the unclear fecal viral shedding rate (after accounting for the recovery rate of virus genomes) in wastewater samples. In addition, with the fast emergence and turnover of new SARS-CoV-2 variants, it is unclear how fecal shedding of the virus may have altered over time by variant. Further, linking wastewater viral load to clinical health metrics has become more challenging since鈥塺ound 2022 with the decrease in clinical testing and changing reporting requirements. For instance, previous studies [9,10,11] examining the association between wastewater viral load and clinical health metrics (e.g., cases and/or hospitalizations) have reported decreased strength of correlation, likely to due to changes in test- and healthcare-seeking behavior over time. To address these questions and challenges, we utilize comprehensive wastewater surveillance data and estimates of infection prevalence (versus clinical health metrics in previous work) available for New York City (NYC) to characterize SARS-CoV-2 fecal shedding over multiple COVID-19 pandemic and epidemic waves.
NYC experienced the earliest pandemic wave in the United States (US), and shortly after the initial wave, established a wastewater surveillance program that covers all of its 14 sewersheds which serve over 8 million residents [2, 12]. Since August 31, 2020, the program has continuously measured SARS-CoV-2 viral load weekly. Independently, we have developed and used a comprehensive model-inference system 鈥 calibrated to case, Emergency Department (ED) visit, and mortality data 鈥 to reconstruct the underlying transmission dynamics and estimate key epidemiological characteristics [13, 14]. In particular, the model-inference system estimates the number of actively infectious individuals including those not detected as cases (i.e., infection prevalence) in each of the city鈥檚 42 neighborhoods during each week since March 1, 2020 [13, 14]. Combining the wastewater SARS-CoV-2 viral load data and infection prevalence estimates over a 3-year period (i.e., August 31, 2020 鈥 August 29, 2023), we are able to characterize the viral shedding pattern (i.e., time-lag, duration, and per-infection shedding rate) for the ancestral/Iota, Delta, and Omicron variants, separately. Importantly, our estimates are based on all active听infections regardless of symptoms and test-seeking behavior (i.e., not subject to changes in medical seeking propensity and clinical testing practice) and thus better reflect SARS-CoV-2 variant-specific fecal shedding pattern. Further, linking wastewater viral load to infection prevalence would provide a more complete picture of the underlying transmission dynamics and help to better anticipate future epidemic trajectory, as asymptomatic/mild infections can also transmit SARS-CoV-2 [15]. In addition, we are also able to identify time periods with greater transmission.
Methods
SARS-CoV-2 wastewater surveillance data
The SARS-CoV-2 wastewater surveillance program in NYC started on August 31, 2020 [2, 12]. Wastewater samples were taken at each of the city鈥檚 14 wastewater treatment plants, usually twice per week on Sundays and Tuesdays (N鈥=鈥3794 samples; see variations and details in Table S1). Specifically, 24-h flow-weighted composite influent wastewater samples were collected from the influent of each wastewater treatment plant; the use of composite samples reduces biases due to sampling time during the day when people may be at work, sleeping, etc. Each composite sample consisted of eight grab samples collected every three hours beginning at 7:00 AM on the sampling date. The volume of each grab sample added to the composite was determined based on the flowrate during the associated 3-h collection period. Samples were transported on ice and stored at 4掳C until processing.
SARS-CoV-2 RNA concentration was measured using quantitative reverse transcription polymerase chain reaction (RT-qPCR) assays during August 31, 2020, through April 11, 2023, and reverse transcription digital PCR (RT-dPCR) assays from November 1, 2022, through August 29, 2023. For RT-qPCR, the samples were tested on the ABI 7500 system, with the forward primer (5鈥- GACCCCAAAATCAGCGAAAT-3鈥), reverse primer (5鈥- TCTGGTTACTGCCAGTTGAATCTG-3鈥), and probe (5鈥-FAM-ACCCCGCATTACGTTTGGTGGACC-BHQ-1鈥3'). The limit of detection and limit of quantification for the RT-qPCR assay were 180 copies/L of wastewater sample and 590 copies/L of wastewater sample, respectively. For RT-dPCR, samples were tested on the QIAcuity Eight Platform System, using the GT-Digital SARS-CoV-2 Wastewater Surveillance Assay For QIAcuity鈩 kit (Catalog #100,179). The limit of detection for RT-dPCR was 0.225 copies/渭L, or anything below 3 partitions. All measurements adjusted for sewershed-specific flow rate and service population size. Specifically, per-capita SARS-CoV-2 viral load (RNA copies per day per population) was computed as the viral concentration measure multiplied by the daily sewage flow rate and then divided by the service population听size.
For weeks after April 11, 2023, when the samples were measured using RT-dPCR alone, we converted the RT-dPCR measurements to RT-qPCR equivalents, to allow characterization of SARS-CoV-2 viral shedding during the entire Omicron period. Specifically, we first computed the conversion ratio using measurements from November 1, 2022, through April 11, 2023, when both assays were conducted, simply as the mean of all RT-qPCR measurements divided by the mean of all RT-dPCR measurements, during these weeks. That is, the conversion ratio \(p={\sum }_{i=1}^{n}{VL}_{\text{i},\text{ qPCR}}/{\sum }_{i=1}^{n}{VL}_{\text{i},\text{dPCR}}\), where and n is the number of samples measured by both assays and \({VL}_{\text{i},\text{ qPCR}}\) and \({VL}_{\text{i},\text{dPCR}}\) (i鈥=鈥1, 2, 鈥, n) are the viral load measurements by the two assays for each sample. We then multiplied the RT-dPCR measurements by the conversion ratio to obtain the converted RT-qPCR equivalents. As an alternative, we stratified the data by sewershed and performed the conversion using sewershed-specific conversion ratios (see Sensitivity Analysis). In addition, the RT-qPCR and RT-dPCR measures differed substantially (by a factor of 16.7 based on the aforementioned overlapping measurements), likely due to difference in methodology [12, 16]. To facilitate comparison with studies primarily using RT-dPCR, we also converted all RT-qPCR measurements to RT-dPCR equivalents when reporting the viral shedding rates.
SARS-CoV-2 infection prevalence estimates
Estimated SARS-CoV-2 infection prevalence came from a model-inference system [13, 14, 17], independent of the wastewater surveillance data. Briefly, the model-inference system fit a neighborhood-level Susceptible-Exposed-Infectious-(re)Susceptible-Vaccination (SEIRSV) model to age-grouped, neighborhood-specific COVID-19 case, ED visit, and mortality data, using the ensemble adjustment Kalman filter [18], a Bayesian data assimilation method. The system simultaneously accounted for concurrent nonpharmaceutical interventions, vaccinations, under-detection of infection, and seasonal changes. As reported previously, this model-inference system has been validated using serology data [13] and shown to generate accurate COVID-19 forecasts [19] as well as estimates of key epidemiological variables and parameters (e.g., transmission rate and infection-fatality risk) consistent with other studies [13, 14].
Using the model-inference system, we estimated the number of infectious individuals 鈥 i.e., anyone who can actively transmit SARS-CoV-2 and infect others regardless of symptoms and test-seeking behaviors 鈥 present in the population, during each week of the study period [17]. That is, similar to the wastewater SARS-CoV-2 viral loads measuring the total population fecal shedding regardless of clinical testing, the infection prevalence estimates included all individuals actively transmitting SARS-CoV-2 (primarily via shedding from the respiratory tracts) regardless of whether they were detected as cases. The infection prevalence estimates are United Hospital Fund neighborhood- [20] and age group specific, and available for each week starting March 1, 2020 (the pandemic onset in NYC) to the week starting August 27, 2023. To match with the sewershed-level wastewater SARS-CoV-2 viral load data, we first mapped each neighborhood (42 in total vs. 14 sewersheds) to the corresponding sewershed based on geolocation; if a neighborhood overlapped multiple sewersheds, we assigned it to the one with the maximal overlap. For each sewershed and week, we then aggregated all estimated infectious individuals from all related neighborhoods.
Estimating the fecal viral shedding time-lag, duration and rate
To analyze the fecal viral shedding pattern by variant, we defined three time periods based on data availability and the predominant circulating variant [21] (i.e., to be more variant-specific): i) the 2nd wave (predominantly the ancestral and Iota variants), from August 31, 2020 (i.e., the first day of wastewater surveillance) through June 26, 2021; ii) the Delta wave (predominantly the Delta variant), from June 27, 2021 (i.e., the first week the share of Delta exceeding 50% among the sequenced specimens) through December 4, 2021; and iii) the Omicron period (predominantly Omicron subvariants and included multiple Omicron-subvariant waves), from December 5, 2021 (i.e., the first week the share of Omicron BA.1 exceeding 25% among the sequenced samples; note that we used a lower threshold here given the milder severity of Omicron BA.1 [22] and thus likely fewer infections detected and sequenced) though August 29, 2023 (i.e., the last wastewater sample during the study period).
SARS-CoV-2 viral load in wastewater represents the pooled fecal shedding of the virus by the population, whereas the infection prevalence represents the proportion of population actively infectious at a given time (i.e., the source of the viral shedding after a potential time-lag). Thus, to estimate the viral shedding rate for each variant (per the time period defined above), we used a linear regression model, accounting for circulating variants and spatial variations by sewershed, per Eq.听1:
where, \({VL}_{t\in \{t\}+\tau ,{Sewershed}_{s}, {Period}_{p}}\) is the wastewater SARS-CoV-2 viral load measured during time-window \(\left\{t\right\}\), adjusted by a time-lag or lead of \(\tau\) days (see details below); the 2nd and 3rd subscripts denote the corresponding sewershed and time-period, respectively. \({\varepsilon }_{t}\) is the error term. \({Sewershed}_{s}\) is an indicator variable for one of the 14 sewersheds in NYC (see the names of sewersheds in Fig.听1 and Table S2) to account for spatial variation (here, in essence, to allow different intercepts for different sewersheds). \({Period}_{p}\) is an indicator variable for one of the three epidemic time periods (i.e., 2nd wave, Delta wave, or Omicron period) as defined above for the corresponding week, included as a proxy for circulating variants during week-t. \({I}_{t}\) is the infection prevalence estimated for week-t. The interaction term \({I}_{t}{Period}_{p}\) is included to account for potential nonadditive interaction of the two variables (here, in essence, to allow different viral shedding rates by variant). Per Eq.听1, we computed the estimates of fecal viral shedding rate for each variant using the coefficients \({\beta }_{I}\) and \({\beta }_{IP}{\prime}s\). For example, using the 2nd wave as the reference period (i.e., \({Period}_{1}=\) 2nd wave, so that \({\beta }_{IP,1}=0)\), the fecal viral shedding rate is \({\beta }_{I}\) during the 2nd wave, \({\beta }_{I}+{\beta }_{IP,2}\) during the Delta wave (i.e., \({Period}_{2}=\)听Delta wave), and \({\beta }_{I}+{\beta }_{IP,3}\) during the Omicron period (i.e., \({Period}_{3}=\)听Omicron period).
Trends in wastewater SARS-CoV-2 viral load in the 14 sewersheds in NYC. The map in (A) shows 14 sewersheds (delineated by color) and 42 United Hospital Fund neighborhoods (delineated by lines). Dots show the per-capita SARS-CoV-2 viral load in each of the 14 sewersheds (right y-axis, in million copies per day per population by RT-qPCR; color coded per the legend) during the 2nd wave (B), Delta wave (C), and Omicron period (D). For comparison, we overlay the citywide estimates of infection prevalence (left y-axis; blue line鈥=鈥塵edian; darker blue area鈥=鈥50% CI and lighter blue area鈥=鈥95% CI)
Given the different surveillance schedules and likely difference between fecal and respiratory viral shedding, we tested three sliding time-windows (i.e., \(\left\{t\right\}\) in Eq.听1) for matching the wastewater measurements (twice per week, representing fecal shedding) with the infection prevalence estimates (weekly estimates, representing respiratory shedding); specifically, we averaged 2, 3, or 4 consecutive wastewater samples, corresponding to roughly a 1-, 1.5-, or 2-week window, respectively, depending on the wastewater sampling schedule and time-adjustment used. For each time-window \(\left\{t\right\}\), to identify a proper time-adjustment (\(\tau\) in Eq.听1), we tested five settings to capture the time difference from becoming infectious via respiratory shedding to fecal shedding per the population-level surveillance data:
-
i)
a 6- to 7-day lead, i.e., the wastewater samples included in time-window \(\left\{t\right\}\)听started from the 1st sample taken the week before the infection prevalence estimate; note the 1st sample was taken on Sunday (corresponding to a maximum of 7-day lead) or Monday (corresponding to a maximum of 6-day lead);
-
ii)
a 4- to 5-day lead, i.e., the wastewater samples included in time-window \(\left\{t\right\}\)听started from the 2nd sample taken the week before the infection prevalence estimate; note the 2nd sample was taken on Tuesday (corresponding to a maximum of 5-day lead) or Wednesday (corresponding to a maximum of 4-day lead);
-
iii)
concurrent (no time-difference, \(\tau\)=0), i.e., the wastewater samples included in time-window \(\left\{t\right\}\)听started from the 1st sample taken the week of the infection prevalence estimate;
-
iv)
a 2- to 3-day lag, i.e., the wastewater samples included in time-window \(\left\{t\right\}\)听started from the 2nd sample taken the week of the infection prevalence estimate (a Tuesday sample corresponded to a 2-day lag and a Wednesday sample corresponded to a 3-day lag); and
-
v)
a 7- to 8-day lag, i.e., the wastewater samples included in time-window \(\left\{t\right\}\)听started from the 1st sample taken the week after the infection prevalence estimate (a Sunday sample corresponded to a 7-day lag and a Monday sample corresponded to a 8-day lag).
In addition, we performed variant/period-specific analyses for each of the three time-periods defined above, using a similar model form as Eq.听1 but without the terms related to time-period (\({Period}_{p}\)). Since the Omicron period included multiple Omicron-subvariant waves, we also performed stratified analyses for the Omicron BA.1 wave (December 5, 2021, through March 4, 2022, i.e., the last week the share of Omicron BA.1 exceeding 50%) and for weeks from March 5, 2022 onwards, separately.
Identifying timings with higher-than-expected transmission
Visual inspection of the wastewater data showed there were occasional spikes in SARS-CoV-2 viral load, potentially due to intensified transmission. Due to the temporal dynamics and sampling noise, it is challenging to distinguish such potential instances (i.e., a true signal) based on the wastewater data alone. Thus, here we used the infection prevalence estimates, which had accounted for the main underlying transmission factors, to construct the expected SARS-CoV-2 viral load for comparison. Specifically, we first computed the daily infection prevalence using the weekly estimates with a spline smoothing function, and then used those as inputs in Eq.听1 to compute the expected daily SARS-CoV-2 viral load (median and 90% confidence intervals [CI]). Given the large variance in both the infection prevalence estimates and SARS-CoV-2 viral load data, we deemed a wastewater measurement higher than expected, if it was higher than the 95th percentile (i.e., the upper bound of the 90% CI) of the expected SARS-CoV-2 viral load.
To examine the timing with higher-than-expected SARS-CoV-2 viral load, we grouped the identified anomaly dates into 10-day bins based on calendar time, i.e., the 1st (early), 2nd (mid), and last (late) 10 days of each month; for example, January 1 of 2021, January 5 of 2022, and January 10 of 2023 would all be grouped as 鈥渆arly-January鈥. This allows recurrent and/or seasonal events to be grouped in the same or nearby bins. To test whether the identified anomalies occurred at random (e.g., due to noise in the data), we further performed a bootstrap test with 5000 random samples. For each bootstrapping set, we randomly sampled nanomaly (i.e., the number of identified anomalies) dates from the wastewater measurements (N鈥=鈥3794), and then grouped the dates into the same 10-day bins as done for the identified anomalies. We then pooled the 5000 sets together to construct the distribution of each timing. For example, for early-January (the first 10-day calendar bin), with n1, n2, 鈥, and n5000 of the dates falling in that bin for the 5000 sets, the likelihood of having k (k鈥=鈥0, 鈥, nanomaly, i.e., from none to all) anomalies during early-January would be:
and the likelihood of having k or more anomalies during early-January would be:
Sensitivity analyses
In a first sensitivity analysis, we only included SARS-CoV-2 viral load measured by RT-qPCR (i.e., August 31, 2020鈥 April 11, 2023), to examine if the viral shedding rate estimates were affected by converting RT-dPCR measurements to RT-qPCR equivalents due to changes in testing assays. That is, for this sensitivity analysis, we restricted the study period to August 31, 2020鈥 April 11, 2023, and used the same analysis methods described for the main analysis. In a second sensitivity analysis, we included all SARS-CoV-2 viral load measurements but used the sewershed-specific conversion ratios instead of the citywide conversion ratio for all sewersheds. That is, the conversion ratio (p) was computed for each sewershed using samples taken at the corresponding sewershed alone (vs. pooling all samples from all sewersheds in the main analysis).
Results
General trends in measured wastewater SARS-CoV-2 viral load and estimated infection prevalence
During the 3-year study period (August 31, 2020 鈥 August 29, 2023), trends in wastewater SARS-CoV-2 viral load were generally consistent with the trends in estimated infection prevalence (Fig.听1). Across the 14 NYC sewersheds (Fig.听1A), wastewater SARS-CoV-2 viral load tended to rise and fall around the same time (Fig.听1B-D and Figs. S1-3), indicating epidemic waves were highly synchronized across the city. However, the magnitudes of wastewater SARS-CoV-2 viral load and infection prevalence estimates both varied substantially over time and across sewersheds and may not scale consistently. For example, even though certain sewersheds tended to detect higher SARS-CoV-2 viral loads than others, the rankings changed across different waves (see Fig. S1-3, ranked by average viral load). Similar spatial heterogeneity was apparent in the estimated infection prevalence and the discrepancies between wastewater SARS-CoV-2 viral load and estimated infection prevalence appeared larger during the 2nd wave (Fig. S1). Such spatial heterogeneity is not unexpected, since several factors such as RNA degradation [23] and dilution [23], and the contribution of infected animals [24] could all vary by sewershed, and ultimately affect wastewater measurements. In addition, uncertainty in the infection prevalence estimate could also vary by sewershed (e.g., larger uncertainty for those with smaller population size; see, e.g., the wider uncertainty bounds for Oakwood Beach sewershed in Fig. S1).
Estimated fecal viral shedding patterns
Using the wastewater SARS-CoV-2 viral load data and infection prevalence estimates (i.e., source of fecal viral shedding), we examined fecal viral shedding patterns over the entire study period or stratified by variant/time-period, separately. The estimates from the two analyses are generally consistent; see similar estimated shedding rate, time lag, and number of consecutive samples to aggregate for the best-fit models in Table听1. Among the 15 combinations of fecal viral shedding time-differences and durations tested, the main model (including all waves) identified concurrent infection prevalence estimates (i.e., no time-difference between becoming infectious via respiratory shedding and fecal shedding) and SARS-CoV-2 viral load aggregated over 3 wastewater samples (2 during the same week and 1 in the beginning of the following week, i.e., a 8- to 9- day-time-interval) as the best setting (highest adjusted R-squared; Fig.听2A). Figure听2B shows the model fit and Table S2 shows the estimated coefficients for the best fit model. Using a 4鈥5-day-lead and aggregation over 4 wastewater samples (i.e., one sample 4鈥5 days before, two during, and one 1鈥2 days after the infection prevalence estimate) led to the second-best model fit (Fig.听2A, 2nd dark bar), and was the best setting for the Delta wave and weeks after the BA.1 wave in the stratified analysis (Table听1). Model fit degraded quickly with changing time-differences (both leads and lags), when only 2 (roughly a 1-week duration) or 3 (roughly a 1.5-week duration) wastewater samples were included.
Model fit. A shows model performance based on the adjusted R-squared (higher number represents better performance) for different settings of time from becoming infectious to fecal viral shedding and time window of the wastewater samples are aggregated. The asterisk indicates the setting with the highest adjusted R-squared (i.e., best-fit model). B shows the model fit compared to the data
Estimated fecal viral shedding rate was highest for infections during the 2nd wave (mostly due to the ancestral and Iota variants), at 1.44 (95% CI: 1.35 鈥 1.53) billion RNA copies by RT-qPCR in wastewater per day per infectious person [or 24 (95% CI: 22.49鈥25.51) billion RNA copies per RT-dPCR conversion; see Methods]. The estimated rate decreased by鈥墌20% during the subsequent Delta wave and by 50鈥60% during the Omicron period (Table听1). Importantly, we note the lower estimates for Delta and Omicron may in part reflect reduced shedding among vaccinees and recoverees, in addition to variant-specific variations.
Timings with higher-than-expected transmission
The infection prevalence estimates have accounted for the general transmission factors (here, population-level mobility, vaccinations, variant-specific properties, and seasonal risk of infection; see Methods), but may have not fully accounted for activities such as increased gatherings during certain time-periods that might increase transmission. In contrast, wastewater SARS-CoV-2 viral load is a composite measure of all transmission events. Thus, comparison of these two quantities could support identification of such events. Following a procedure designed per this mechanism (see Methods), we identified 198 occasions where wastewater SARS-CoV-2 viral loads exceeded the expected levels in any of the 14 sewersheds (see Fig.听3A for identified anomalies for Newtown Creek, the sewershed with the largest service population). These anomalies disproportionally occurred during late January, late April鈥攅arly May, early August, and mid-November to late-December (Fig.听3B), with frequencies exceeding the expectation assuming random occurrence. Among the 5000 bootstrapping sets, none had as many or more anomalies as observed in early August or late November (P鈥=鈥0) and less than 5% had as many or more anomalies as observed in late January, late April, early May, late November, and late December (P鈥<鈥0.05 for all these calendar bins; Table S3).
Identified time-periods with intensified transmission in any of 14 NYC sewersheds. A shows an example of the measured (dots) and expected wastewater SARS-CoV-2 viral load (blue line鈥=鈥塵edian; darker blue area鈥=鈥50% CI and lighter blue area鈥=鈥95% CI), and identified anomalies with SARS-CoV-2 viral load exceeding the expected (red labels). B shows the distribution of all identified anomalies. Asterisks indicate time-periods that exceeded the expected wastewater SARS-CoV-2 viral load with a frequency higher than chance assuming random occurrence per a bootstrapping test (* for P鈥<鈥0.1 and ** for P鈥<鈥0.05). Spatial distribution of the anomalies is shown in Fig. S4
Sensitivity analyses
Results from the two sensitivity analyses are consistent with the main analysis. In the 1st sensitivity analysis (i.e., using SARS-CoV-2 viral load measured by RT-qPCR alone, for a shorter study period from 8/31/20 to 4/11/23), similar fecal viral shedding rates were estimated (Table S4). The 2nd sensitivity analysis (using sewershed-specific conversion ratios to convert the RT-dPCR measurements after 4/11/23, same study period as the main analysis) estimated the same fecal viral shedding rates as the main analysis, and identified three additional anomalies (i.e., 1 in late-January, 1 in mid-August, and 1 in early-July).
Discussion
Wastewater surveillance can be a valuable tool for monitoring SARS-CoV-2 circulation in the population. To further develop understanding of wastewater surveillance data, we have combined independent model-inference estimates of infection prevalence to characterize fecal viral shedding patterns for multiple major SARS-CoV-2 variants. Using NYC as an example, we have also demonstrated that these data and estimates can support the identification of time periods with potentially intensified transmission.
Importantly, here we examined how wastewater SARS-CoV-2 viral shedding is related to estimated infection prevalence, rather than health outcomes as in previous studies. This choice could lead to certain apparent differences but has several advantages. First, previous studies have reported detection of SARS-CoV-2 in wastewater (e.g., an increase in viral load, or the presence of a new variant) several days ahead of the detection of cases, hospitalizations, or deaths, due to the delay in health outcomes [25,26,27]. Here, infection prevalence is a proxy of respiratory tract shedding, which could precede fecal viral shedding. Indeed, we found wastewater SARS-CoV-2 viral loads measured round 1.5 week of the infection prevalence estimate afforded the best model fit (Table听1). This finding suggests that fecal viral shedding likely starts around the same time an individual becomes infectious and lasts slightly longer than the shedding from respiratory tract. Consistent with our finding, studies have shown that fecal SARS-CoV-2 RNA was detectable in patients within the first week of COVID-19 diagnosis and could last longer than respiratory shedding [23, 28].
Second, case-, hospitalization-, or death-to-wastewater-viral-load ratio could decrease with increased vaccinations/reinfections and circulation of milder variants (e.g., Omicron) due to reduced severity or testing, and such reductions have been reported [26, 29]. In contrast, as our estimates included all infections regardless of severity or testing, the infection-to-wastewater-viral-load ratio (roughly, the inverse of estimated per-infection fecal viral shedding rate; Table听1) is relatively stable during each variant wave. For example, the wave-stratified analysis estimated similar fecal viral shedding rates for the BA.1 wave and weeks after BA.1 (Table听1). Importantly, using the infection prevalence estimates, we are able to quantify the fecal viral shedding rate for each major SARS-CoV-2 variant/time-period (Table听1). These estimates could be used to help account for changes in underlying infection rate and help examine changes in COVID-19 severity during this study period, when detailed infection estimates are not available; for instance, converting wastewater SARS-CoV-2 viral loads to infection prevalence per Table听1听to help examine changes in hospitalization rate and infection-fatality risk. Such wastewater-viral-load- and infection-听based estimates may be more accurate than case-based measures, which are subject to test-seeking biases.
Third, previous studies have measured viral loads in clinical samples from the respiratory tract. Based on the reported cycle threshold (CT) values, the respiratory tract viral load was higher in Delta and Omicron infections than the ancestral variant [30,31,32,33,34,35], consistent with the higher infectiousness of these variants of concern. In contrast, fecal viral shedding is not a main mode of transmission [36, 37], and here using variant circulation time-period as a proxy, we estimate that the fecal viral shedding rate was the highest for the ancestral/Iota variants, followed by Delta (~20% lower), and then Omicron (~50鈥60% lower; Table听1). Early studies of ancestral SARS-CoV-2 infections found that patients with diarrhea shed more viruses than patients without diarrhea (see, e.g., a review in ref [23]), suggesting fecal viral shedding may be associated with diarrhea. In addition, studies found that vaccinations reduced the number of diarrhea episodes [38], and that rates of diarrhea were highest among patients infected with the ancestral SARS-CoV-2, followed by patients infected with Delta and then Omicron [39, 40]. Our estimates are consistent with the fecal viral shedding studies [23, 38,39,40], and support a difference in viral load between SARS-CoV-2 fecal shedding and respiratory tract shedding, in addition to the timing difference noted above.
In addition to characterizing SARS-CoV-2 fecal viral shedding pattern, we are also able to identify certain time-periods with intensified transmission. In NYC, analysis based on calendar timing showed likely intensified transmission during late-November through December (Fig.听3B). Increased transmission also occurred during early August and late January. It is possible that other factors such as travel, holidays, or specific COVID-19 sub-variants could help explain these periods of intensified transmission, but further investigation is needed to determine their impact.
Lastly, we note several limitations. First, given the biweekly sampling dates for wastewater and weekly estimates for infection prevalence, we were unable to test finer time-differences and durations when examining SARS-CoV-2 fecal shedding pattern. Second, the estimates here were based on population data and thus represent an average of all individuals undergoing different disease stages in the population. As such, the estimated fecal shedding duration may be shorter than that reported in studies based on individual patient data (e.g., days or weeks after respiratory tract samples became negative [23]). Third, our infection prevalence estimates have accounted for the main transmission factors, through the information encapsulated in the COVID-19 case, ED visit, and mortality data used for model estimation. Thus, the expected SARS-CoV-2 viral load constructed using these estimates and in turn the identified anomalies are both conservative estimates and may have missed additional anomalies. In addition, wastewater collected from sewersheds may represent individuals who are residents of NYC as well as outside NYC, while infection prevalence estimates are based on NYC residents only.
Conclusion
In summary, we have characterized the fecal viral shedding pattern of SARS-CoV-2 in wastewater in New York City from 2020鈥2023. These estimates can be used to account for changes in underlying infection rate and help more accurately quantify changes in COVID-19 transmission and severity over time. We have also demonstrated that wastewater surveillance data combined with model-inference estimates can support the identification of time-periods that potentially intensify transmission. Additional studies are needed to better understand these periods and the potential to mitigate SARS-CoV-2 transmission.
Data availability
The SARS-CoV-2/COVID-19 cases, emergency department visits, mortality, and wastewater surveillance data were used with permission under a Data Use and Nondisclosure Agreement between the NYC Health Department and Columbia University. The NYC Health Department also has a comprehensive, publicly available data website here: .
Abbreviations
- CI:
-
Confidence interval
- COVID-19:
-
Coronavirus disease 2019
- ED:
-
Emergency department
- NYC:
-
New York City
- RNA:
-
Ribonucleic acid
- RT-dPCR:
-
Reverse transcription digital PCR
- RT-qPCR:
-
Quantitative reverse transcription polymerase chain reaction
- SARS-CoV-2:
-
Severe acute respiratory syndrome coronavirus 2
- SEIRSV model:
-
Susceptible-exposed-infectious-(re)susceptible-vaccination model
References
Reynolds LJ, Gonzalez G, Sala-Comorera L, Martin NA, Byrne A, Fennema S, et al. SARS-CoV-2 variant trends in Ireland: Wastewater-based epidemiology and clinical surveillance. Sci Total Environ. 2022;838(Pt 2):155828. . Epub 2022/05/20. PubMed PMID: 35588817; PMCID: PMC9110007.
Hoar C, Chauvin F, Clare A, McGibbon H, Castro E, Patinella S, et al. Monitoring SARS-CoV-2 in wastewater during New York City鈥檚 second wave of COVID-19: sewershed-level trends and relationships to publicly available clinical testing data. Environ Sci-Wat Res. 2022. .听PubMed PMID: WOS:000769620200001.
Melvin RG, Chaudhry N, Georgewill O, Freese R, Simmons GE. Predictive power of SARS-CoV-2 wastewater surveillance for diverse populations across a large geographical range. medRxiv. 2021.听. Epub 2021/02/04. PubMed PMID: 33532795; PMCID: PMC7852246.
Sanju谩n R, Domingo-Calap P. Reliability of wastewater analysis for monitoring COVID-19 incidence revealed by a long-term follow-up study. Front Virol. 2021;1. .
Schmitz BW, Innes GK, Prasek SM, Betancourt WQ, Stark ER, Foster AR, et al. Enumerating asymptomatic COVID-19 cases and estimating SARS-CoV-2 fecal shedding rates via wastewater-based epidemiology. Sci Total Environ. 2021;801:149794. . Epub 2021/09/02. PubMed PMID: 34467933; PMCID: PMC8378060.
Ungar L. Pandemic gets tougher to track as COVID testing plunges 2022 [updated 5/10/20228/16/2023]. Available from: .
Arielle Mitropoulos, Brownstein J. Decline of testing, sequencing could hinder search for future COVID-19 variants, experts warn 2022 [updated 3/30/2022; cited 2024 1/9/2024]. Available from: .
Keshaviah A, Karmali RN,听Vohra听D, Huffman T, Hu XC, Diamond MB. The Role of Wastewater Data in Pandemic Management. Washington, DC: Mathematica; 2022. Available from: .
Varkila MRJ, Montez-Rath ME, Salomon JA, Yu X, Block GA, Owens DK, et al. Use of wastewater metrics to track COVID-19 in the US. JAMA Netw Open. 2023;6(7):e2325591-e. .
Jarvie MM, Nguyen TNT, Southwell B, Wright D. Leveraging wastewater surveillance to actively monitor Covid-19 community dynamics in rural areas with reduced reliance on clinical testing. Appl Res. 2024;3(5):e202400012. .
Li L, Haak L, Carine M, Pagilla KR. Temporal assessment of SARS-CoV-2 detection in wastewater and its epidemiological implications in COVID-19 case dynamics. Heliyon. 2024;10(8):e29462. . Epub 20240409. PubMed PMID: 38638959; PMCID: PMC11024598.
New York City Department of Health and Mental Hygiene. SARS-CoV-2 concentrations measured in NYC Wastewater. Available from: .
Yang W, Kandula S, Huynh M, Greene SK, Van Wye G, Li W, et al. Estimating the infection-fatality risk of SARS-CoV-2 in New York City during the spring 2020 pandemic wave: a model-based analysis. Lancet Infect Dis. 2021;21(2):203鈥12. .
Yang W, Greene SK, Peterson ER, Li W, Mathes R, Graf L, et al. Epidemiological characteristics of the B.1.526 SARS-CoV-2 variant. Sci Adv. 2022;8(4):eabm0300. .
Li R, Pei S, Chen B, Song Y, Zhang T, Yang W, et al. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2). Science. 2020;368(6490):489鈥93. . Epub 2020/03/18. PubMed PMID: 32179701; PMCID: PMC7164387.
Ahmed W, Smith WJM, Metcalfe S, Jackson G, Choi PM, Morrison M, et al. Comparison of RT-qPCR and RT-dPCR platforms for the trace detection of SARS-CoV-2 RNA in wastewater. ACS ES T Water. 2022;2(11):1871鈥80. .
Yang W, Parton H, Li W, Watts EA, Lee E, Yuan H. SARS-CoV-2 dynamics in New York City during March 2020 - August 2023. medRxiv. 2024:2024.07.19.24310728. .
Anderson JL. An ensemble adjustment Kalman filter for data assimilation. Mon Weather Rev. 2001;129(12):2884鈥903.听.
Yang W, Shaff J, Shaman J. Effectiveness of non-pharmaceutical interventions to contain COVID-19: a case study of the 2020 spring pandemic wave in New York City. J R Soc Interface. 2021;18(175):20200822. .
NewYork City Department of Health and Mental Hygiene. NYC UHF 42 Neighborhoods. Available from: .
New York City Department of Health and Mental Hygiene. Variants 2023. Available from: .
Wolter N, Jassat W, Walaza S, Welch R, Moultrie H, Groome M, et al. Early assessment of the clinical severity of the SARS-CoV-2 omicron variant in South Africa: a data linkage study. Lancet. 2022;399(10323):437鈥46. .
Foladori P, Cutrupi F, Segata N, Manara S, Pinto F, Malpei F, et al. SARS-CoV-2 from faeces to wastewater treatment: what do we know? A review. Sci Total Environ. 2020;743:140444. . Epub 2020/07/11. PubMed PMID: 32649988; PMCID: PMC7311891.
Meekins DA, Gaudreault NN, Richt JA. Natural and experimental SARS-CoV-2 infection in domestic and wild animals. Viruses. 2021;13(10).听. Epub 2021/10/27. PubMed PMID: 34696423; PMCID: PMC8540328.
Karthikeyan S, Levy JI, De Hoff P, Humphrey G, Birmingham A, Jepsen K, et al. Wastewater sequencing reveals early cryptic SARS-CoV-2 variant transmission. Nature. 2022;609(7925):101鈥8. . Epub 2022/07/08. PubMed PMID: 35798029; PMCID: PMC9433318.
Hegazy N, Cowan A, D'Aoust PM, Mercier E, Towhid ST, Jia JJ, et al. Understanding the dynamic relation between wastewater SARS-CoV-2 signal and clinical metrics throughout the pandemic. Sci Total Environ. 2022;853. . PubMed PMID: WOS:000865439400008.
Hopkins L, Persse D, Caton K, Ensor K, Schneider R, McCall C, et al. Citywide wastewater SARS-CoV-2 levels strongly correlated with multiple disease surveillance indicators and outcomes over three COVID-19 waves. Sci Total Environ. 2023;855:158967. . Epub 2022/09/27. PubMed PMID: 36162580; PMCID: PMC9507781.
Natarajan A, Zlitni S, Brooks EF, Vance SE, Dahlen A, Hedlin H, et al. Gastrointestinal symptoms and fecal shedding of SARS-CoV-2 RNA suggest prolonged gastrointestinal infection. Med. 2022;3(6):371-87. e9.
Nourbakhsh S, Fazil A, Li M, Mangat CS, Peterson SW, Daigle J, et al. A wastewater-based epidemic model for SARS-CoV-2 with application to three Canadian cities. Epidemics-Neth. 2022;39. . PubMed PMID: WOS:000793569400003.
Teyssou E, Delagreverie H, Visseaux B, Lambert-Niclot S, Brichler S, Ferre V, et al. The Delta SARS-CoV-2 variant has a higher viral load than the Beta and the historical variants in nasopharyngeal samples from newly diagnosed COVID-19 patients. J Infect. 2021;83(4):E1鈥3. . PubMed PMID: WOS:000698619300001.
Li B, Deng A, Li K, Hu Y, Li Z, Xiong Q, et al. Viral infection and transmission in a large, well-traced outbreak caused by the SARS-CoV-2 Delta variant. medRxiv. 2021:2021.07.07.21260122. .
Yang Y, Guo L, Yuan J, Xu Z, Gu Y, Zhang J, et al. Viral and antibody dynamics of acute infection with SARS-CoV-2 omicron variant (B.1.1.529): a prospective cohort study from Shenzhen, China. Lancet Microbe. 2023;4(8):e632鈥41. . Epub 2023/07/18. PubMed PMID: 37459867.
Woodbridge Y, Amit S, Huppert A, Kopelman NM. Viral load dynamics of SARS-CoV-2 Delta and Omicron variants following multiple vaccine doses and previous infection. Nat Commun. 2022;13(1):6706. . Epub 2022/11/08. PubMed PMID: 36344489; PMCID: PMC9640564.
Migueres M, Dimeglio C, Mansuy JM, Abravanel F, Raymond S, Latour J, et al. Influence of Nasopharyngeal viral load on the spread of the omicron BA.2 variant. Clin Infect Dis. 2023;76(3):e514鈥7. . Epub 2022/07/08. PubMed PMID: 35796540.
Imai K, Ikeno R, Tanaka H, Takada N. SARS-CoV-2 omicron variant in human saliva samples in cell-free form. JAMA Netw Open. 2023;6(1):e2250207. . Epub 2023/01/10. PubMed PMID: 36622677; PMCID: PMC9857355.
Wolfel R, Corman VM, Guggemos W, Seilmaier M, Zange S, Muller MA, et al. Virological assessment of hospitalized patients with COVID-2019. Nature. 2020;581(7809):465鈥9. . Epub 2020/04/03. PubMed PMID: 32235945.
Albert S, Ruiz A, Peman J, Salavert M, Domingo-Calap P. Lack of evidence for infectious SARS-CoV-2 in feces and sewage. Eur J Clin Microbiol Infect Dis. 2021;40(12):2665鈥7. . Epub 2021/07/10. PubMed PMID: 34240259; PMCID: PMC8266481.
Boulware DR, Murray TA, Proper JL, Tignanelli CJ, Buse JB, Liebovitz DM, et al. Impact of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) vaccination and booster on coronavirus disease 2019 (COVID-19) symptom severity over time in the COVID-OUT trial. Clin Infect Dis. 2022;76(3):e1鈥9. .
Pena Rodriguez M, Hernandez Bello J, Vega Magana N, Viera Segura O, Garcia Chagollan M, Ceja Galvez HR, et al. Prevalence of symptoms, comorbidities, and reinfections in individuals infected with wild-type SARS-CoV-2, Delta, or Omicron variants: a comparative study in western Mexico. Front Public Health. 2023;11:1149795. . Epub 2023/05/14. PubMed PMID: 37181688; PMCID: PMC10174068.
Torabi SH, Riahi SM, Ebrahimzadeh A, Salmani F. Changes in symptoms and characteristics of COVID-19 patients across different variants: two years study using neural network analysis. 樱花视频 Infect Dis. 2023;23(1):838. . Epub 2023/11/29. PubMed PMID: 38017395; PMCID: PMC10683353.
Acknowledgements
This study was supported by the National Institute of Allergy and Infectious Diseases (AI175747 and AI145883) and Centers for Disease Control and Prevention (CDC) and the Council of State and Territorial Epidemiologists (CSTE; contract no.: NU38OT00297). The authors thank Lauren Firestein for overseeing the data use agreement and facilitating data sharing for this project; Ramona Lall for providing syndromic surveillance emergency department data; Wenhui Li for providing COVID-19-associated mortality data; Iris Cheng for providing immunization data; Jubayer Ahmed, Nelson De La Cruz, and Brandon Nguyen for managing and providing wastewater data; the NYC DOHMH Respiratory Pathogens data team for overarching data management and provision of data for this project; and Shama Ahuja, Sharon Greene, Scott Harper, Elizabeth Luoma, Ulrike Siemetzki-Kapoor, Celia Quinn, and Faten Taki for their input on this manuscript.
Funding
This study was supported by the National Institute of Allergy and Infectious Diseases (AI175747 and AI145883) and Centers for Disease Control and Prevention (CDC) and the Council of State and Territorial Epidemiologists (CSTE; contract no.: NU38OT00297).
Author information
Authors and Affiliations
Contributions
WY designed the study, performed the analysis, and wrote the first draft; EO, AO, and EAW oversaw provision of the SARS-CoV-2 wastewater surveillance data; HP and EL oversaw provision of the COVID-19 case and emergency department visit data. All authors contributed to the final draft.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This activity was classified as public health surveillance and exempt from ethical review and informed consent by the Institutional Review Boards of both Columbia University and NYC Health Department.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher鈥檚 Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article鈥檚 Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article鈥檚 Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit .
About this article
Cite this article
Yang, W., Omoregie, E., Olsen, A. et al. The use of wastewater surveillance to estimate SARS-CoV-2 fecal viral shedding pattern and identify time periods with intensified transmission. 樱花视频 25, 1108 (2025). https://doi.org/10.1186/s12889-025-22306-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12889-025-22306-1