Combining analysis of individual and wastewater whole genome sequencing improves SARS-CoV-2 surveillance

Evan P. Troendle*, Andrew J. Lee, Marina I. Reyne, Danielle M. Allen, Stephen J. Bridgett, Clara H. Radulescu, Michael Glenn, John-Paul Wilkins, Francesco Rubino, Behnam Firoozi Nejad, Cormac McSparron, Marc Niebel, Derek J. Fairley, Kate E. Binley, Christopher J. Creevey, Jennifer M. McKinley, Timofey Skvortsov, Deirdre F. Gilpin, John W. McGrath, Connor G. G. Bamford*David A. Simpson*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Downloads (Pure)

Abstract

Effective pathogen surveillance is critical for public health decision-making, with both individual and environmental monitoring playing essential roles. While wastewater (WW) and individual whole genome sequencing (WGS) have been used to monitor SARS-CoV-2 dynamics, their complementary potential for enhancing national-level genomic surveillance remains underexplored. This study aimed to evaluate the unique and combined contributions of WW and individual WGS to genomic surveillance. We conducted SARS-CoV-2 WGS on over 4000 WW samples and 23,000 individual samples across Northern Ireland (NI) between 2021 and 2023. SARS-CoV-2 RNA was amplified using the ARTIC nCov-2019 and Mini-XT protocols and sequenced on Illumina MiSeq. Variant compositions in WW data were analysed using Freyja and compared to individual data using time series analysis, correlation assessments, and volatility measurements via numerical derivatives, with mean absolute error (MAE) calculations used to assess concordance. Wastewater treatment plants (WWTPs) were ranked by concordance to individual WGS data. WW and individual WGS complementarity was quantified by mutation classification and overlap analysis. Temporal curve shifting was used to identify lags or leads in variant detection and to infer differences in geospatial spread between WW and individual sequencing data. We confirmed strong concordance between WW and individual variant compositions (mean MAE = 6.2 %). MAE was inversely correlated with sequencing rate (Pearson r=−0.37, p < 0.001) and increased during periods with more circulating variants, highlighting the value of increased sequencing efforts during volatile periods. The population size served by a WWTP was not a reliable indicator of how well its variant composition matched that of the national individual sequencing programme. Both individual and WW-based sequencing (WBS) detected unique, as well as common mutations. Patterns of variant spread within NI were consistent between both programmes (Pearson r = 0.63, p = 0.036), providing complementary insights into variant trends and geospatial spread. We demonstrate that integration of individual and WW WGS data offers more comprehensive SARS-CoV-2 genomic surveillance and improves confidence in predictions of variant composition and spread.

Original languageEnglish
Article number123953
Number of pages14
JournalWater Research
Volume284
Early online date13 Jun 2025
DOIs
Publication statusEarly online date - 13 Jun 2025

Bibliographical note

Publisher Copyright:
© 2025 The Author(s)

Keywords

  • Bioinformatics
  • Genomic epidemiology for public health
  • Integrated surveillance systems
  • Sars-cov-2 variant detection
  • Wastewater sequencing
  • Whole genome sequencing (wgs)

ASJC Scopus subject areas

  • Environmental Engineering
  • Civil and Structural Engineering
  • Ecological Modelling
  • Water Science and Technology
  • Waste Management and Disposal
  • Pollution

Fingerprint

Dive into the research topics of 'Combining analysis of individual and wastewater whole genome sequencing improves SARS-CoV-2 surveillance'. Together they form a unique fingerprint.

Cite this