This article explores the transformative role of genomic sciences within the One Health framework, which recognizes the interconnectedness of human, animal, and environmental health.
This article explores the transformative role of genomic sciences within the One Health framework, which recognizes the interconnectedness of human, animal, and environmental health. Aimed at researchers, scientists, and drug development professionals, it covers foundational concepts, methodological applications, and optimization strategies for genomic surveillance and analysis. The content delves into how cross-species genomic data is revolutionizing pandemic preparedness, antimicrobial resistance (AMR) tracking, and the management of zoonotic diseases. It also addresses critical challenges in data integration and bioinformatics, evaluates the comparative effectiveness of One Health genomics against traditional siloed approaches, and discusses future directions for implementing these technologies in biomedical research and clinical practice.
The One Health framework represents a transformative approach to public health that recognizes the inextricable linkages between human, animal, and environmental health. This whitepaper examines the critical imperative of integrating One Health principles with modern genomic sciences to address complex global health challenges. Through advanced genomic technologies including high-throughput sequencing, bioinformatic analyses, and real-time surveillance, researchers can now decode complex biological interactions across species and ecosystems with unprecedented precision. This technical guide explores methodological frameworks, experimental protocols, and practical applications of genomics within the One Health paradigm, providing researchers and drug development professionals with actionable strategies for implementing integrated health solutions.
One Health is defined as "an integrated, unifying approach that aims to sustainably balance and optimize the health of people, animals, and ecosystems" [1] [2]. This approach recognizes that the health of humans, domestic and wild animals, plants, and the wider environment are closely linked and interdependent [1]. The framework mobilizes multiple sectors, disciplines, and communities at varying levels of society to work together to foster well-being and tackle threats to health and ecosystems while addressing collective needs for healthy food, water, energy, and air [2].
The conceptual foundation of One Health rests on several key principles [3] [2]:
While the interconnectedness of human, animal, and environmental health has been recognized for centuries, the formalized One Health concept gained significant traction in the early 2000s in response to emerging zoonotic diseases [4]. The Manhattan Principles established in 2004 by the Wildlife Conservation Society represented a pivotal milestone, explicitly recognizing the critical links between human and animal health and the threats diseases pose to food supplies and economies [3] [4]. This was subsequently refined through the Berlin Principles, which expanded the conceptual framework [3].
The SARS outbreak in 2003 and the subsequent spread of highly pathogenic avian influenza H5N1 demonstrated the necessity of collaborative, cross-disciplinary approaches to emerging infectious diseases [4]. These events catalyzed international cooperation and institutional recognition of One Health, leading to the formation of the One Health High-Level Expert Panel (OHHLEP) in 2020 by the Quadripartite organizations: the Food and Agriculture Organization (FAO), the World Organisation for Animal Health (WOAH), the World Health Organization (WHO), and the United Nations Environment Programme (UNEP) [3] [2].
The COVID-19 pandemic has further underscored the urgent need for strengthened One Health approaches, with greater emphasis on connections to the environment and promoting healthy recovery [1]. The pandemic revealed how deeply human health is intertwined with animal health and ecosystem integrity, highlighting the necessity of genomic tools for understanding pathogen emergence and transmission dynamics [5] [6].
Modern genomic technologies have revolutionized our ability to operationalize the One Health framework across human, animal, and environmental domains. These technologies provide powerful tools for understanding pathogen evolution, transmission dynamics, and host-pathogen interactions at unprecedented resolution [3].
Table 1: Genomic Technologies in One Health Applications
| Technology | Key Features | One Health Applications | References |
|---|---|---|---|
| Nanopore Sequencing | Portable, real-time sequencing, long reads, direct RNA sequencing | Field-based pathogen detection, outbreak surveillance, antimicrobial resistance monitoring | [5] |
| Whole Genome Sequencing (WGS) | Comprehensive genomic data, high resolution | Pathogen characterization, transmission tracking, antimicrobial resistance research | [6] |
| Metagenomic Next-Generation Sequencing (mNGS) | Culture-independent, unbiased pathogen detection | Novel pathogen discovery, microbiome studies, environmental surveillance | [7] |
| Bioinformatic Pipelines | Data integration, computational analysis, visualization | Genomic epidemiology, phylogenetic analysis, predictive modeling | [6] [3] |
Nanopore sequencing represents a particularly transformative technology for One Health applications due to its portability and real-time capabilities [5]. This technology enables genomic analyses in field settings and local laboratories, making genomic surveillance accessible in resource-limited environments common in tropical regions where many emerging zoonotic diseases originate [5] [7]. The MinION device, for example, has been deployed worldwide to break down barriers and improve the accessibility and versatility of genomic sequencing [5].
The declining costs of DNA sequencing have further accelerated the adoption of genomic technologies in One Health contexts. However, this has created new challenges in data management, with annual acquisition of raw genomic data worldwide expected to exceed one zettabyte (one trillion GB) by 2025 [6]. This massive data generation necessitates robust bioinformatic infrastructure and scalable computational workflows to transform raw sequence data into actionable insights [6].
Implementing genomic technologies within a One Health framework requires systematic approaches that coordinate activities across human, animal, and environmental sectors. The Generalizable One Health Framework (GOHF) provides a structured five-step process for developing capacity to coordinate zoonotic disease programming across sectors [8]:
Figure 1: Generalizable One Health Framework (GOHF) for implementing genomic surveillance systems
Step 1: Engagement involves establishing One Health interest by identifying and engaging stakeholders across relevant sectors. This includes prioritizing zoonotic diseases through formalized processes like the One Health Zoonotic Disease Prioritization (OHZDP) and establishing sustained government support [8].
Step 2: Assessment focuses on mapping existing infrastructure and establishing baseline information on current activities, disease burden, and epidemiologic situations. Infrastructure mapping visualizes mechanisms of communication, collaboration, and coordination between sectors [8].
Step 3: Planning develops strategic roadmaps that define specific objectives, interventions, and resource requirements for genomic One Health implementation. This includes developing integrated surveillance plans that incorporate genomic data from human, animal, and environmental sources [8].
Step 4: Implementation executes the planned activities through coordinated action across sectors. This includes establishing laboratory networks, data sharing platforms, and joint response protocols that leverage genomic technologies [8].
Step 5: Evaluation assesses the effectiveness of implemented activities and systems, using monitoring data to refine approaches and improve outcomes in an iterative process [8].
Implementing genomic surveillance within a One Health framework requires standardized methodologies that enable comparable data generation across human, animal, and environmental samples. The following protocols outline key approaches for integrated genomic surveillance:
Protocol 1: Cross-Species Pathogen Genomic Surveillance
Protocol 2: Metagenomic Monitoring of Antimicrobial Resistance
Table 2: Essential Research Reagents for One Health Genomic Applications
| Reagent Category | Specific Products | Application in One Health Genomics | Technical Considerations |
|---|---|---|---|
| Nucleic Acid Extraction Kits | QIAamp DNA/RNA Mini Kit, DNeasy PowerSoil Pro Kit | Efficient extraction from diverse sample matrices (clinical, environmental, veterinary) | Optimize protocols for different sample types; include inhibition removal steps |
| Library Preparation Kits | Nextera XT DNA Library Prep, Ligation Sequencing Kit (Nanopore) | Preparation of sequencing libraries from minimal input DNA | Select kits based on sequencing platform; consider multiplexing requirements |
| Portable Sequencing Devices | MinION, SmidgION (Oxford Nanopore) | Real-time genomic surveillance in field settings | Leverage portability for deployment in remote locations; optimize power supply |
| Target Enrichment Systems | Twist Target Enrichment, SureSelect XT HS | Focused sequencing of pathogen genomes from complex samples | Design panels to cover priority pathogens; optimize for cross-species detection |
| Bioinformatic Tools | CLC Genomics Workbench, Galaxy Platform, BV-BRC | Integrated analysis of genomic data from multiple sectors | Ensure compatibility across data types; implement reproducible workflows |
The integration of genomic data within a One Health context requires sophisticated bioinformatic frameworks capable of processing and analyzing heterogeneous datasets from multiple sources. Effective implementation relies on computational infrastructures that support data integration, analysis, and interpretation across disciplinary boundaries [6] [3].
Automated analysis pipelines such as the Automatic Bacterial Isolate Assembly, Annotation and Analyses Pipeline (ASA3P) provide standardized approaches for processing genomic data from diverse sources [6]. These pipelines enable reproducible analyses that facilitate comparisons across human, animal, and environmental isolates, supporting integrated surveillance and outbreak investigations.
Cloud computing infrastructures have become essential for managing the computational demands of One Health genomics. By sharing computational resources across multiple users and institutions, cloud platforms enable scalable analyses while minimizing costs associated with maintaining local computational infrastructure [6]. This approach is particularly valuable for resource-limited settings where establishing local high-performance computing capacity may be challenging.
Integrating genomic data with epidemiological, environmental, and clinical information creates a powerful foundation for One Health analytics. The relationships between these data types can be visualized through the following framework:
Figure 2: Data integration framework for One Health genomics
Key data types integrated in this framework include:
The integration of these diverse data streams enables the development of predictive models for disease emergence and spread, facilitates molecular epidemiology to track transmission pathways across species boundaries, and supports risk assessment for targeted interventions [3] [8].
Genomic technologies have transformed our approach to zoonotic disease surveillance and outbreak response within the One Health framework. Several key applications demonstrate the power of integrated genomic surveillance:
Emerging Infectious Disease Detection: The real-time genomic capabilities of nanopore sequencing have enabled rapid detection and characterization of emerging pathogens at the human-animal-environment interface [5] [7]. During the COVID-19 pandemic, genomic surveillance demonstrated how SARS-CoV-2 variants emerged and spread between human and animal populations, informing targeted control measures [5] [6].
Outbreak Investigation: Whole genome sequencing provides high-resolution data for tracking transmission pathways during zoonotic disease outbreaks. Genomic epidemiology has been successfully applied to investigate outbreaks of diseases including Ebola virus, MERS-CoV, and zoonotic influenza, revealing patterns of both human-to-human and animal-to-human transmission [6] [8]. The application of genomic data during the 2013-2016 Ebola outbreak in West Africa, for example, enabled researchers to distinguish between sustained human-to-human transmission and repeated zoonotic introductions [6].
Table 3: Genomic Applications in Zoonotic Disease Management
| Disease Category | Genomic Application | One Health Impact | References |
|---|---|---|---|
| Zoonotic Influenza | Whole genome sequencing to track reassortment events and host adaptations | Informed vaccine strain selection and animal control measures | [6] [8] |
| Antimicrobial Resistance | Metagenomic sequencing to track resistance genes across reservoirs | Identified transmission pathways between clinical, agricultural, and environmental settings | [6] [7] |
| Foodborne Pathogens | Genomicæº¯æº of Salmonella, E. coli, and Campylobacter along food production chains | Enabled targeted interventions at critical control points from farm to consumer | [9] [8] |
| Vector-Borne Diseases | Pathogen genomics combined with vector and host genetics | Revealed complex transmission cycles involving multiple host species | [7] |
The spread of antimicrobial resistance represents a quintessential One Health challenge that benefits significantly from genomic approaches. Resistance genes can move between bacteria inhabiting humans, animals, and the environment, requiring integrated surveillance across all reservoirs [6] [4].
Genomic studies have revealed that 73% of the global antimicrobial consumption occurs in livestock production, contributing significantly to the emergence and dissemination of resistance genes [6]. The use of genomics has enabled researchers to track the flow of specific resistance mechanisms between agricultural settings, the environment, and clinical environments, informing interventions to slow the spread of resistance [6].
The comprehensive nature of genomic data provides distinct advantages for AMR surveillance compared to traditional phenotypic methods. Whole genome sequencing enables detection of known resistance determinants alongside emerging mechanisms, supports analysis of mobile genetic elements that facilitate resistance gene transfer, and allows for tracking of resistant clones across sectors and geographic boundaries [6].
Despite the demonstrated value of genomic technologies in One Health applications, significant implementation challenges remain:
Resource Limitations: The infrastructure required for genomic sequencingâincluding reliable electricity, cold chain storage, and computational resourcesâmay be unavailable in many resource-limited settings where zoonotic disease threats are most prominent [7]. Portable sequencing technologies such as the MinION have helped address some of these barriers, but challenges remain in establishing sustainable sequencing capacity [5] [7].
Bioinformatic Capacity: The analysis of genomic data requires specialized computational skills that may not be available across all sectors involved in One Health implementation [6] [7]. Building bioinformatic capacity through training programs and developing user-friendly analytical platforms are essential for expanding access to genomic technologies [6].
Data Integration Challenges: Combining genomic data with epidemiological, clinical, and environmental information presents technical challenges related to data standardization, interoperability, and visualization [3]. Developing common data standards and interoperable platforms is critical for maximizing the utility of genomic data in One Health decision-making [3] [8].
The application of genomic technologies within One Health raises important ethical considerations that must be addressed:
Equitable Access: There is significant disparity in genomic sequencing capacity between high-income and low-to-middle-income countries, creating an imbalance in who benefits from genomic research [3] [7]. This inequity is particularly problematic since many emerging zoonotic diseases originate in tropical regions where sequencing capacity may be most limited [7].
Data Sovereignty: Genomic research involving biological samples from low and middle-income countries has historically often extracted resources and data without ensuring equitable sharing of benefits [5] [7]. Principles such as the CARE (Collective Benefit, Authority to Control, Responsibility, Ethics) principles for Indigenous Data Governance and the Nagoya Protocol on Access and Benefit Sharing provide frameworks for ensuring that communities and countries maintain authority over their genetic resources and associated data [5].
Digital Colonialism: The dominance of high-income countries in genomic research can lead to inequities where data are extracted from low and middle-income countries but analyzed and utilized elsewhere, limiting local capacity building and benefit [3] [7]. Building local sequencing and bioinformatic capacity through initiatives like the Africa BioGenome Project represents an important step toward addressing these imbalances [7].
The future of One Health genomics will be shaped by several technological and methodological developments:
Real-Time Genomic Surveillance: The increasing portability and decreasing cost of sequencing technologies will enable more widespread implementation of real-time genomic surveillance at the human-animal-environment interface [5]. The integration of automated sample preparation with portable sequencers will further enhance capabilities for rapid response to emerging threats.
Single-Cell Genomics: Advances in single-cell sequencing will enable more precise characterization of microbial communities and host-pathogen interactions across different reservoirs, providing unprecedented resolution for understanding transmission dynamics [5].
AI-Enhanced Analytics: Artificial intelligence and machine learning approaches will increasingly support the integration of genomic data with other data types, enabling more sophisticated predictive models for disease emergence and spread [10]. These approaches will help identify subtle patterns across complex datasets that may not be apparent through conventional analytical methods.
Pan-Genome Representations: Moving beyond single reference genomes to pan-genome representations that capture the full diversity of pathogens and hosts will enhance our ability to track transmissions and understand adaptive evolution [5]. This approach is particularly valuable for pathogens with significant genomic diversity or those that rapidly evolve in response to selective pressures.
The integration of modern genomic technologies within the One Health framework represents a powerful paradigm for addressing complex health challenges at the human-animal-environment interface. Genomic tools provide unprecedented capabilities for tracking pathogen transmission, understanding antimicrobial resistance dissemination, and detecting emerging threats across species boundaries.
The effective implementation of One Health genomics requires collaborative frameworks that coordinate activities across human health, animal health, and environmental sectors [8]. Methodological standards for genomic surveillance, data sharing, and bioinformatic analysis must be established to enable comparable data generation and interpretation across disciplines [6] [3].
As genomic technologies continue to evolve, their application within One Health approaches will become increasingly essential for global health security. By fostering interdisciplinary collaboration and building capacity across sectors and geographic regions, the scientific community can harness the power of genomics to promote health for people, animals, and ecosystems in an increasingly interconnected world.
The integration of genomic technologies into public health surveillance represents a paradigm shift in how we detect, monitor, and respond to health threats. Framed within the broader One Health context that recognizes the interconnectedness of human, animal, and environmental health, genomic surveillance provides unprecedented insights into pathogen evolution, transmission dynamics, and host-pathogen interactions across species and ecosystems [11]. This technical guide examines core applications, methodologies, and implementation frameworks for leveraging genomics as a cross-cutting tool in integrated health surveillance systems, with particular relevance for researchers, scientists, and drug development professionals working at the intersection of human, animal, and environmental health.
The One Health concept underscores that health outcomes across human, animal, and environmental domains are intrinsically linked, necessitating integrated, transdisciplinary approaches to address contemporary health challenges [11]. Genomic technologies serve as a foundational tool within this framework by enabling precise characterization of pathogens and their movements across interfaces. The dramatic reduction in sequencing costsâfrom approximately $10 million per megabase of DNA sequence in 2001 to less than one cent todayâhas made these tools increasingly accessible and scalable for routine public health applications [12] [13].
Next-generation sequencing (NGS) and whole-genome sequencing (WGS) provide higher resolution than traditional subtyping technologies, enabling more accurate cluster detection, transmission tracing, and phenotypic characterization of pathogens [13]. When genomic data are integrated with epidemiological, clinical, and environmental information through integrated genomic surveillance (IGS) systems, public health agencies can achieve earlier detection of outbreaks, better understanding of transmission pathways, and more targeted interventions [14]. This approach is particularly valuable for addressing zoonotic diseases, which account for up to 75% of new or emerging infectious diseases and require coordinated investigation across human and animal populations [15].
Genomic surveillance delivers actionable insights across diverse public health domains, from foodborne illness outbreaks to antimicrobial resistance monitoring. The table below summarizes key application areas with specific examples and documented impacts.
Table 1: Applications and Documented Impact of Genomic Surveillance in Public Health
| Application Area | Specific Use Cases | Key Impact Metrics | References |
|---|---|---|---|
| Outbreak Detection & Investigation | Foodborne pathogens (Listeria, Salmonella), COVID-19 variant tracking | Listeria: Increased detected clusters from 14 to 21 annually; median cases per cluster decreased from 6 to 3 | [13] |
| Antimicrobial Resistance (AMR) Surveillance | Carbapenemase-producing organisms (CPOs), Mycobacterium tuberculosis | WGS enables comprehensive AMR gene detection vs. traditional PCR; identifies resistance mechanisms | [16] [17] [13] |
| Zoonotic Disease Monitoring | Pathogen discovery in reservoir hosts, interspecies transmission tracking | 75% of new/emerging infectious diseases have zoonotic origins | [15] |
| Vaccine & Therapeutic Development | Seasonal influenza monitoring, SARS-CoV-2 variant characterization | Informs diagnostic tests, treatments, and vaccine updates | [13] |
Genomic surveillance has transformed public health response to foodborne illnesses. Prior to implementation, many outbreaks went undetected or were identified only after extensive spread. The integration of WGS into foodborne disease surveillance has enabled more rapid and precise intervention, with the number of Listeria outbreaks resolved through identification of contaminated food sources increasing from one to nine per year following implementation [13]. Similarly, genomic surveillance of carbapenemase-producing organisms in Washington State demonstrated high congruence between genomic and epidemiologically defined clusters, with genomic data helping to refine linkage hypotheses and address gaps in traditional surveillance [16] [17].
For antimicrobial resistance, WGS provides superior resolution for cluster investigations compared to traditional methods such as multilocus sequence typing (MLST) [16] [17]. The technology enables detection of resistance genes and differentiation between related and unrelated cases, informing infection control measures and antimicrobial stewardship programs [16]. Beyond human health, genomic applications extend to environmental and agricultural domains within the One Health framework, including wastewater surveillance for community-level pathogen monitoring and genomic breeding programs for improving food security in tropical regions [15] [7].
The following diagram illustrates the core workflow for integrated genomic surveillance, from sample collection to public health action:
Based on the Washington State Department of Health's successful implementation for antimicrobial resistance surveillance, the following protocol provides a robust framework for bacterial pathogen genomic characterization [16] [17]:
Sample Preparation and DNA Extraction:
Library Preparation and Sequencing:
Quality Control Thresholds: The following table outlines essential QC parameters and thresholds for ensuring high-quality genomic data:
Table 2: Quality Control Parameters for Genomic Sequencing
| Parameter | Threshold | Remedial Action |
|---|---|---|
| Average Read Depth | â¥40à | Repeat sequencing |
| Genome Size | â¥1 Mb | Investigate extraction issues |
| Assembly Scaffolds | <500 | Optimize library preparation |
| Assembly Ratio SD | â¤2.58 | Check for contamination |
The bioinformatics workflow for genomic surveillance involves multiple steps to transform raw sequencing data into actionable insights:
Implementation with CDC PHoeNIx Pipeline:
Cluster Definition Criteria:
Effective genomic surveillance requires robust technical infrastructure to handle the substantial computational and data storage demands of sequencing data. Specialized file formats have been developed to optimize analysis speed and storage efficiency for quantitative genomic data.
The D4 (dense depth data dump) format represents an innovation specifically designed for sequencing depth data, balancing improved analysis speeds with efficient file size [18]. The format uses an adaptive encoding scheme that profiles a random sample of aligned sequence depth to determine an optimal encoding strategy, taking advantage of the observation that depth values often have low variability in many genomics assays.
Table 3: Technical Solutions for Genomic Data Management
| Component | Solution Options | Key Features |
|---|---|---|
| Data Format | D4 Format, bigWig, bedGraph | D4 offers adaptive encoding, fast random access, better compression for depth data [18] |
| Analysis Platform | CDC PHoeNIx, BigBacter, Illumina DRAGEN | Standardized pipelines, recombination-aware analysis, integration with public health systems [16] |
| Data Integration | Custom R scripts, Nextstrain Auspice | Combines genomic and epidemiological data for visualization and interpretation [16] |
| Cloud Infrastructure | GVS (Genomic Variant Store) | Enables joint calling across large datasets (>245,000 genomes in All of Us program) [19] |
For large-scale genomic initiatives such as the All of Us Research Program, which has released 245,388 clinical-grade genome sequences, specialized cloud-based variant storage solutions like the Genomic Variant Store (GVS) have been developed to enable efficient querying and analysis of massive genomic datasets [19]. These infrastructures are essential for supporting the joint calling and variant discovery processes required for population-scale genomic surveillance.
Successful implementation of genomic surveillance requires careful consideration of technical requirements, workforce development, and ethical frameworks. The following table provides essential components for establishing genomic surveillance capabilities:
Table 4: Research Reagent Solutions for Genomic Surveillance
| Category | Specific Products/Technologies | Application Notes |
|---|---|---|
| Sequencing Platforms | Illumina MiSeq, NovaSeq 6000, Oxford Nanopore | MiSeq suitable for smaller batches; NovaSeq for high-throughput; Nanopore for portability [16] [7] |
| Library Prep Kits | Illumina DNA Prep with Nextera DNA CD indexes, Illumina Microbial Amplicon Prep | Target enrichment for specific pathogens; hybrid capture for zoonotic pathogen monitoring [15] [16] |
| Analysis Tools | CDC PHoeNIx, PopPUNK, Gubbins, IQTREE2 | Open-source pipelines optimized for public health pathogen characterization [16] [17] |
| Data Visualization | Nextstrain Auspice, R with custom scripts | Phylogenetic trees integrated with epidemiological data for outbreak investigation [16] |
| Mat2A-IN-21 | Mat2A-IN-21, MF:C26H20F2N4O2, MW:458.5 g/mol | Chemical Reagent |
| Fuziline | Fuziline, MF:C24H39NO7, MW:453.6 g/mol | Chemical Reagent |
Building sustainable genomic surveillance capacity requires investments beyond sequencing equipment alone [12] [13]:
Despite its transformative potential, genomic surveillance implementation faces several challenges [12] [7]:
The Public Health Bioinformatics Fellowship programs and Pathogen Genomics Centers of Excellence (PGCoEs) represent promising models for addressing these challenges through shared resources, training, and standardized approaches [12].
Genomic technologies have evolved from specialized research tools to essential components of integrated health surveillance systems within the One Health framework. The precise characterization of pathogens and their transmission pathways across human, animal, and environmental interfaces enables more targeted and effective public health interventions. As demonstrated by successful implementations in food safety, antimicrobial resistance monitoring, and pandemic response, genomic surveillance provides the resolution necessary to detect outbreaks earlier, trace transmission sources more accurately, and understand pathogen evolution more completely.
Future advancement will require continued investment in cross-disciplinary workforce development, interoperable data systems, and ethical frameworks that facilitate responsible data sharing. The integration of metagenomic approaches for unbiased pathogen detection, combined with portable sequencing technologies, promises to further transform public health capabilitiesâparticularly in resource-limited tropical regions where health challenges are most acute [7]. For researchers, scientists, and drug development professionals, genomic surveillance offers not just a reactive tool for outbreak response, but a proactive foundation for understanding disease ecology and developing more effective countermeasures against health threats spanning the One Health spectrum.
The One Health framework represents a transformative approach to managing global health challenges, recognizing the profound interconnections between human, animal, plant, and environmental health. In an era characterized by globalization, climate change, and increasing antimicrobial resistance, genomic sciences have emerged as a critical tool for understanding these complex interactions. This whitepaper examines key global initiatives and policy drivers that are advancing the integration of genomics within the One Health paradigm, from high-level United Nations deliberations to groundbreaking continental projects like the African BioGenome Project. The strategic application of genomic technologies across these sectors enables precise pathogen identification, real-time outbreak tracking, and insights into host-pathogen co-evolution, ultimately strengthening global health security and sustainable development [20]. These initiatives collectively address the urgent need for coordinated, cross-sectoral approaches to mitigate biological threats through advanced genomic surveillance and research.
The United Nations General Assembly (UNGA) has established critical policy frameworks that directly and indirectly advance the One Health genomic sciences agenda. The Fourth UN High-level Meeting on the prevention and control of noncommunicable diseases (NCDs) and the promotion of mental health and wellbeing (HLM4) in September 2025 resulted in the political declaration "Equity and integration: transforming lives and livelihoods through leadership and action on noncommunicable diseases and the promotion of mental health" [21]. This declaration, while primarily focused on human health, creates important linkages to broader One Health objectives by emphasizing whole-of-government and whole-of-society collaboration to address underlying social, economic, commercial, and environmental drivers of health risks [21].
Concurrently, the UNGA79 Science Summit emphasized leveraging interdisciplinary solutions to bridge health divides, with mental health, digital healthcare, and One Health frameworks emerging as focal areas. Discussions highlighted the transformative power of integrating science, technology, and innovation (STI) across sectors to address systemic barriers including inequitable resource distribution and the impacts of climate change on health and agriculture [22]. These high-level policy dialogues create an enabling environment for genomic research that transcends traditional sectoral boundaries and addresses health challenges at the human-animal-environment interface.
Beyond specific declarations, ongoing UN-based processes provide critical platforms for advancing the One Health genomic agenda. The Quadripartite Collaboration between the Food and Agriculture Organization (FAO), World Health Organization (WHO), World Organisation for Animal Health (WOAH), and United Nations Environment Program (UNEP) has developed a One Health-driven joint plan of action to integrate systems and capacity for antimicrobial resistance (AMR) surveillance, governance framework, and cross-sectional interventions worldwide [23]. This collaboration recognizes that timely integrated data on AMR and antimicrobial use across human, animal, and environmental sectors is critical to curbing AMR's devastating impact, which is projected to cause 10 million deaths annually by 2050 if left unaddressed [23].
Table 1: Key Policy Frameworks Supporting One Health Genomics
| Policy Framework | Lead Organizations | Primary Focus Areas | Relevance to Genomics |
|---|---|---|---|
| UNGA HLM4 Political Declaration (2025) | UN Member States | NCDs, mental health, health equity | Creates enabling policy environment for cross-sectoral research |
| Quadripartite One Health Joint Plan of Action | FAO, WHO, WOAH, UNEP | Antimicrobial resistance, zoonotic diseases | Promotes integrated genomic surveillance systems |
| European One Health AMR Partnership | 53 organizations from 30 countries | Antimicrobial resistance | Fosters collaborative genomic research on AMR |
| UK Biological Security Strategy | UK Government | Biological security, pandemic preparedness | Funds genomic surveillance initiatives like GAP-DC |
The African BioGenome Project (AfricaBP) is a transformative, continent-wide initiative that exemplifies the application of One Health principles to genomic sciences. This pan-African effort aims to sequence, catalog, and study the genomes of Africa's rich and diverse biodiversity, with an ambitious goal of sequencing approximately 105,000 non-human eukaryotic genomes including plants, animals, fungi, and protozoa [24]. The project operates through the Digital Innovations in Africa for a Sustainable Agri-Environment and Conservation (DAISEA) network, fostering scientific collaborations and partnerships to provide a platform for innovations and policy change across Africa through biodiversity genomics [25].
AfricaBP's economic implications are substantial. A cost-benefit analysis of the South African Beef Genomics Program revealed that a total investment of US$44 million over 10 years was expected to yield at least US$139 million in benefits, with an economic return of 18.70% and a Benefit-Cost Ratio of 3.1 [24]. Similarly, a case study on the proposed 1000 Moroccan Genome Project projected a Benefit-Cost Ratio of 3.29, indicating US$3.29 in benefits for every US$1 invested [24]. These economic analyses demonstrate the significant value proposition of strategic investments in genomic infrastructure within the One Health framework.
The Genomics for Animal and Plant Disease Consortium (GAP-DC), launched in July 2023 and supported by Defra and UKRI, represents a sophisticated implementation of One Health genomics in a high-income context. This initiative addresses complexities in the UK's distributed animal and plant health science landscape by bringing together key organizations in pathogen detection and genomics for terrestrial and aquatic animal health (APHA, Royal Veterinary College, CEFAS, The Pirbright Institute) and plant health (Fera Science, Forest Research) [20].
GAP-DC's scope is defined by six interconnected work packages, each addressing a critical aspect of genomic surveillance:
This comprehensive approach facilitates collaboration and knowledge exchange between agencies, disciplines, and disease systems while exploring innovative methodologies such as environmental metagenomics [20].
Purdue University's One Health initiative exemplifies the innovative research approaches advancing the field. A project titled "Using Across-Phyla Methods To Increase Genomic Prediction Accuracy To Improve Health and Food Security" investigates whether techniques developed to study traits in animal and plant genetics can be adapted to explore similar questions in the human genome [10]. This cross-species genomic analysis represents a significant departure from traditional siloed approaches to genetics research.
The project builds on access to large-scale phenotypic and genome databases, including human genetic biobanks (23andMe, UK Biobank), plant genetic biobanks (EnsemblPlants, JGI Plant Gene Atlas), and livestock animal systems datasets through the Purdue Animal Sciences Research Data Ecosystem [10]. By leveraging methods developed in animal genetics that utilize abundant data and advanced predictive techniques, and applying them to human genetic datasets (which often have more restrictive data access), the research team aims to improve genomic predictions across all species within the One Health context [10].
Table 2: Major One Health Genomic Initiatives and Their Focus Areas
| Initiative | Geographic Scope | Primary Objectives | Key Achievements/Goals |
|---|---|---|---|
| African BioGenome Project | Continental Africa | Sequence 105,000 non-human eukaryotic species; build genomic capacity | Trained 401 researchers across 50 African countries in 2024 |
| GAP-DC | United Kingdom | Coordinate animal and plant disease genomics across government agencies | Six work packages addressing disease detection, spillover, and outbreak management |
| Purdue Cross-Species Genomics | United States | Develop cross-species genomic prediction methods | Leveraging animal genetics models for human health insights |
| European One Health AMR Partnership | 30 European countries | Combat antimicrobial resistance through collaborative research | 10-year program launched in 2025 involving 53 organizations |
Artificial intelligence (AI) applications are rapidly transforming One Health genomic surveillance, particularly in addressing complex challenges like antimicrobial resistance (AMR). A recent scoping review identified that AI is widely applied to combat AMR across different sectors (human, animal, and environmental health), with key opportunities including rapid identification of resistant pathogens, AI-powered surveillance and early warning systems, integration of diverse datasets, and support for drug discovery and antibiotic stewardship [23].
The review, which analyzed 43 studies after screening 543 initial candidates, found that AI systems can integrate genomic data with environmental surveillance to predict resistance hotspots and guide targeted interventions [23]. Initiatives like the Outbreak Consortium demonstrate the potential of AI-driven platforms to map risks using diverse datasets from various sectoral records, thereby fostering more effective antimicrobial stewardship [23]. However, significant challenges remain, such as data standardization issues, limited model transparency, infrastructure and resource gaps, ethical and privacy concerns, and difficulties in real-world implementation and validation [23].
Implementing robust One Health genomic surveillance requires standardized methodologies across diverse sectors and environments. The following experimental protocols represent best practices derived from major initiatives:
This protocol outlines a comprehensive approach to pathogen detection and characterization across human, animal, and environmental samples:
This methodology, developed by Purdue University researchers, enables the transfer of genomic prediction models across species boundaries:
Diagram 1: One Health Genomic Surveillance Workflow
Implementing robust One Health genomic research requires specialized reagents and materials optimized for diverse sample types across human, animal, plant, and environmental applications. The following table details essential research reagent solutions and their specific functions within this interdisciplinary field.
Table 3: Essential Research Reagents for One Health Genomic Studies
| Reagent/Material | Primary Function | Application in One Health Context |
|---|---|---|
| Cross-species nucleic acid extraction kits | DNA/RNA purification from diverse sample matrices | Standardized extraction from human, animal, plant, and environmental samples |
| Tagmentation-based library prep reagents | Rapid sequencing library preparation | High-throughput processing for pathogen surveillance across sectors |
| Pan-pathogen PCR master mixes | Amplification of conserved genomic regions | Broad-spectrum pathogen detection in human, animal, and environmental samples |
| Metagenomic sequencing kits | Untargeted sequencing of complex communities | Microbiome analysis across human, animal, and environmental interfaces |
| Hybridization capture reagents | Target enrichment for specific genomic regions | Focused sequencing of zoonotic pathogens or antimicrobial resistance genes |
| Long-read sequencing reagents | Generation of continuous sequence reads | Resolution of complex genomic regions in novel pathogens |
| Reverse transcriptase enzymes | cDNA synthesis from RNA templates | Detection and characterization of RNA viruses with zoonotic potential |
| Whole-genome amplification kits | Amplification of limited template DNA | Analysis of low-biomass samples from environmental or clinical sources |
Despite significant advancements, implementing comprehensive One Health genomic surveillance faces substantial technical challenges. Data standardization remains a critical hurdle, as genomic data from human, animal, plant, and environmental sources often utilize different formats, metadata standards, and quality control measures [23]. This lack of interoperability hinders effective integration and analysis across sectors. Additionally, many low- and middle-income countries face significant infrastructure gaps in sequencing capacity, computational resources, and bioinformatics expertise, creating global disparities in One Health genomic capabilities [24].
The AfricaBP Open Institute has made substantial progress in addressing these capacity gaps through its workshop series, which trained 401 African researchers across 50 countries in genomics, bioinformatics, molecular biology, sample collections, and biobanking in 2024 alone [24]. However, sustainable investment remains crucialâAfrican countries allocate an average of just 0.45% of their GDP to research and development, significantly below the global average of 1.7% [24]. This underinvestment hinders the transformation of Africa's intellectual capital into tangible products and services that could boost economic growth through genomic applications.
Advancing One Health genomics requires robust ethical frameworks to ensure equitable benefit sharing and appropriate use of genomic data. The AfricaBP has established important precedents through its commitment to equitable data sharing and benefits for the African population [24]. This includes developing policies to safeguard genomic data while promoting accessibility for African researchers, and ensuring that discoveries derived from African biodiversity translate to improved food security, biodiversity conservation, and health outcomes for African communities [24].
Similarly, the integration of AI in One Health applications raises important ethical considerations, including privacy concerns, model transparency, and the need for clear regulatory frameworks to ensure ethical and effective use within the One Health approach [23]. Future directions must prioritize the development of explainable AI systems that provide transparent decision-making processes, particularly when informing public health interventions or policy decisions based on integrated genomic data [23].
Diagram 2: Key Implementation Challenges in One Health Genomics
The integration of genomic sciences within the One Health framework represents a paradigm shift in how we approach complex health challenges at the human-animal-environment interface. Global policy drivers, from UNGA declarations to national biological security strategies, are increasingly recognizing the strategic importance of cross-sectoral genomic surveillance. Initiatives like the African BioGenome Project, GAP-DC, and innovative research at institutions like Purdue University demonstrate the tremendous potential of coordinated genomic approaches to address pressing challenges in food security, antimicrobial resistance, pandemic preparedness, and biodiversity conservation.
Moving forward, realizing the full potential of One Health genomics will require sustained investment in core capacities, development of standardized methodologies and data sharing frameworks, and commitment to equitable partnerships that ensure benefits are broadly shared across sectors and regions. Technological innovations, particularly in artificial intelligence and cross-species genomic analyses, offer promising tools for extracting deeper insights from integrated datasets. However, success will ultimately depend on continued collaboration across traditionally disparate disciplines and sectors, fostering a truly integrated approach to genomic sciences in service of health for all species and the planet we share.
The One Health concept underscores the interconnectedness of human, animal, and environmental health, necessitating an integrated, transdisciplinary approach to tackle contemporary health challenges [11]. Zoonotic diseases, which are infections transmitted between animals and humans, account for a substantial proportion of emerging infectious diseases. The genomic revolution has provided unprecedented capabilities to decode complex biological data, enabling comprehensive insights into pathogen evolution, transmission dynamics, and host-pathogen interactions across species and ecosystems [11]. Through the application of high-throughput sequencing technologies and sophisticated computational analyses, researchers can now trace the origins of pandemics with remarkable precision, monitor ongoing transmission events in near real-time, and identify genetic markers associated with increased transmissibility or virulence. This whitepaper explores how genomic technologies are transforming our understanding of zoonotic spillover events and pandemic origins, framed within the integrative context of One Health approaches that connect human, animal, and environmental surveillance systems.
Machine learning models represent a frontier in preemptive pandemic preparedness by identifying animal viruses with potential for human infectivity. A significant limitation in this field has been the lack of comprehensive datasets for viral infectivity, which restricts the predictable range of viruses. Recent research has addressed this through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences [26].
This approach substantially boosts model performance, particularly for segmented RNA viruses, which are involved with severe zoonoses but have been historically overlooked due to limited data availability. These models demonstrate high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating their applicability for mining zoonotic viruses from metagenomic data [26]. Models trained on data up to 2018 have demonstrated robust predictive capability for most viruses identified post-2018, though challenges remain in predicting human infectious risk for specific zoonotic viral lineages, including SARS-CoV-2 [26].
Table 1: Performance Metrics of BERT-infect Model Across Viral Families
| Viral Family | Prediction Accuracy (%) | Notable Strengths | Limitations |
|---|---|---|---|
| Orthomyxoviridae | 92.5 | High accuracy for segmented RNA viruses | Requires segment-specific training |
| Coronaviridae | 89.7 | Effective even with partial sequences | Lower performance on SARS-CoV-2 lineage |
| Rotaviridae | 94.2 | Robust across diverse genotypes | Requires extensive validation |
| Paramyxoviridae | 87.9 | Good generalizability | Moderate performance on novel variants |
The following protocol outlines the methodology for developing and validating machine learning models to predict viral infectivity in humans:
Dataset Curation and Preprocessing
Model Development and Training
Performance Evaluation
Molecular epidemiology leverages pathogen genomic data to reconstruct transmission dynamics and identify spillover events. A recent study on mpox transmission in West Africa exemplifies this approach, where researchers sequenced 118 MPXV genomes isolated from cases in Nigeria and Cameroon between 2018 and 2023 [27]. The genomic analysis revealed contrasting transmission patterns: cases in Nigeria primarily resulted from sustained human-to-human transmission, while cases in Cameroon were driven by repeated zoonotic spillovers from animal reservoirs [27].
Phylogenetic analysis enabled researchers to identify distinct zoonotic lineages circulating across the Nigeria-Cameroon border, suggesting that shared animal populations in cross-border forest ecosystems drive viral emergence and spread. The study identified the closest zoonotic outgroup to the Nigerian human epidemic lineage (hMPXV-1) in a southern Nigerian border state and estimated that the shared ancestor of the zoonotic outgroup and hMPXV-1 circulated in animals in southern Nigeria in late 2013 [27]. The analysis further revealed that hMPXV-1 emerged in humans in August 2014 in southern Rivers State and circulated undetected for three years before being identified, with Rivers State serving as the main source of viral spread during the human epidemic [27].
A key genomic signature differentiated sustained human transmission from zoonotic spillovers: APOBEC3 mutational bias. Researchers observed that approximately 74% of reconstructed single nucleotide polymorphisms (SNPs) in the human-transmitted hMPXV-1 lineage were indicative of APOBEC3 editing, a host antiviral mechanism. In contrast, only 9% of reconstructed SNPs across zoonotic transmissions showed this pattern [27]. This molecular signature serves as a reliable marker for distinguishing sustained human-to-human transmission from spillover events.
Table 2: Genomic Features Differentiating Zoonotic vs. Human Transmission in MPXV
| Transmission Type | APOBEC3 Mutation Signature | Genetic Diversity | Phylogenetic Pattern | Representative Cases |
|---|---|---|---|---|
| Zoonotic Spillover | Minimal (9% of SNPs) | High between isolates | Divergent basal lineages | Cameroon cases (100%) |
| Sustained Human Transmission | Dominant (74% of SNPs) | Limited diversity with shared mutations | Tightly clustered lineages | Nigerian cases (96.3%) |
| Mixed Transmission | Variable | Moderate diversity | Multiple introduction clusters | Border region cases |
Sample Collection and Sequencing
Genomic Analysis
Phylogenetic Reconstruction
Evolutionary Analysis
The One Health framework provides an essential paradigm for integrating genomic data across human, animal, and environmental domains to comprehensively address zoonotic threats. This approach recognizes that human health is inextricably linked to animal health and the environment they share [11]. Genomic technologies and bioinformatics tools serve as the connective tissue in this framework, enabling researchers to decode complex biological data and generate actionable insights for health promotion and disease prevention [11].
Successful implementation of One Health genomics requires collaboration among geneticists, bioinformaticians, epidemiologists, zoologists, and data scientists to harness the full potential of these technologies in safeguarding global health [11]. This transdisciplinary approach enhances the precision of public health responses by integrating genomic data with environmental and epidemiological information. Case studies demonstrate successful applications of genomics and bioinformatics in One Health contexts, though challenges remain in data integration, standardization, and ethical considerations in genomic research [11].
The application of One Health genomics extends beyond outbreak response to include predictive surveillance. By monitoring pathogen evolution in animal reservoirs and environmental samples, researchers can identify potential threats before they spill over into human populations. This proactive approach requires sustained investment in interdisciplinary education, research infrastructure, and policy frameworks to effectively employ these technologies in the service of a healthier planet [11].
Genomic analysis of ancient pathogens provides valuable context for understanding the patterns and processes of pandemic emergence throughout human history. Recent research on the Plague of Justinian (AD 541-750), the world's first recorded pandemic, illustrates how ancient DNA can resolve long-standing historical mysteries [28].
For the first time, researchers uncovered direct genomic evidence of Yersinia pestis, the bacterium behind the Justinian Plague, in a mass grave at the ancient city of Jerash, Jordan, near the pandemic's epicenter [28]. Using targeted ancient DNA techniques, the team recovered and sequenced genetic material from eight human teeth excavated from burial chambers beneath a former Roman hippodrome that had been repurposed as a mass grave during the mid-sixth to early seventh century [28].
Genomic analysis revealed that plague victims carried nearly identical strains of Y. pestis, confirming the bacterium was present within the Byzantine Empire between AD 550-660 and causing a rapid, devastating outbreak consistent with historical descriptions [28]. A companion study analyzing hundreds of ancient and modern Y. pestis genomes showed that the bacteria had been circulating among human populations for millennia before the Justinian outbreak. Importantly, later plague pandemics did not descend from a single ancestral strain but arose independently and repeatedly from longstanding animal reservoirs, erupting in multiple waves across different regions and eras [28]. This pattern of repeated emergence from animal reservoirs stands in stark contrast to the COVID-19 pandemic, which originated from a single spillover event and evolved primarily through human-to-human transmission [28].
Table 3: Key Research Reagent Solutions for Zoonotic Spillover Studies
| Resource/Technology | Function | Application Example |
|---|---|---|
| High-Throughput Sequencing Platforms | Generate whole genome sequences of pathogens | Illumina NovaSeq 6000 used in All of Us Research Program for clinical-grade sequencing [19] |
| BERT-infect Model | Predict zoonotic potential of viruses from genetic sequences | Leverages LLMs pre-trained on nucleotide sequences to identify human infectivity potential [26] |
| Cytoscape | Visualize complex biological networks and integrate attribute data | Open source platform for visualizing molecular and genetic interaction networks [29] |
| Graphviz | Create diagrams of abstract graphs and networks | Open source graph visualization software for structural representation [30] |
| NCBI Virus Database | Repository of viral sequences and metadata | Source for 140,638 sequences across 26 viral families for training BERT-infect [26] |
| APOBEC3 Mutation Profiling | Differentiate human-to-human transmission from zoonotic spillovers | Identified 74% of SNPs in hMPXV-1 lineage showed APOBEC3 signature vs. 9% in zoonotic cases [27] |
| Phylogenetic Analysis Tools | Reconstruct evolutionary relationships and transmission chains | Used to identify distinct zoonotic MPXV lineages crossing Nigeria-Cameroon border [27] |
| Ancient DNA Techniques | Recover and sequence genetic material from historical samples | Enabled identification of Yersinia pestis in 6th-century mass grave in Jerash, Jordan [28] |
Genomic technologies have fundamentally transformed our ability to understand, predict, and respond to zoonotic spillover events and pandemic origins. Through the application of sophisticated machine learning models, phylogenetic analysis, and molecular epidemiology, researchers can now trace the evolutionary pathways of pathogens with unprecedented precision. The integration of these approaches within a One Health framework that connects human, animal, and environmental surveillance represents the most promising strategy for mitigating future pandemic threats. As genomic technologies continue to advance and become more accessible, their implementation in global surveillance systems will be crucial for early detection and response to emerging zoonotic diseases. The continued development of open-source bioinformatics tools, standardized protocols, and collaborative research networks will ensure that the scientific community remains prepared to address the ongoing challenge of zoonotic disease emergence in an increasingly interconnected world.
Whole Genome Sequencing (WGS) has revolutionized pathogen surveillance by providing unprecedented resolution for identifying and characterizing disease-causing microorganisms. This transformation is particularly impactful within the One Health framework, which recognizes the interconnectedness of human, animal, and environmental health. The emergence of zoonotic diseases and increasing antibiotic resistance has highlighted the critical need for integrated surveillance systems that monitor infectious diseases across all domains [31]. WGS technologies have displaced traditional typing methods by offering complete genetic blueprints of pathogens, enabling public health officials to explore compelling questions about disease transmission, evolution, and virulence with unparalleled precision.
The implementation of WGS represents a fundamental shift from conventional pathogen typing methods. Historically, techniques such as serotyping, pulsed-field gel electrophoresis (PFGE), and multilocus sequence typing (MLST) provided limited discriminatory power for differentiating between closely related bacterial strains [32] [31]. While these methods served as gold standards for decades, they lacked the resolution necessary for accurate phylogenetic analyses and source attribution in complex outbreak scenarios. The advent of WGS has overcome these limitations, providing public health agencies with a tool that supports precise disease control strategies and antimicrobial resistance management on a global scale [33] [31].
The evolution of pathogen typing methodologies reveals a clear trajectory toward increasingly detailed genetic characterization:
This historical progression demonstrates how each technological advancement addressed limitations of previous methods. PFGE, while revolutionary for its time, lacked the portability and standardization needed for global surveillance systems. MLST introduced sequence-based portability but suffered from limited discriminatory power due to its focus on only seven housekeeping genes [32]. The variable number tandem repeat (MLVA) method offered improved resolution for outbreak investigations but remained insufficient for evolutionary studies and spatiotemporal investigations [32].
Modern WGS analysis employs several sophisticated bioinformatics approaches for extracting relevant information from sequencing data. The two primary methods for assessing genetic similarity between bacterial strains are:
1. Gene-by-Gene Approach (cgMLST/wgMLST) Core genome multilocus sequence typing (cgMLST) compares hundreds or thousands of gene loci across bacterial genomes. This method involves aligning genome assembly data to a predefined scheme containing specific loci and associated allele sequences [32]. Each isolate is characterized by its allele profile, with differences between profiles used to construct phylogenetic relationships. Whole-genome MLST (wgMLST) extends this approach by incorporating accessory genes beyond the core genome, potentially offering higher resolution for closely related clusters [32].
2. Single-Nucleotide Polymorphism (SNP)-Based Analysis SNP-based methods identify individual nucleotide differences across entire genomes, providing the highest possible resolution for strain discrimination [32]. This approach is particularly valuable for investigating outbreaks involving highly clonal pathogens where minute genetic differences must be detected to establish transmission pathways.
Table 1: Comparison of Primary WGS Analytical Approaches
| Feature | cgMLST | wgMLST | SNP-Based |
|---|---|---|---|
| Basis of Comparison | Hundreds to thousands of core genes | All detectable genes | Single nucleotide positions |
| Resolution | High | Very high | Highest |
| Standardization | Scheme-dependent | Scheme-dependent | Reference-dependent |
| Reproducibility | High | Moderate | Variable |
| Computational Demand | Moderate | High | Very high |
| Best Application | Routine surveillance and outbreak detection | High-resolution cluster investigation | Precise transmission tracing |
Both cgMLST and SNP-based analyses produce distance matrices that enable phylogenetic tree construction through various clustering techniques, including neighbor-joining trees, minimum-spanning trees, and hierarchical clustering [32]. The choice of method depends on multiple factors including research question, dataset complexity, computational resources, and required resolution.
The bioinformatics workflow for pathogen WGS analysis follows a structured pipeline from raw sequencing data to actionable public health information. The process integrates quality control, assembly, annotation, and multiple analytical approaches to extract epidemiological insights.
Successful implementation of WGS for pathogen surveillance requires access to sophisticated bioinformatics tools and databases. The research community has developed numerous freely available resources that support various aspects of genomic data analysis.
Table 2: Essential Bioinformatics Tools for Pathogen WGS Analysis
| Tool/Resource | Primary Function | Application in Pathogen WGS |
|---|---|---|
| NCBI Databases | Repository of sequence data and literature | Access to reference genomes and BLAST searches [34] |
| UCSC Genome Browser | Genome visualization | Reference genomes for humans and model organisms [34] |
| SRA (Sequence Read Archive) | Raw NGS data repository | Access to sequencing data from global surveillance [34] |
| Galaxy Project | Workflow management | User-friendly NGS analysis without coding [34] |
| IGV | Data visualization | Examination of BAM and VCF files [34] |
| Enterobase | cgMLST analysis | Standardized typing for multiple bacterial species [32] |
| CARD | Antimicrobial resistance detection | Identification of AMR genes [33] |
| Nextflow | Workflow automation | Reproducible pipeline execution [34] |
| Imbricataflavone A | Imbricataflavone A, MF:C33H24O10, MW:580.5 g/mol | Chemical Reagent |
| Forestine | Forestine, MF:C33H47NO9, MW:601.7 g/mol | Chemical Reagent |
These tools collectively enable researchers to process raw sequencing data, perform quality control, conduct comparative analyses, and visualize results. Platforms like Galaxy Project make sophisticated analyses accessible to laboratories without extensive bioinformatics expertise, while resources like the Sequence Read Archive facilitate data sharing and comparative analyses across institutions and geographical boundaries [34].
The One Health approach operationalizes WGS through integrated systems that connect human, animal, and environmental health data. A exemplary implementation is the Integrated Genomic Surveillance System of Andalusia (SIEGA), which has established a region-wide genomic surveillance network [31]. This system processes raw whole-genome sequencing data through species-specific open software that reports antimicrobial resistance genes and virulence factors [31]. The initiative has accumulated over 1,900 bacterial genomes including Salmonella enterica, Listeria monocytogenes, Campylobacter jejuni, and Escherichia coli from food products, factories, farms, water systems, and human clinical samples [31].
SIEGA functions as a Laboratory Information Management System (LIMS) that enables customized reporting, detection of transmission chains, and automated alerts when new bacterial isolates show concerning genetic similarities to existing database entries [31]. This integrated approach allows public health officials to monitor pathogen transmission across domains, track resistance patterns, and identify virulence factors that pose significant threats to human and animal health.
The application of WGS for antimicrobial resistance (AMR) surveillance represents one of the most significant advances in public health microbiology. WGS enables in silico detection of resistance genes and predictive analysis of resistance phenotypes, providing crucial information for clinical management and outbreak control [33]. In Campylobacter jejuni studies, WGS has been instrumental in tracking the transmission of resistance determinants along the farm-to-fork continuum, revealing how antimicrobial use in agricultural settings contributes to the emergence of resistant strains that threaten human health [33].
The relationship between genomic analysis and AMR surveillance can be visualized through a specialized workflow that connects sequencing data with resistance profiling:
Despite the transformative potential of WGS, significant challenges remain in standardizing analytical approaches across laboratories and sectors. Different cgMLST schemes for the same bacterial species may contain different loci, use different naming conventions, or assign different allele numbers to identical sequences [32]. Even schemes with identical locus definitions hosted on different services (e.g., Enterobase and Ridom SeqSphere+) may not be comparable due to lack of synchronization in allele number allocation [32].
Harmonization efforts led by organizations such as the European Food Safety Authority (EFSA) aim to establish guidelines for reporting WGS-based typing data through integrated One Health WGS systems [31]. These initiatives promote data comparability across human, animal, food, and environmental sectors, enabling truly integrated surveillance that reflects the interconnected nature of health ecosystems.
Emerging technologies like graph reference genomes address limitations of traditional linear reference sequences, which fail to account for global genomic diversity and introduce reference bias [35]. Graph-based references represent genetic variation in populations by enhancing standard references with alternate sequences connected through edges [35]. This approach improves alignment accuracy, particularly for complex variants and populations with diverse genetic backgrounds that are poorly served by linear references based primarily on European genomes [35].
Graph genome tools like GRAF (Genomic Analysis Visualization Tool) demonstrate how next-generation references can enhance variant detection, especially for compound variants where multiple mutations occur in close proximity [35]. These advances are particularly valuable for global pathogen surveillance systems that encounter genetically diverse strains across different geographical regions and host species.
Modern WGS platforms enable near real-time detection of transmission events through automated alert systems. The SIEGA system, for example, automatically notifies public health officials when newly sequenced isolates show predetermined genetic similarity to existing database entries [31]. This capability dramatically accelerates response times during outbreaks, allowing for targeted interventions before widespread transmission occurs.
Advanced visualization tools integrated with these systems, such as Nextstrain Auspice, provide intuitive displays of phylogenetic relationships, geographical distributions, and temporal transmission patterns [31]. These visualizations help communicate complex genomic data to public health decision-makers, bridging the gap between bioinformatics and applied epidemiology.
Table 3: Quantitative Outcomes from Implemented WGS Surveillance Systems
| Surveillance System | Pathogens Covered | Genomes Accumulated | Key Outcomes |
|---|---|---|---|
| SIEGA (Andalusia) | 6 major bacterial pathogens | 1,906 genomes | Customizable reports, transmission chain detection, automated alerts [31] |
| Campylobacter jejuni Surveillance | C. jejuni | Not specified | AMR tracking from farm to fork, transmission dynamics [33] |
| Graph Genome Implementation | Pan-genome applications | Not specified | Improved variant detection, reduced reference bias [35] |
Whole Genome Sequencing has fundamentally transformed pathogen identification and characterization, providing public health systems with unprecedented resolution for tracking disease transmission and antimicrobial resistance. When implemented within a One Health framework, WGS enables integrated surveillance that connects human, animal, and environmental health data, offering comprehensive insights into disease dynamics across ecosystems. Despite ongoing challenges in standardization and data harmonization, continued advancement in bioinformatics tools, reference databases, and analytical approaches will further enhance the value of genomic surveillance for global health security. The implementation of systems like SIEGA demonstrates how WGS can be operationalized across sectors to improve outbreak response, guide targeted interventions, and ultimately reduce the burden of infectious diseases through evidence-based public health action.
Metagenomic Next-Generation Sequencing (mNGS) represents a transformative approach in pathogen detection, enabling the unbiased, comprehensive identification of microbial nucleic acids in clinical and environmental samples. This technology has emerged as a powerful diagnostic tool, particularly for cases where conventional microbiological tests (CMTs) fail to identify pathogens due to their low abundance, fastidious growth requirements, or unexpected nature. The application of mNGS within a One Health framework is particularly valuable, as it provides a unified method for detecting and characterizing pathogens across human, animal, plant, and environmental domains, offering insights into disease transmission, evolution, and ecology at the interfaces of these interconnected systems [36] [20]. This technical guide examines the principles, performance, protocols, and integrative potential of mNGS for researchers, scientists, and drug development professionals working at the intersection of genomic sciences and public health.
The diagnostic utility of mNGS must be evaluated against both conventional methods and newer targeted NGS (tNGS) approaches. A 2025 study of 115 patients with lower respiratory tract infections demonstrated that both mNGS and tNGS exhibited high sensitivity (95.08% each) for diagnosing invasive pulmonary fungal infections (IPFI). The specificity of mNGS was 90.74%, compared to 85.19% for tNGS. Both methods significantly outperformed conventional microbiological tests in sensitivity and negative predictive value (NPV) [37].
A more comprehensive 2025 study comparing three NGS methodologies across 205 patients with suspected lower respiratory tract infections revealed distinct performance characteristics and operational considerations [38]. The findings are summarized in Table 1 below.
Table 1: Comparative Analysis of NGS Methodologies for Pathogen Detection
| Parameter | Metagenomic NGS (mNGS) | Capture-based tNGS | Amplification-based tNGS |
|---|---|---|---|
| Total Species Identified | 80 | 71 | 65 |
| Diagnostic Accuracy | Not Specified | 93.17% | Lower than capture-based |
| Diagnostic Sensitivity | Not Specified | 99.43% | Lower for Gram-positive (40.23%) and Gram-negative bacteria (71.74%) |
| Specificity for DNA Viruses | Not Specified | 74.78% | 98.25% |
| Turnaround Time | 20 hours | Shorter than mNGS | Shortest |
| Cost (per sample) | $840 | Lower than mNGS | Lowest |
| Key Strengths | Detects rare/novel pathogens; broad surveillance | Ideal for routine diagnostics; balances performance & cost | Rapid results; resource-limited settings |
| Key Limitations | Highest cost; longest TAT; complex data analysis | Lower specificity for DNA viruses | Poor sensitivity for many bacteria |
Another critical technical consideration is the source of microbial nucleic acid. A 2025 analysis of 125 body fluid samples found that mNGS targeting whole-cell DNA (wcDNA) demonstrated superior performance compared to cell-free DNA (cfDNA) mNGS. The host DNA proportion in wcDNA mNGS (mean 84%) was significantly lower than in cfDNA mNGS (mean 95%), contributing to its higher concordance with culture results (63.33% vs. 46.67%) [39].
For central nervous system (CNS) infections, a prospective multicenter study established optimal diagnostic thresholds for mNGS: a species-specific read number (SSRN) â¥2 for viral CNS infections, and SSRN â¥5 or 10 for bacterial meningitis, which achieved a sensitivity of 73.3% and an area under the curve (AUC) of 0.846 [40].
Proper sample handling is fundamental to successful mNGS. For bronchoalveolar lavage fluid (BALF) analysis, samples are typically collected aseptically, stored at -20°C, and processed within 24 hours [37] [40]. The initial step involves liquefaction using dithiothreitol (DTT) to break down mucus, followed by vigorous homogenization using a vortex mixer with glass beads for mechanical disruption [37] [38].
Two primary DNA extraction paths can be followed:
For comprehensive pathogen detection, RNA extraction is also performed using kits such as the QIAamp Viral RNA Kit, followed by ribosomal RNA depletion and reverse transcription [37] [38].
Library construction for DNA sequences can be performed using systems such as the Ovation Ultralow System V2 or the VAHTS Universal Pro DNA Library Prep Kit for Illumina [37] [39]. For tNGS, two enrichment strategies are employed:
Sequencing is predominantly performed on Illumina platforms (e.g., NextSeq 550, NovaSeq) [37] [39]. The required sequencing depth varies significantly by method: mNGS typically generates ~20 million reads per sample, while amplification-based tNGS requires only ~0.1 million reads [37] [38].
The bioinformatic processing of mNGS data involves multiple critical steps to distinguish true pathogen signals from background noise. A standardized workflow is essential for reliable results, as depicted below.
Figure 1: Bioinformatic Workflow for mNGS Data Analysis
Quality Control and Host Depletion: Raw sequencing data is first processed using tools like Fastp to remove adapter sequences, ambiguous nucleotides, and low-quality reads [37] [38]. Human sequence data is identified and excluded by alignment to the hg38 reference genome using software such as the Burrows-Wheeler Aligner (BWA) [37] [38].
Pathogen Identification and Thresholding: The remaining microbial reads are aligned against comprehensive pathogen databases using tools like SNAP or other aligners [37]. A critical next step is background subtraction using negative controls (no-template controls, NTC). The established reporting criteria for mNGS include [37] [40]:
Successful implementation of mNGS requires specific laboratory reagents, kits, and computational tools. The key components are cataloged in Table 2 below.
Table 2: Essential Resources for mNGS Implementation
| Category | Specific Product/Kit | Primary Function |
|---|---|---|
| Nucleic Acid Extraction | QIAamp UCP Pathogen DNA Kit (Qiagen) | Simultaneous extraction and depletion of human DNA |
| MagPure Pathogen DNA/RNA Kit (Magen) | Total nucleic acid extraction for tNGS | |
| VAHTS Free-Circulating DNA Maxi Kit (Vazyme) | Cell-free DNA extraction | |
| Library Preparation | Ovation Ultralow System V2 (NuGEN) | Library construction for low-biomass samples |
| VAHTS Universal Pro DNA Library Prep Kit (Vazyme) | DNA library preparation for Illumina | |
| Respiratory Pathogen Detection Kit (KingCreate) | Primer pool for amplification-based tNGS | |
| Target Enrichment | Multiplex PCR Primer Panels (e.g., 198-plex) | Targeted amplification of pathogen sequences |
| Probe Capture Panels | Hybridization-based enrichment for tNGS | |
| Sequencing & Analysis | Illumina Platforms (NextSeq, NovaSeq) | High-throughput sequencing |
| Fastp | Quality control and adapter trimming | |
| Burrows-Wheeler Aligner (BWA) | Removal of host sequences (alignment to hg38) | |
| SNAP | Alignment to microbial databases for pathogen identification |
The true power of mNGS is realized when integrated within a One Health framework that connects data across human, animal, and environmental health sectors. Initiatives like the Genomics for Animal and Plant Disease Consortium (GAP-DC) exemplify this approach, using genomic surveillance to address technological and policy challenges across multiple diseases and sectors [20]. Such integration enables several critical applications:
The operationalization of One Health data integration requires coordinated infrastructure for collecting, integrating, and analyzing data across sectors. This includes addressing challenges of data interoperability, governance, and semantic differences while leveraging emerging technologies like APIs and machine learning for cross-domain analytics [36].
Metagenomic NGS has established itself as a transformative technology for pathogen detection and discovery, with demonstrated clinical utility across diverse infection syndromes. When deployed within an integrated One Health framework, mNGS moves beyond a diagnostic tool to become a cornerstone of comprehensive disease surveillance, enabling proactive detection of emerging threats and understanding of pathogen ecology across the human-animal-environment interface. As sequencing costs decline and bioinformatic capabilities advance, the strategic implementation of mNGS and complementary tNGS technologies will be crucial for advancing genomic sciences research and strengthening global health security.
Antimicrobial resistance (AMR) represents one of the most critical global public health threats of our time, directly causing an estimated 1.14 million deaths annually worldwide [41]. The One Health approach recognizes that the health of humans, domestic and wild animals, plants, and the wider environment are closely linked and interdependent [9]. AMR most clearly illustrates this interconnectedness, as resistant microorganisms and their genetic determinants circulate freely across ecosystems and species boundaries [42]. The complex dynamics of AMR emergence, transmission, and persistence necessitate a collaborative, multisectoral, and transdisciplinary approach working at local, regional, national, and global levels to achieve optimal health outcomes [9].
The scale of the AMR challenge is staggering. In 2021, 4.71 million deaths globally were associated with drug-resistant infections, with the highest mortality rates disproportionately affecting sub-Saharan Africa and South Asia [43] [44]. Beyond its devastating health impacts, AMR poses severe economic threats, potentially reducing annual global GDP by an estimated 3.8% by 2050 and pushing millions into extreme poverty [43]. The World Bank estimates that addressing AMR could require over $1 trillion annually in healthcare costs alone if left unchecked [43]. This mounting crisis underscores the urgent need for integrated surveillance strategies that track AMR across all One Health sectors to inform effective interventions and policies.
The burden of AMR extends far beyond direct health consequences, creating substantial economic ripple effects across global economies and health systems. Recent estimates suggest that resistant bacterial infections alone could impose a global economic burden of $412 billion in healthcare costs and $443 billion in lost productivity annually up to 2035 [43]. The agricultural sector faces similarly stark projections, with output potentially declining by 11% by 2050, disproportionately affecting low- and middle-income countries (LMICs) [43]. These economic impacts are not evenly distributed, reflecting and potentially exacerbating existing global inequities.
The human toll of AMR continues to escalate. Without effective intervention, deaths directly attributable to AMR are projected to reach nearly 2 million annually by 2050 [41]. However, modeling suggests that 92 million infectious deaths could be prevented between 2025 and 2050 through a combination of better healthcare for severe infections and improved access to appropriate antibiotics [41]. The table below summarizes the current and projected burden of AMR across different domains.
Table 1: Global Burden of Antimicrobial Resistance
| Domain | Current Burden (2021-2023) | Projected Burden (by 2050) | Key Statistics |
|---|---|---|---|
| Human Health | 1.14 million direct deaths annually [41] | Nearly 2 million direct deaths annually [41] | 4.71 million deaths associated with drug-resistant infections in 2021 [43] [41] |
| Economic Impact | Significant and growing healthcare costs [43] | 3.8% reduction in annual global GDP [43] | $412B in healthcare costs + $443B in lost productivity annually by 2035 [43] |
| Animal & Agriculture | Production losses in livestock and fisheries [43] | 11% output decline [43] | Cumulative GDP loss of $575â$953B by 2050 [43] |
Significant inequalities mark the global distribution of AMR's impact, with LMICs experiencing up to 90% of total global deaths from AMR [44]. This disproportionate burden results from multiple intersecting factors, including high rates of infectious diseases, challenges in healthcare access, fragile supply chains for essential medicines, and inadequate water, sanitation, and hygiene (WASH) infrastructure [43] [44]. The structural and social determinants of health profoundly influence AMR vulnerability, with marginalized populations often experiencing the compounded effects of poverty, limited healthcare access, and occupational exposures [44].
Despite recognition of these disparities, global responses have inadequately addressed equity dimensions. A recent review of 145 National Action Plans (NAPs) for AMR found that 125 made no mention of sex or gender considerations, highlighting a critical gap in policy development [44]. Furthermore, surveillance capacity varies dramatically between regions, with many LMICs lacking robust laboratory infrastructure for AMR monitoring [44]. This surveillance gap creates a vicious cycle: regions with the highest AMR burdens often have the least capacity to generate data to inform interventions, perpetuating health inequities.
The World Health Organization's Global Antimicrobial Resistance and Use Surveillance System (GLASS) represents a cornerstone of global AMR monitoring efforts. Established in 2015, GLASS progressively integrates surveillance data on antimicrobials used in humans, tracks antimicrobial use, and seeks to understand the role of AMR in the food chain and environment [42]. The system provides a standardized approach to collecting, analyzing, interpreting, and sharing data by country, region, and area, enabling monitoring of national surveillance systems and emphasizing data representativeness and quality [42].
As of 2025, GLASS has expanded significantly, with 141 countries, territories, and areas participating and contributing data on over 23 million bacteriologically confirmed infection episodes between 2016 and 2023 [45] [46]. The enhanced GLASS dashboard provides a comprehensive view of crude AMR data used to generate national, regional, and global modeled estimates of surveillance coverage, antibiotic resistance, and trends [45]. This system now presents AMR data for eight bacterial pathogens frequently isolated from patients with bloodstream, gastrointestinal, urinary, or urogenital gonorrhoea infections, displaying resistance to 23 antibiotics across 11 antibiotic classes [45].
Table 2: Global AMR Surveillance Systems and Frameworks
| Surveillance System | Lead Organization | Scope | Key Outputs |
|---|---|---|---|
| GLASS | World Health Organization (WHO) [45] | Global human health surveillance with One Health expansion [45] | Standardized AMR/AMU data; 2025 report with 23M+ infection episodes [45] [46] |
| Tripartite AMR Surveillance | WHO, OIE, FAO [42] | Integrated human, animal, and food production surveillance [42] | Joint guidelines; coordinated global action plan [42] |
| National Action Plans (NAPs) | Individual member states [47] | Country-level multisectoral AMR strategies [47] | National surveillance and stewardship plans; 115+ plans implemented [47] |
Effective AMR surveillance requires integration across human, animal, and environmental sectors to track the complete circulation of resistant pathogens and resistance genes. The interconnectedness of these compartments means that resistance emerging in one sector can rapidly spread to others [42] [48]. For example, antimicrobial use in agricultureâwhich accounts for a significant volume of global antimicrobial consumptionâdirectly impacts resistance selection in environmental and human pathogens [42] [43].
The tripartite collaboration between WHO, the World Organisation for Animal Health (OIE), and the Food and Agriculture Organization (FAO) has established guidelines for integrated AMR surveillance to ensure all countries implement systems covering antimicrobial use and consumption in human and animal populations [42]. These guidelines facilitate understanding of how AMR spreads across different settings and enable assessment of interventions within and between sectors [42]. The framework emphasizes the importance of standardized methodologies, data sharing protocols, and joint analysis to inform comprehensive One Health responses to AMR.
Pathogen genomics has revolutionized the study of bacterial AMR by providing deep insights into the mechanisms, emergence, and spread of resistant pathogens [48]. Genomic technologies can answer multiple critical questions relevant to understanding AMR, including: (1) discovery and detection of underlying resistance mechanisms (mutations and genes); (2) determining the link between genetics and resistance phenotypes; (3) understanding evolution of resistance; (4) determining transmission and spread of AMR pathogens with high spatio-temporal resolution; and (5) connecting AMR across different compartments using the One Health concept [48].
The application of whole-genome sequencing (WGS) in AMR research and surveillance has expanded dramatically since the first bacterial genome was sequenced in 1995 [48]. Early WGS studies uncovered the role of mobile genetic elements, such as plasmids and transposons, in disseminating resistance genes among bacterial populations [48]. More recently, genomic approaches have enabled the discovery of novel AMR determinants, such as plasmid-mediated colistin resistance in E. coli [48], and have been further empowered by statistical genomic methods like bacterial genome-wide association studies (bGWAS) to identify resistance-associated mutations [48].
Implementing genomic surveillance for AMR requires careful consideration of multiple technical factors to generate actionable data. The key inputs for effective genomic analysis include high-quality, representative, and diverse genomic data, phenotypic AMR data, and relevant contextual epidemiologic metadata [48]. Successful integration of genomics into public health and clinical practice requires moving beyond academic exercises to establish robust workflows that meet International Organization for Standardization (ISO) standards and generate interpretable reports for public health and clinical end-users [48].
The following protocol outlines a standardized approach for genomic surveillance of AMR across One Health sectors:
Sample Collection and Processing:
DNA Extraction and Quality Control:
Whole-Genome Sequencing:
Bioinformatic Analysis:
Data Integration and Reporting:
Implementing genomic surveillance for AMR requires a comprehensive suite of laboratory reagents, sequencing technologies, and bioinformatic tools. The table below details essential components of the AMR researcher's toolkit, organized by workflow stage.
Table 3: Essential Research Reagents and Computational Tools for AMR Genomics
| Workflow Stage | Reagent/Tool | Specification/Function | Application in AMR Research |
|---|---|---|---|
| Sample Processing | Culture media (MH, BHI, CHROMagar) [48] | Standardized growth conditions for diverse pathogens | Isolation of target bacteria from complex samples |
| DNA Extraction | DNeasy Blood & Tissue Kit (Qiagen) [48] | High-quality genomic DNA extraction | Preparation of sequencing-ready DNA from pure cultures |
| Library Preparation | Nextera XT DNA Library Prep Kit (Illumina) [48] | Tagmentation-based library preparation | High-throughput sequencing library construction |
| Sequencing | Illumina NovaSeq 6000 [48] | Short-read sequencing (2Ã150 bp) | High-accuracy sequencing for variant calling |
| Sequencing | Oxford Nanopore MinION [48] | Long-read sequencing | Resolution of complex genomic regions and plasmids |
| Quality Control | FastQC [48] | Raw read quality assessment | Identification of sequencing issues and contaminants |
| Genome Assembly | SPAdes [48] | De Bruijn graph assembler | High-quality draft genome assembly from short reads |
| AMR Gene Detection | CARD (Comprehensive Antibiotic Resistance Database) [48] | Curated repository of resistance elements | Identification of known AMR genes and mutations |
| Phylogenetic Analysis | IQ-TREE [48] | Maximum likelihood phylogenetic inference | Reconstruction of evolutionary relationships and transmission chains |
| Data Visualization | Phandango [48] | Interactive phylogenetic tree visualization | Exploration of genotype-phenotype relationships |
The transformation of raw genomic data into actionable public health intelligence requires robust analytical frameworks and interdisciplinary collaboration. Genomic data must be integrated with epidemiological metadata to understand transmission dynamics across One Health sectors [48]. This integration enables researchers and public health professionals to move beyond simply detecting resistance genes to understanding how they are spreading between humans, animals, and environments [48].
Successful implementation of genomics for AMR surveillance depends on establishing clear reporting mechanisms that translate complex genomic findings into interpretable information for public health decision-makers [48]. This includes developing standardized protocols for data analysis, interpretation thresholds, and communication formats that maintain scientific rigor while being accessible to non-specialists [48]. The implementation of FAIR (Findable, Accessible, Interoperable, and Reusable) principles for data sharing ensures that genomic data can be effectively utilized across the global AMR research and public health community [48].
Genomic studies have provided unprecedented insights into the transmission dynamics of AMR across One Health compartments. Research has demonstrated links between extended-spectrum beta-lactamase (ESBL)-producing E. coli from human and food chain sources in the UK [48], while other studies have identified sharing of vancomycin-resistant Enterococcus faecium (VREfm) between livestock and humans [48]. These findings highlight the interconnectedness of resistance reservoirs and the importance of integrated surveillance.
At the local level, genomic approaches have revealed complex transmission networks within healthcare settings, informing infection control practices [48]. High-resolution sequencing has enabled researchers to track the movement of carbapenem-resistant Enterobacteriaceae (CRE) between facilities and identify previously unrecognized transmission routes [48]. Similarly, studies of within-host evolution during prolonged infections have provided insights into the evolutionary trajectories leading to resistance development, informing treatment strategies [48].
Despite the transformative potential of genomic approaches for AMR surveillance, significant implementation barriers persist. Many challenges remain for the broader implementation of genomics for AMR at local, regional, and global scales, including limited laboratory infrastructure, technical expertise shortages, and data integration complexities [48]. Equitable access to genomic technologies represents a particular concern, with resource-limited settingsâoften those with the highest AMR burdensâfacing the greatest barriers to implementation [48] [44].
The effective integration of genomic data into public health practice requires overcoming substantial technical and operational hurdles. Establishing robust workflows that meet International Organization for Standardization (ISO) standards and generate interpretable reports for public health and clinical end-users remains challenging [48]. Furthermore, sustainable data management systems capable of handling the volume and complexity of genomic data while ensuring privacy and security require significant investment and coordination across jurisdictions and sectors [48].
The future of AMR genomics is being shaped by several promising technological and methodological innovations. Rapid advances in sequencing technologies, including portable nanopore sequencing and third-generation platforms, are making real-time genomic surveillance increasingly feasible in diverse settings [48]. Coupled with improvements in bioinformatic tools and automated analysis pipelines, these technologies promise to democratize access to genomic surveillance and reduce the time from sample to actionable result.
Machine learning and artificial intelligence approaches are opening new frontiers in AMR prediction and understanding. These methods can identify complex patterns in genomic data that may not be apparent through traditional analysis, potentially enabling more accurate prediction of resistance phenotypes from genomic sequences alone [48]. Additionally, the integration of genomic data with other omics technologies (transcriptomics, proteomics) and clinical metadata promises to provide more comprehensive understanding of resistance mechanisms and host-pathogen interactions [48].
The growing recognition of AMR as a development and equity issue is also shaping future directions [44]. There is increasing emphasis on ensuring that genomic surveillance capacity is strengthened in LMICs and that research addresses the specific needs and contexts of disproportionately affected populations [44]. This includes developing appropriate technologies and frameworks that function effectively in resource-limited settings and address the structural drivers of AMR vulnerability [44].
Genomic technologies have fundamentally transformed our ability to track, understand, and respond to AMR across One Health sectors. The integration of genomic surveillance with public health practice provides unprecedented opportunities to illuminate the complex dynamics of resistance emergence and transmission at local, regional, and global scales. However, realizing the full potential of these approaches requires sustained investment in laboratory infrastructure, technical capacity, and data sharing frameworks, particularly in resource-limited settings.
The continued evolution of AMR demands a corresponding evolution in our surveillance and response capabilities. Future progress will depend on strengthening collaborative networks across human health, animal health, and environmental sectors; developing more accessible and scalable genomic technologies; and building bridges between genomic data and public health action. By embracing a comprehensive One Health approach to genomic surveillance, the global community can develop more effective strategies to preserve antimicrobial efficacy and mitigate the growing threat of AMR to global health, food security, and sustainable development.
Dengue fever represents a critical public health challenge in Brazil, with the country reporting some of the highest case numbers globally. Between 2000 and 2013 alone, Brazil reported more than 7.3 million notified dengue infections [49]. Genomic surveillance has emerged as a transformative tool for understanding and combating dengue virus (DENV) transmission, providing crucial insights into viral evolution, outbreak dynamics, and pathogen spread. This case study examines the application of genomic sciences to dengue surveillance in Brazil through the integrated lens of the One Health framework, which recognizes the interconnectedness of human, animal, and environmental health [11]. We explore how advanced genomic technologies combined with cross-sectoral collaboration are enhancing disease prevention and control strategies for this arboviral threat.
Brazil has experienced the co-circulation of all four dengue serotypes (DENV-1 through DENV-4), with different genotypes dominating during distinct outbreak periods. Genomic surveillance has been instrumental in tracking this complex epidemiological landscape:
DENV-4 Genotype Replacement: The 2012 dengue outbreak in Rio de Janeiro was primarily caused by DENV-4 genotype II, forming a monophyletic clade closely related to viruses from Venezuela and northern Brazil [49]. This outbreak marked a significant shift in DENV-4 epidemiology in Brazil. The same study also reported, for the first time in Rio de Janeiro, the detection of DENV-4 genotype I, which appeared to have been introduced from Southeast Asia on separate occasions, demonstrating the value of genomic surveillance in identifying multiple introduction events.
Current Genotype Diversity: More recent data from Yogyakarta, Indonesia (sharing similar DENV ecology with Brazil) reveals the co-circulation of six distinct genotypes across the four serotypes [50], highlighting the substantial genetic diversity that genomic surveillance must track:
Table 1: Dengue Virus Genotypes Circulating in Brazil and Their Characteristics
| Serotype | Genotype | Prevalence | Geographic Origin | First Detection in Brazil |
|---|---|---|---|---|
| DENV-1 | I, IV | Variable across outbreaks | Southeast Asia, Caribbean | 1980s |
| DENV-2 | Cosmopolitan | Dominant in multiple outbreaks | Asia, global distribution | 1990 |
| DENV-3 | I | Significant in multiple outbreaks | Central America, Asia | 2000 |
| DENV-4 | II | Caused major 2012 outbreak | Northern Brazil, Venezuela | 1981-1982 (absent until 2005-2007) |
| DENV-4 | I | Limited circulation | Southeast Asia | 2012 (Rio de Janeiro) |
Genomic sequencing has revealed critical aspects of DENV transmission patterns in Brazil. The 2012 DENV-4 outbreak in Rio de Janeiro resulted from a viral lineage whose common ancestor was estimated to have emerged in February 2010 [49]. This lineage then spread from northern Brazil to more densely populated southeastern states, demonstrating how genomic data can reconstruct transmission pathways.
Analysis of the Applying Wolbachia to Eliminate Dengue (AWED) trial showed "strong spatial and temporal structure to the DENV genomic relationships, consistent with highly focal DENV transmission around the home" [50]. This fine-scale understanding of transmission patterns directly informs targeted intervention strategies.
The standard genomic surveillance pipeline for dengue virus involves sample collection, processing, sequencing, and bioinformatics analysis.
Figure 1: Workflow for Dengue Virus Genomic Surveillance
Sample Collection and Processing: Surveillance begins with sample collection from human cases (e.g., plasma from febrile patients) or mosquito vectors [50]. During the AWED trial, venous blood was collected from participants with acute febrile illness of 1-4 days duration [50]. Viral RNA is then extracted using commercial kits such as the High Pure Viral Nucleic Acid Kit (Roche) [50].
Reverse Transcription and Amplification: Extracted RNA undergoes reverse transcription into complementary DNA (cDNA) using systems such as SuperScript IV First-strand Synthesis System with random hexamers [50]. The viral genome is then amplified in multiple overlapping fragments using DENV-specific primers in separate PCR reactions.
Library Preparation and Sequencing: Amplified products are purified and used for library construction with kits such as Nextera XT DNA Library Preparation Kit (Illumina) [50]. Libraries are normalized, pooled, and sequenced on platforms such as Illumina NextSeq2000 using 300-cycle kits [50].
The bioinformatics workflow for DENV genomic surveillance includes multiple critical steps:
Quality Control and Read Processing: Raw sequencing reads undergo quality trimming and adapter removal using tools such as AlienTrimmer, with short reads (<50 bases) and low-quality bases (quality score <30) being filtered out [50].
Alignment and Consensus Generation: Processed reads are aligned to DENV reference genomes using aligners such as BWA-MEM [50]. Consensus sequences are then generated using variant callers such as iVar with default parameters [50]. Regions with coverage below 10Ã may be imputed from consensus sequences of relevant DENV genotypes.
Phylogenetic and Evolutionary Analysis: Multiple sequence alignment is performed with tools such as MAFFT, followed by phylogenetic reconstruction using maximum likelihood methods in IQ-TREE or Bayesian approaches [50] [51]. Genotypes are assigned by examining consensus trees with statistical support measures.
Table 2: Essential Research Reagents and Tools for DENV Genomic Surveillance
| Category | Item | Specific Example | Function in Workflow |
|---|---|---|---|
| Sample Processing | Nucleic Acid Extraction Kit | High Pure Viral Nucleic Acid Kit (Roche) | RNA extraction from plasma or mosquito samples |
| Molecular Biology | Reverse Transcription System | SuperScript IV First-Strand Synthesis System | cDNA synthesis from viral RNA |
| PCR Amplification Reagents | DENV-specific primer pools | Whole genome amplification in fragments | |
| Sequencing | Library Prep Kit | Nextera XT DNA Library Preparation Kit (Illumina) | Preparation of sequencing libraries |
| Sequencing Platform | Illumina NextSeq2000 with P2 v3 300-cycle kit | High-throughput sequencing | |
| Bioinformatics | Quality Control Tool | AlienTrimmer | Adapter removal and read quality filtering |
| Sequence Aligner | BWA-MEM | Alignment of reads to reference genomes | |
| Variant Caller | iVar | Consensus sequence generation | |
| Phylogenetic Software | IQ-TREE, PhyML, MrBayes | Evolutionary analysis and tree building |
The One Health concept emphasizes transdisciplinary approaches to address complex health challenges at the human-animal-environment interface [11]. For dengue surveillance, this translates to integrated systems that monitor viral activity across multiple components of the ecosystem.
Entomological Surveillance: Mosquito collections and screening provide early warning of DENV transmission. A study in São Paulo state demonstrated this approach by testing collected Aedes aegypti and Ae. albopictus mosquitoes for CHIKV RNA, with 5.6% of specimens testing positive during outbreak periods [52]. Similar methodologies apply directly to DENV surveillance.
Environmental and Climate Factors: The abundance of Aedes mosquitoes is significantly influenced by abiotic factors such as precipitation and temperature [52]. Surveillance systems that incorporate climate data can better predict outbreak risks and seasonal patterns.
The One Health approach leverages comparative genomics across species to enhance predictive capabilities. Research initiatives are exploring "whether techniques developed to test such traits in animal and plant genetics might be adapted to explore such questions in the human genome" [10]. This cross-phyla approach may reveal novel insights into host-pathogen interactions and transmission dynamics.
Figure 2: One Health Approach to Dengue Surveillance
Genomic sequencing has proven invaluable for understanding DENV outbreak dynamics in Brazil. Analysis of the 2012 Rio de Janeiro outbreak revealed a strong correlation between genetic data and epidemiological patterns [49]. The combination of "genetic and epidemiological data from blood donor banks" has shown potential for anticipating epidemic spread of arboviruses [49].
The AWED trial demonstrated how genomic surveillance can evaluate intervention effectiveness. The study showed "wMel protection extended to all six genotypes identified amongst AWED trial participants" [50], with the Wolbachia intervention causing a "near-total disruption of transmission" [50]. This genotypic efficacy assessment is crucial for understanding the broad-spectrum protection offered by such biocontrol approaches.
Genomic surveillance provides critical data for vaccine development and evaluation through:
Genotype-Specific Efficacy Assessment: The AWED trial enabled estimation of "genotype-specific protective efficacies of wMel, which were similar (±10%) to the point estimates of the respective serotype-specific efficacies" [50]. This granular assessment ensures interventions remain effective across diverse viral genotypes.
Viral Evolution Monitoring: Continuous genomic surveillance detects mutations that might affect vaccine efficacy or disease severity, enabling proactive strategy adjustments.
Despite its transformative potential, genomic surveillance implementation faces significant challenges in Brazil and similar settings:
Resource and Infrastructure Limitations: Many Latin American countries have limited research and development investment, with national budgets for R&D "approximately one hundred-fold less than in high-income countries" [53]. This affects access to sequencing technologies, reagents, and computational resources.
Technical Capacity Building: Establishing sustainable genomic surveillance requires training personnel in both laboratory methodologies and bioinformatics analysis. Initiatives such as the Asian-American Center for Arbovirus Research and Enhanced Surveillance (A2CARES) have focused on "capacity-strengthening and peer training workshops for local health scientists" [53] to address this gap.
Data Integration and Interpretation: Combining genomic data with epidemiological and clinical information remains challenging. Effective "integration of genomic data with environmental and epidemiological information" [11] requires standardized protocols and interdisciplinary collaboration.
Future directions for enhancing dengue genomic surveillance in Brazil include:
Portable Sequencing Technologies: Field-deployable sequencing devices can enable rapid, local genomic characterization during outbreaks, reducing dependency on central laboratories.
South-South Collaboration: Partnerships between scientifically developing countries facilitate technology transfer and knowledge sharing. The collaboration between Nicaragua and Ecuador exemplifies how "South-South exchange of expertise in genomic sequencing" [53] builds regional capacity.
Standardized Frameworks: Developing consistent protocols for sample processing, data generation, and analysis ensures comparability across studies and regions. This includes addressing "data integration, standardization, and ethical considerations in genomic research" [11].
Genomic surveillance has revolutionized dengue monitoring and response in Brazil, providing unprecedented insights into viral diversity, transmission patterns, and outbreak dynamics. When implemented within a One Health framework that integrates human, vector, and environmental data, genomic approaches offer a powerful tool for understanding and controlling this significant public health threat. Despite ongoing challenges related to resource limitations and capacity building, continued advancements in sequencing technologies, bioinformatics, and international collaboration promise to further enhance Brazil's capacity to track and respond to dengue outbreaks. The institutionalization of genomic surveillance systems, as demonstrated during the COVID-19 pandemic [53], provides a foundation for sustainable arbovirus monitoring that will remain essential for public health security in the face of emerging and reemerging infectious diseases.
This case study details a comprehensive genome-wide analysis of Klebsiella pneumoniae species complex (KpSC) conducted within a One Health framework in Norway. The research analyzed 3,255 whole-genome sequenced isolates from human, animal, and marine sources collected over a 20-year period (2001-2020) to assess diversity, niche-enriched traits, and cross-reservoir transmission dynamics. Results revealed distinct but overlapping populations across niches, with limited recent direct transmission events but notable spillover of clinically relevant genetic features. Human-to-human transmission was identified as the primary route of infection, yet the study provides genomic evidence of plasmid-mediated virulence gene exchange between animal and human reservoirs. This investigation establishes a methodological blueprint for integrating large-scale genomic data into One Health research and informs targeted public health interventions for antimicrobial resistance management.
The One Health concept underscores the interconnectedness of human, animal, and environmental health, necessitating an integrated, transdisciplinary approach to tackle contemporary health challenges [11]. Genomic technologies have emerged as pivotal tools in One Health initiatives, enabling researchers to decode complex biological data for comprehensive insights into pathogen evolution, transmission dynamics, and host-pathogen interactions across species and ecosystems [11] [7].
Klebsiella pneumoniae species complex (KpSC) represents a group of opportunistic pathogens that cause severe and difficult-to-treat infections, particularly concerning with the emergence of multidrug-resistant (MDR) and hypervirulent strains [54] [55]. KpSC members are unique in their ability to colonize diverse niches, including humans, animals, and marine environments, before causing infections, raising critical questions about their transmission pathways and the clinical relevance of non-human reservoirs [55].
This case study presents a detailed examination of KpSC in Norway through a One Health genomic lens, serving as a model for integrating large-scale genomic data to address fundamental questions in pathogen ecology and transmission. The study leverages Norway's unique position with its low antimicrobial resistance prevalence, providing an exceptional opportunity to study niche connectivity in a population largely unaffected by antibiotic selection pressures [55].
The study employed a comprehensive sampling strategy encompassing diverse reservoirs across Norway (2001-2020). The collection and processing of isolates followed a standardized protocol to ensure consistency and comparability across niches.
Table: Sample Distribution Across Reservoirs
| Source Type | Subcategories | Number of Isolates | Collection Years | Sampling Details |
|---|---|---|---|---|
| Human Infections | Blood (n=1,920); Urine (n=252) | 2,172 | 2001-2018 | Nationwide surveillance across 22 clinical microbiology laboratories |
| Human Carriage | Fecal carriage | 484 | 2015-2016 | General adult population, Tromsø municipality |
| Marine Environments | Bivalves/sea urchins (n=92); Surface seawater (n=7) | 99 | 2016, 2019, 2020 | 77 production areas + 5 non-rearing locations; pooled samples (10-20 individuals) |
| Food-Producing Animals | Pigs (n=146); Turkey flocks (n=173); Broiler flocks (n=145); Wild boars (n=27); Dogs (n=16); Cattle (n=12) | 500+ | 2018-2020 | Norwegian monitoring program for AMR in veterinary sector (NORM-VET); caecal/faecal samples |
The study utilized a dual sequencing approach to maximize data quality and completeness:
The analytical framework incorporated multiple complementary approaches:
Diagram Title: One Health Genomic Analysis Workflow
The analysis revealed a complex population structure with significant diversity both between and within niches:
Table: Distribution of Key Genetic Features Across Niches
| Genetic Feature | Human Infections | Human Carriage | Pigs | Poultry | Marine |
|---|---|---|---|---|---|
| Sublineages (SLs) | SL17, SL35, SL37, SL45, SL107 | SL17, SL35, SL37 | SL17, SL35, SL37, SL107 | SL35, SL37, SL45 | SL17, SL37, SL3010 |
| Aerobactin-encoding plasmids | Limited | Limited | Enriched | Present | Absent |
| Colicin a | Limited | Limited | Enriched | Present | Absent |
| iuc3 virulence locus | Present | Present | Enriched | Limited | Absent |
GWAS approaches identified specific genetic elements associated with particular niches:
The study provided nuanced insights into transmission patterns:
Diagram Title: KpSC Transmission Network Across Niches
This section details essential reagents, technologies, and computational tools employed in this One Health genomic study, providing a resource for researchers designing similar investigations.
Table: Essential Research Reagents and Platforms for One Health Genomic Studies
| Category | Specific Tool/Technology | Application in Study | Technical Considerations |
|---|---|---|---|
| Sequencing Platforms | Illumina MiSeq/HiSeq 2500 | Primary short-read sequencing of all isolates | High accuracy, cost-effective for large sample sizes |
| Oxford Nanopore Technologies (ONT) | Long-read sequencing for hybrid assembly (16.9% of isolates) | Resolves complex regions, enables complete plasmid assembly | |
| Bioinformatics Tools | TrimGalore v0.6.7 | Adapter trimming and quality control | Critical for data standardization across batches |
| SPAdes v3.15.4 | Genome assembly from short reads | Produces high-quality draft genomes | |
| Hybrid assembly pipelines | Combine short and long reads for complete genomes | Essential for plasmid and repetitive element resolution | |
| Analytical Frameworks | Genome-Wide Association Study (GWAS) | Identification of niche-enriched genetic traits | Requires careful correction for population structure |
| Phylogenomic analysis | Reconstruction of evolutionary relationships | Core genome approaches provide highest resolution | |
| Transmission analysis | Inference of recent transmission events | â¤22 SNP threshold balances sensitivity/specificity | |
| Specialized Reagents | Multiplexed library preparation kits | High-throughput sequencing library construction | Enables cost-effective processing of large sample sets |
| Quality control reagents (Qubit, Bioanalyzer) | DNA quantification and qualification | Essential for sequencing success | |
| Pungiolide A | Pungiolide A, MF:C30H36O7, MW:508.6 g/mol | Chemical Reagent | Bench Chemicals |
| Hebeirubescensin H | Hebeirubescensin H, MF:C20H28O7, MW:380.4 g/mol | Chemical Reagent | Bench Chemicals |
The findings from this One Health genomic analysis carry significant implications for designing effective public health interventions:
This case study highlights several critical methodological aspects for One Health genomic investigations:
This study aligns with broader European efforts to advance genomic medicine, including the 1+ Million Genomes (1+MG) initiative which aims to enable secure access to genomics and corresponding clinical data across Europe [57] [58]. The Genomic Data Infrastructure (GDI) project, deploying a federated, sustainable, and secure infrastructure for genomic and clinical data access, provides a framework that could expand to incorporate One Health data [57] [59]. Such integration would enhance cross-border collaboration and enable more comprehensive understanding of pathogen dynamics across human, animal, and environmental interfaces.
This One Health genomic analysis of Klebsiella pneumoniae in Norway demonstrates the power of integrated genomic approaches to elucidate complex transmission dynamics at the human-animal-environment interface. While human-to-human transmission predominates for KpSC, the study provides definitive evidence of cross-reservoir spillover of clinically relevant strains and mobile genetic elements, particularly between pig and human populations.
The methodological framework presented offers a blueprint for future One Health genomic studies, emphasizing the importance of standardized protocols, comprehensive sampling, and advanced bioinformatic integration. As genomic technologies become increasingly accessible and bioinformatic capabilities expand, the integration of One Health principles into public health surveillance and intervention strategies will be crucial for addressing the interconnected challenges of antimicrobial resistance, emerging pathogens, and global health security.
The findings reinforce that safeguarding human health requires a holistic approach that considers the complex ecological networks in which pathogens evolve and spread. Future research should build upon this foundation to explore functional mechanisms of niche adaptation and expand surveillance to underrepresented reservoirs, ultimately enabling more predictive and proactive public health interventions.
The One Health concept underscores the inextricable linkages between human, animal, and environmental health, promoting an integrated, transdisciplinary approach to tackle contemporary health challenges [60]. In genomic sciences, this approach necessitates comprehensive surveillance and analysis of pathogens, antimicrobial resistance (AMR) markers, and health determinants across all three domains [61] [7]. However, the implementation of this vision faces significant computational and bioinformatics hurdles due to the enormous scale, diversity, and distributed nature of the resulting genomic data. The projection that antimicrobial resistance alone could cause 10 million deaths annually by 2050 has reinforced its status as one of the greatest global health threats of the 21st century, demanding more sophisticated analytical approaches [61]. This technical guide examines the core computational challenges in One Health genomics and provides detailed methodologies for overcoming them, enabling researchers to advance this critical field.
The integration of multi-sectoral genomic data within a One Health framework presents distinct computational obstacles that must be addressed to realize its full potential. These challenges span data management, processing, analysis, and interpretation phases.
Table 1: Key Computational Challenges in Large-Scale One Health Genomics
| Challenge Category | Specific Technical Hurdles | Impact on One Health Research |
|---|---|---|
| Data Volume & Transfer | Petabyte-scale data generation; network limitations for transfer [62] | Hinders integration of distributed human, animal, and environmental datasets |
| Data Heterogeneity | Non-standardized formats across platforms and sectors [61] [62] | Impedes cross-species comparisons and integrated analysis |
| Computational Complexity | NP-hard problems (e.g., Bayesian network reconstruction); memory-intensive operations [62] | Limits ability to model complex health interactions across domains |
| Resource Constraints | High sequencing costs; bioinformatics complexity; infrastructure limitations in tropical regions [61] [7] | Creates geographic disparities in One Health implementation capacity |
| Data Integration & Visualization | Merging multi-omics datasets (genomic, proteomic, metabolomic) [63] | Challenges in creating unified views of cross-sectoral health relationships |
The challenges are particularly acute in tropical regions, which contain over 80% of Earth's biodiversity and approximately 40% of the global population, yet face significant infrastructure limitations [7]. Additionally, the separation of human, animal, and plant genetics into distinct research silos has historically limited the exchange of analytical methods and insights between these critical One Health domains [10].
Effective processing of One Health genomic data requires robust computational frameworks and workflow management systems designed for scale and reproducibility. The following guidelines provide a foundation for large-scale data processing:
A critical supplementary rule involves understanding where the computational burden originates and designing around it [64]. Different analysis types impose distinct requirements â some demand extensive computing power, while others require substantial memory or storage resources. For variant calling across multiple samples in a One Health context, the natural approach is to distribute one sample per analysis initially, then perform joint variant calling by distributing work per chromosome or genomic region [64].
Diagram 1: Computational workflow for One Health genomic data analysis (76 characters)
Organizing files and analyses with comprehensive metadata is crucial for One Health genomics, where data originates from diverse sources and requires integration. Effective metadata management includes:
<sampleID>_<source>_<processing>.fileformat) [64]For One Health applications, metadata should specifically capture the originating domain (human, animal, environmental), geographical location, sampling date, and relevant ecological parameters to enable meaningful cross-domain analysis.
Comprehensive AMR surveillance requires a dual approach combining isolate-based whole-genome sequencing (WGS) with shotgun metagenomics for complex matrices [61]. The following protocol provides a detailed methodology:
Sample Collection and Processing:
Sequencing Methodologies:
Bioinformatic Analysis:
Technical Considerations:
The Purdue University One Health initiative demonstrates innovative approaches for cross-species genomic analyses that leverage methods developed in animal and plant genetics for human health applications, and vice versa [10]. The following protocol outlines this methodology:
Data Collection:
Analytical Framework:
Implementation:
Effective visualization is essential for interpreting the complex relationships in One Health genomic data. The Gviz package in Bioconductor provides a flexible framework for visualizing genomic data in the context of diverse annotation features, integrating tightly with existing Bioconductor infrastructure [65]. Key capabilities include:
For large-scale genomic datasets, visual scalability becomes critical [66]. As genomic datasets grow exponentially, visualization tools must efficiently represent data at different resolutions â from chromosome-level structural variations to nucleotide-level sequences. This requires:
Table 2: Essential Bioinformatics Tools for One Health Genomics
| Tool Category | Specific Tools | Application in One Health |
|---|---|---|
| Workflow Management | Nextflow, Snakemake, Apache Hadoop [63] [64] | Automated processing of multi-sectoral genomic data |
| Variant Analysis | GATK, BCFtools [63] | Cross-species variant identification and annotation |
| Genome Visualization | Gviz, IGV, UCSC Genome Browser [65] [66] | Visual integration of genomic data across domains |
| Metagenomic Analysis | MetaPhlAn, HUMAnN, SHOGUN | Profiling microbial communities across environments |
| Database Integration | BioMart, NCBI GenBank, EMBL-EBI [63] | Accessing and combining diverse biological data |
| Cloud Platforms | Google Cloud Life Sciences, AWS Genomics Workflows [63] | Scalable computation for large One Health datasets |
Diagram 2: One Health genomic data analysis and visualization pipeline (82 characters)
Table 3: Essential Research Reagents and Computational Resources for One Health Genomics
| Resource Category | Specific Items | Function in One Health Research |
|---|---|---|
| Sequencing Technologies | Oxford Nanopore MinION, Illumina NovaSeq, PacBio Sequel [61] [5] | Portable and high-throughput DNA/RNA sequencing across field and lab settings |
| DNA Extraction Kits | Optimized kits for diverse sample types (clinical, environmental, veterinary) | High-quality DNA extraction from varied One Health specimen sources |
| Cloud Computing Platforms | Google Cloud Life Sciences, AWS Genomics, Microsoft Bioinformatics [63] | Scalable computational resources for large-scale cross-domain analyses |
| Bioinformatics Pipelines | Nextflow/Snakemake workflows, GATK, BCFtools [63] [64] | Standardized processing and analysis of genomic data across projects |
| Genomic Databases | NCBI GenBank, EMBL-EBI, ResFinder, Pathogenwatch [61] [63] | Reference data for annotation and comparative analysis |
| Visualization Tools | Gviz, IGV, Cytoscape, Tableau [65] [66] [63] | Visual exploration and communication of complex cross-domain relationships |
| Sample Collection Materials | Environmental sampling kits, veterinary swabs, clinical specimen containers | Standardized collection of samples across One Health domains |
Addressing the computational and bioinformatics challenges in large-scale data analysis is fundamental to advancing the One Health agenda in genomic sciences. By implementing robust computational frameworks, standardized protocols, and scalable visualization tools, researchers can effectively integrate genomic data across human, animal, and environmental domains. The solutions outlined in this guide â from workflow automation and metadata management to cross-species analytical methods â provide a foundation for extracting meaningful insights from complex One Health datasets. As genomic technologies continue to evolve, particularly with the advent of real-time sequencing [5], the computational approaches described here will enable more timely interventions and evidence-based policies to promote health across all sectors. Strategic investment in bioinformatics capacity, particularly in underserved tropical regions [7], will be essential to ensuring equitable implementation of One Health genomics worldwide.
The One Health concept underscores the inextricable links between human, animal, and environmental health, advocating for a unified approach to tackle complex global health challenges [60]. In genomic sciences, this approach is particularly crucial, as pathogens and antimicrobial resistance genes circulate freely across species and ecosystems, disregarding disciplinary boundaries. The modern landscape of global health threats, including emerging infectious diseases and antimicrobial resistance (AMR), demands a collaborative, transdisciplinary methodology that traditional sector-specific approaches cannot adequately address [67]. Genomic research, with its capacity to decode complex biological data across species, provides an unprecedented opportunity to understand pathogen evolution, transmission dynamics, and host-pathogen interactions at the human-animal-environment interface [60].
Despite this potential, significant structural and operational barriers impede effective collaboration. Fragmented funding streams, disciplinary silos, and absence of standardized protocols often prevent the integration of knowledge and resources necessary for comprehensive One Health genomic surveillance [68]. The institutionalization of multisectoral coordination remains highly complex, requiring deliberate strategies to bridge communication gaps, align priorities, and establish shared frameworks for data generation and interpretation [69]. This technical guide outlines evidence-based strategies and practical methodologies for fostering effective multi-sectoral and multi-disciplinary collaboration in One Health genomic research, providing researchers, scientists, and drug development professionals with the tools to break down persistent silos and harness the full potential of integrative science.
A foundational step in building effective multi-sectoral partnerships involves precisely understanding key terms often used interchangeably. Within the context of One Health initiatives, coordination and collaboration represent distinct but complementary concepts in the spectrum of partnership.
Coordination refers to the organization and management of human and/or physical resources to achieve a common goal, serving as a core element of collaboration [69]. It focuses on organizing people, organizations, or resources but does not always require reciprocal engagement. A practical example includes a team leader delegating tasks to team members or the allocation of vaccines from national warehouses to subnational storage centers.
Collaboration, in contrast, describes an evolving process involving the active and reciprocal engagement of two or more social entities (e.g., people, teams, organizations) working together toward a shared goal [69]. This process occurs at multiple levels, including within the same team, across teams, departments, organizations, sectors, and between countries.
Dr. Muhammad Sartaj illustrated this distinction using the example of an orchestra: "During a performance, the conductor coordinates the musicians by guiding them with hand movements as they play a symphony. This is coordination. However, the process that takes place before the performanceâsuch as rehearsals, sitting together, and sharing knowledgeâis an example of collaboration" [69]. This analogy clarifies that while coordination manages aligned activities, collaboration involves deeper integration of expertise and shared learning. For One Health genomic research, both elements are essential: coordination ensures efficient operational workflows, while collaboration enables the transdisciplinary integration of knowledge required to interpret complex genomic data across health domains.
Implementing successful One Health programs, particularly in genomics, requires navigating significant systemic barriers. Evidence from low- and middle-income countries (LMICs) and global assessments highlights consistent challenges that hinder effective multi-sectoral collaboration.
Table 1: Key Systemic Barriers to One Health Collaboration
| Barrier Category | Specific Challenges | Impact on Genomic Research |
|---|---|---|
| Governance & Political Will | Lack of political will, weak governance, unclear goals and roles, absence of formal policy decision-making processes [70] [68]. | Fragmented leadership for cross-sector genomic surveillance initiatives; inadequate policy support for data sharing. |
| Funding Structures | Top-down funding frameworks that create silos, competing priorities between sectors, rivalries over budget allocation [70] [68]. | Disconnected funding streams for human, animal, and environmental genomics; limited resources for integrated AMR surveillance. |
| Technical & Resource Capacity | Lack of human, financial, and logistics resources, high sequencing costs, bioinformatics complexity [70] [71]. | Limited capacity for WGS and metagenomics in veterinary and environmental health programs; inequitable resource distribution. |
| Data Standardization | Variability of sub-cultures within disciplines, inconsistencies in sequencing protocols across laboratories, absent standardized bioinformatics workflows [68] [71]. | Hindered cross-sector comparability; inability to track resistance transmission across geographic and ecological boundaries. |
| Trust & Institutional Dynamics | Limited trust and co-ownership within and between organizations, conflicting priorities between stakeholders from different sectors [69] [68]. | Reluctance to share preliminary genomic data; insufficient credit-sharing mechanisms across disciplines. |
These barriers collectively create a landscape where siloed operations often prevail over integrated approaches. Particularly in genomic AMR surveillance, implementation remains fragmented, predominantly focused on clinical settings with insufficient integration across One Health compartments [71]. Addressing these challenges requires targeted strategies that create enabling environments for collaboration, which we explore in the following section.
The integration of coordination mechanisms within governmental and research bodies requires establishing supportive policies, legislation, and frameworks to enable their effective operationalization. Enacting such legal frameworks strengthens national governance of coordination bodies, facilitates resource allocation, and enhances accountability [69]. A critical first step involves identifying and formally engaging relevant sectors. The purpose and rationale for engaging sectors must be clearly delineated and articulated to ensure the right sectors are involved, optimize resource use, increase transparency, and foster trust [69].
EMPHNET's Stakeholder Mapping and Analysis Tool provides a structured approach for this process, using three primary domains to assess and prioritize stakeholders [69]:
Following stakeholder identification, Dr. Scott Dowell recommends that countries and research consortia employ a systematic approach to public health emergency coordination that is equally applicable to sustained research collaboration. This approach includes four main steps [69]:
A structured toolkit for implementing multisectoral collaboration, developed by the Centre for Environmental and Social Health (CESH), provides a dynamic, iterative process involving four key steps [72]. The following diagram visualizes this continuous cycle:
Diagram 1: Multisectoral Collaboration Cycle. This framework outlines the four iterative steps for establishing and maintaining effective collaboration, emphasizing continuous learning and adaptation [72].
For each step in this cycle, specific tools and processes enhance effectiveness:
Step 1: Convene and Plan - This foundational stage involves stakeholder mapping and analysis to identify key actors, followed by jointly defining the problem using systems thinking, establishing shared goals, and agreeing on implementation approaches [72]. This ensures all participants have a shared understanding of objectives and commitment to the collaborative process.
Step 2: Establish Structure - Critical operational elements include clarifying roles and responsibilities, agreeing on leadership models that emphasize traits facilitating collaboration, setting up governance mechanisms and processes, and planning administrative coordination to support shared vision [72].
Step 3: Execute and Coordinate - Implementation requires clear coordination channels, continuous reflection and adaptation using frameworks like the 5R framework or CARL framework, and active relationship building and maintenance among partners [72].
Step 4: Monitor and Adapt - Continuous evaluation assesses both the collaboration's effectiveness and its outcomes, with findings disseminated to stakeholders to generate appropriate solutions that enhance collaboration [72].
Beyond structural frameworks, specific enabling factors have been identified as critical for sustaining One Health collaborations. Evidence from systematic reviews indicates that the existence of a reference framework document for One Health activities, good coordination between different sectors at various levels, and joint multisectoral meetings that advocate the One Health approach significantly enhance collaborative success [70]. Furthermore, the availability of dedicated funds and adequate resources coupled with the support of technical and financial partners is a fundamental enabler [70].
Funding strategies that create incentive structures through joint budgets or special grants for One Health activities are particularly effective in promoting collaboration [68]. These mechanisms directly address the systemic barrier of siloed funding by creating shared resources that necessitate cooperative planning and implementation. Additionally, research networks and collaborations that incentivize knowledge exchange facilitate the cross-disciplinary dialogue essential for interpreting complex genomic data across human, animal, and environmental health contexts [68].
The application of genomics within a One Health framework is particularly advanced in antimicrobial resistance (AMR) surveillance, providing a model for other collaborative genomic research areas. An integrated genomic framework combines isolate-based whole-genome sequencing (WGS) with shotgun metagenomics within a cohesive One Health strategy [71]. The following workflow illustrates this integrated approach:
Diagram 2: Integrated Genomic AMR Surveillance Workflow. This framework combines isolate-based WGS and metagenomics across One Health sectors for comprehensive resistance monitoring [71].
The reliability of genomic AMR surveillance depends on well-defined sequencing protocols, encompassing sample processing, sequencing methodology, and bioinformatics workflows [71]. Standardizing these technical elements is critical for cross-sectoral comparability and data integration.
Table 2: Technical Specifications for Genomic AMR Surveillance
| Technical Element | Isolate-Based WGS | Shotgun Metagenomics |
|---|---|---|
| Primary Application | High-resolution characterization of priority pathogens and MDR/XDR strains [71]. | Analysis of complex microbial communities and unculturable bacteria from diverse reservoirs [71]. |
| Sample Requirements | High-quality genomic DNA from bacterial isolates [71]. | Optimized DNA extraction from complex environmental samples [71]. |
| Sequencing Platforms | Short-read (Illumina) for SNP detection; Long-read (Oxford Nanopore, PacBio) for complete assemblies [71]. | Short-read for affordable characterization; Long-read for mobile genetic element detection [71]. |
| Sequencing Depth | â¥100à coverage for SNP detection and outbreak investigations; 30-50à for broader surveillance [71]. | Balance sensitivity for rare resistance genes with financial feasibility [71]. |
| Bioinformatics Pipeline | Standardized pipelines for ARG annotation, strain typing, phylogenetic analysis [71]. | Harmonized workflows for resistome characterization, virulence factor annotation [71]. |
| Data Integration | Integration with global AMR databases (NCBI, Pathogenwatch, ResFinder) [71]. | Cross-sectoral comparability through standardized data-sharing protocols [71]. |
| D-Dimannuronic acid | D-Dimannuronic acid, MF:C12H18O13, MW:370.26 g/mol | Chemical Reagent |
| Erysenegalensein E | Erysenegalensein E, MF:C25H26O6, MW:422.5 g/mol | Chemical Reagent |
Implementing the technical workflows for One Health genomic surveillance requires specific research reagents and materials. The following table details essential solutions for integrated AMR monitoring:
Table 3: Key Research Reagent Solutions for One Health Genomic Surveillance
| Reagent/Material | Function | Application Context |
|---|---|---|
| High-Fidelity DNA Polymerases | Ensure accurate amplification with minimal errors during library preparation [71]. | Critical for both WGS and metagenomic sequencing across all sectors. |
| Metagenomic DNA Extraction Kits | Optimized for diverse sample matrices (soil, water, feces) to maximize yield and representativeness [71]. | Essential for environmental and animal reservoir sampling where inhibitor removal is challenging. |
| Targeted Enrichment Panels | Capture specific resistance genes or pathogens from complex samples [71]. | Screening for known high-priority ARGs in surveillance programs. |
| Standardized Assembly Algorithms | Unified bioinformatics tools for genome reconstruction and annotation [71]. | Enabling cross-laboratory and cross-sector comparability of genomic data. |
| Curated ARG Databases | Reference databases for consistent annotation of resistance determinants [71]. | Harmonizing gene nomenclature across human, animal, and environmental health sectors. |
| Platform-Specific Library Prep Kits | Tailored reagents for different sequencing technologies (Illumina, Nanopore, PacBio) [71]. | Ensuring compatibility with the specific sequencing infrastructure available across sectors. |
Breaking down silos in One Health genomic research requires both technical and relational foundations. The strategies outlined in this guideâfrom establishing formal governance structures and implementing structured collaboration frameworks to standardizing technical protocolsâprovide a roadmap for researchers, scientists, and drug development professionals to overcome disciplinary and sectoral boundaries. The integrated genomic surveillance framework for AMR demonstrates the powerful insights achievable when human, animal, and environmental health sectors collaborate effectively, combining isolate-based WGS with metagenomics to track resistance dynamics across ecosystems [71].
Ultimately, successful multi-sectoral collaboration in genomic sciences requires a fundamental shift in perspectiveârecognizing that results and breakthroughs will not be achieved within single institutions or organizations but will be shared among One Health programs [68]. This paradigm change, supported by enabling funding structures, standardized technical protocols, and formal coordination mechanisms, promises to unlock the full potential of genomics to address complex global health challenges. By deliberately implementing these strategies, the scientific community can transform One Health from a conceptual framework into an operational reality, fostering a more resilient and integrated approach to health security in an interconnected world.
Genomic technologies are revolutionizing disease understanding and therapeutic development, yet their benefits remain unevenly distributed. Achieving equitable access and building sustainable capacity represents both an ethical imperative and a practical necessity for maximizing scientific progress. This is particularly true within a One Health framework, which recognizes the profound interconnections between human, animal, and environmental health. Pathogen surveillance, zoonotic disease tracking, and ecosystem monitoring all depend on robust, distributed genomic capabilities. When genomic capacity is concentrated in only a few well-resourced regions, it creates blind spots in our global health defense system, allowing emerging threats to go undetected and unaddressed. This whitepaper provides a technical guide for researchers and drug development professionals to implement practical strategies that build genomic capacity and ensure equity, thereby strengthening our collective ability to address health challenges across species and ecosystems.
The integration of genomic technologies into healthcare and research has unveiled significant disparities in access and implementation. A 2024 review of international genomic health policies found that while the core concept of "access" is frequently cited, other critical components of equityâsuch as cultural responsiveness, non-discrimination, and community participationâare covered to a much lesser degree [73]. The analysis revealed a complete absence of policy addressing the concepts of "liberty" and "entitlement" in genomic care, highlighting a significant gap in the foundational equity framework [73].
These policy shortcomings manifest in concrete ways. In genomic research, nearly 95% of genomic data is derived from European ancestral populations, creating a massive "data equity problem" that undermines the generalizability and effectiveness of precision medicine across all populations [74]. Furthermore, racial and ethnic minorities, Indigenous communities, and rural residents consistently demonstrate less access to genetics health services, a disparity that persists even when demand exists [73] [75]. For instance, Indigenous Australians are three-fold underrepresented in clinical genetic services despite a clear need [73].
Table 1: Key Disparities in Genomic Technology Access
| Disparity Dimension | Key Findings | Implications |
|---|---|---|
| Genomic Data Composition | 95% of genomic data from European ancestral populations [74] | Reduced diagnostic accuracy, ineffective therapies for underrepresented groups |
| Geographic Access | Limited access in rural/underserved areas; services concentrated at major hospitals [75] [74] | Delayed diagnoses, prolonged "diagnostic odyssey" |
| Racial/Ethnic Equity | Under-representation of minority groups in genetic services [73] [74]; Structural racism in NICU care [76] | Widening health disparities, perpetuation of historical inequities |
| Workforce Distribution | Shortage of skilled professionals (bioinformaticians, genetic counselors) creates implementation bottlenecks [77] | Limited capacity to integrate genomics into routine care |
A capable workforce is the cornerstone of genomic capacity. However, a significant shortage of skilled professionalsâincluding bioinformaticians, genetic counselors, clinical geneticists, and data analystsâcreates critical bottlenecks in implementing genomic programs, even in high-growth regions [77]. Addressing this requires a multi-faceted approach.
Strategic investments in education and training are paramount. The National Institutes of Health (NIH) has funded the Diversity Centers for Genome Research Consortium to directly address the need for workforce diversity and research capacity building [78]. This initiative employs common data elements (CDEs) to capture evaluation outputs, synergize reporting, and facilitate continuous quality improvement. The five CDEs focus on: genomics programs and equipment, scientific productivity, scientific collaboration, community engagement, and workforce development [78].
Simultaneously, building capacity within the existing primary care workforce is essential for mainstreaming genomics. A 2025 scoping review identified that the most significant barriers for primary care practitioners (PCPs) are knowledge gaps, environmental context and resources, and skills deficiencies [79]. The review recommends multifaceted, evidence-based education strategies with interactive components to change practitioner behavior, alongside a clear clarification of PCPs' roles and referral pathways to tertiary genetics services [79].
The following protocol outlines a methodology for embedding capacity building within a genomic research study, aligning with the One Health principle of sustainable system strengthening.
Protocol Title: Integrated Workflow for Equitable Genomic Diagnosis and Workforce Training
Objective: To establish a replicable model for providing rapid genomic testing while simultaneously training a multidisciplinary clinical and bioinformatics workforce in underserved settings.
Materials and Reagent Solutions:
Methodology:
This protocol creates a positive feedback loop: the clinical need drives the research activity, which in turn creates a authentic training environment that builds lasting local capacity.
Diagram 1: Integrated Capacity Building Workflow. This model shows how training and service delivery are synergized to create a sustainable local genomic ecosystem.
Moving beyond theoretical commitment to practical action requires innovative service-delivery models and technologies designed specifically for equity.
The SeqFirst-neo program provides a powerful experimental model for centering equity at the point of care [76]. This research initiative tested a genotype-driven workflow using simple, broad exclusion criteria for rapid genome sequencing (rGS) in a neonatal intensive care unit (NICU), rather than complex clinical inclusion criteria that often incorporate unconscious bias.
Experimental Protocol:
Other innovative strategies are emerging to bridge the equity gap:
Table 2: Analysis of Innovative Equity-Focused Genomic Models
| Model Name | Core Innovation | Key Outcome | One Health Relevance |
|---|---|---|---|
| SeqFirst-neo [76] | Genotype-first workflow with simple exclusion criteria | 9x greater odds of diagnosis; eliminated diagnostic disparity for Black infants | Standardized triage can be adapted for animal disease surveillance and environmental pathogen monitoring. |
| Telegenetics [75] | Virtual delivery of genetic services | Improved access for rural/underserved populations; effective remote counseling | Enables expert consultation across vast geographic distances, crucial for veterinary care and ecosystem management. |
| Texome Project [74] | Proactive provision of comprehensive genetic services to underserved populations | Increased access to WES and long-term follow-up for underrepresented groups | Demonstrates a proactive, system-based approach to closing equity gaps, a model for inclusive planetary health surveillance. |
| Capacity Building Framework [81] [79] | Community and practitioner engagement as a core research component | Builds trust, enhances recruitment, improves PCP genomic capability | Builds resilient, distributed health networks capable of responding to cross-species health threats. |
Diagram 2: Contrasting Genomic Testing Workflows. The shift from a phenotype-first to a genotype-first model is fundamental to reducing bias and achieving equity.
Table 3: Research Reagent Solutions for Equitable Genomic Capacity
| Tool Category | Specific Technology/Reagent | Function in Equitable Capacity Building |
|---|---|---|
| Sequencing Platforms | Oxford Nanopore portable sequencers [80] | Enables real-time, in-field sequencing for remote labs and pathogen surveillance, decentralizing capability. |
| Sequencing Platforms | Ultima Genomics UG 100 [80] | Drives down cost-per-base, making large-scale studies and population screening more economically feasible. |
| Bioinformatics Tools | Cloud Computing (AWS, Google Cloud) [80] | Provides scalable, cost-effective data storage and analysis, removing the need for expensive local computing infrastructure. |
| Bioinformatics Tools | AI/ML Tools (e.g., DeepVariant) [80] | Automates and improves accuracy of variant calling, reducing dependency on highly specialized expert labor. |
| Diagnostic Kits | Whole Exome/Genome Sequencing Kits [74] | Provides comprehensive, hypothesis-free testing, crucial for diagnosing conditions with non-specific presentations across diverse populations. |
| Collaborative Frameworks | Common Data Elements (CDEs) [78] | Standardizes evaluation metrics across capacity-building projects, enabling synergy and comparative learning. |
Building capacity and ensuring equity in genomic technology is not merely an ethical adjunct to scientific progress; it is a fundamental component of robust, reproducible, and effective science. The strategies outlinedâfrom workforce development and policy reform to the implementation of genotype-first workflows and telegeneticsâprovide a roadmap for creating a more inclusive genomic future. For the One Health community, these efforts are particularly critical. The threats of pandemics, antimicrobial resistance, and environmental change do not respect national or socioeconomic boundaries. A distributed, equitable global genomic infrastructure is our best defense, allowing for early detection and rapid response to health challenges at the human-animal-environment interface. By investing in the frameworks, technologies, and trained personnel needed to extend genomic capabilities to all corners of the world, we are not only leveling the playing field for human health but also fortifying our collective resilience against the complex health challenges of the 21st century.
The FAIR Guiding PrinciplesâFindable, Accessible, Interoperable, and Reusableârepresent a foundational framework for scientific data management and stewardship that has gained significant traction since their formal publication in 2016 [82]. These principles were specifically designed to address the challenges posed by the increasing volume, complexity, and creation speed of data in modern research ecosystems [82]. The core innovation of FAIR lies in its emphasis on machine-actionability, enabling computational systems to find, access, interoperate, and reuse data with minimal human intervention [83]. This characteristic makes FAIR particularly crucial for data-intensive fields like genomic sciences, where the scale of data often exceeds human processing capabilities.
Within the context of One Healthâan integrated, unifying approach that aims to sustainably balance and optimize the health of people, animals, and ecosystems [1]âthe FAIR principles provide an essential framework for overcoming data fragmentation across sectors. The One Health approach recognizes that the health of humans, domestic and wild animals, plants, and the wider environment are closely linked and interdependent [1]. This interconnectedness generates complex, multidisciplinary data streams that require sophisticated integration methods to be meaningful for research and public health decision-making. The Digital One Health (DOH) framework has emerged as a response to this challenge, proposing a structured approach to data integration that aligns closely with FAIR objectives [84].
The convergence of FAIR principles with One Health approaches in genomic research creates a powerful paradigm for addressing pressing global challenges such as infectious diseases, antimicrobial resistance, and food safety [1]. This technical guide explores the implementation of FAIR data principles within One Health-oriented genomic sciences, providing researchers with practical methodologies, visualization tools, and standardized protocols to enhance data quality, interoperability, and reuse across disciplinary boundaries.
The FAIR principles are organized into four distinct but interconnected pillars, each addressing specific aspects of data management:
Findable: The initial step in (re)using data is its discovery. To enhance findability, both metadata and data should be easily discoverable by humans and computers alike [82]. This requires that (meta)data are assigned globally unique and persistent identifiers, described with rich metadata, and registered or indexed in searchable resources [83]. Machine-readable metadata is particularly crucial for enabling automatic discovery of datasets and services [82].
Accessible: Once identified, users must understand how data can be accessed. FAIR stipulates that (meta)data should be retrievable by their identifier using a standardized communications protocol that is open, free, and universally implementable [83]. Importantly, metadata should remain accessible even when the corresponding data is no longer available, preserving the record of what once existed [83].
Interoperable: Effective data integration requires that data can operate with applications or workflows for analysis, storage, and processing [82]. Interoperability is achieved through the use of formal, accessible, shared languages for knowledge representation, vocabularies that follow FAIR principles themselves, and qualified references to other (meta)data [83]. This pillar enables the merging of diverse datasets from human, animal, and environmental domains within the One Health framework.
Reusable: The ultimate objective of FAIR is to optimize data reuse [82]. Reusability depends on rich description of data with a plurality of accurate and relevant attributes, clear usage licenses, detailed provenance, and adherence to domain-relevant community standards [83]. This ensures that data can be replicated and/or combined in different settings by different researchers.
Table 1: Detailed breakdown of the FAIR principles and their implementation considerations.
| FAIR Pillar | Principle Code | Principle Statement | Key Implementation Considerations |
|---|---|---|---|
| Findable | F1 | (Meta)data are assigned a globally unique and persistent identifier | Use of DOIs, ARKs, or other persistent ID systems; resolution through identifier services |
| F2 | Data are described with rich metadata | Metadata richness follows community standards; multiple attribution elements | |
| F3 | Metadata clearly and explicitly include the identifier of the data they describe | Bidirectional linking between data and metadata; unambiguous association | |
| F4 | (Meta)data are registered or indexed in a searchable resource | Deployment in discoverable repositories; indexing by specialized data search engines | |
| Accessible | A1 | (Meta)data are retrievable by their identifier using a standardized communications protocol | HTTP(S) implementation; possible authentication/authorization mechanisms |
| A1.1 | The protocol is open, free, and universally implementable | No proprietary barriers; community-vetted standards | |
| A1.2 | The protocol allows for an authentication and authorization procedure, where necessary | Access control without compromising discoverability; tiered access when appropriate | |
| A2 | Metadata are accessible, even when the data are no longer available | Preservation commitment; metadata persistence policies | |
| Interoperable | I1 | (Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation | Standardized data models; formal semantics; machine-readable formats |
| I2 | (Meta)data use vocabularies that follow FAIR principles | Use of community standards like SNOMED CT, MeSH [85]; ontology utilization | |
| I3 | (Meta)data include qualified references to other (meta)data | Relationship specification; contextual linking; cross-repository references | |
| Reusable | R1 | (Meta)data are richly described with a plurality of accurate and relevant attributes | Comprehensive documentation; multiple descriptive dimensions |
| R1.1 | (Meta)data are released with a clear and accessible data usage license | Standard license formats (CC0, CC-BY, etc.); machine-readable license statements | |
| R1.2 | (Meta)data are associated with detailed provenance | Origin tracking; processing history; transformation documentation | |
| R1.3 | (Meta)data meet domain-relevant community standards | Adherence to field-specific metadata schemas; participation in standards development |
The One Health approach is defined by the World Health Organization as "an integrated, unifying approach that aims to sustainably balance and optimize the health of people, animals and ecosystems" [1]. This approach recognizes the intimate connections between human health, animal health, and environmental factors, particularly in the context of emerging infectious diseases, antimicrobial resistance, and food safety [1]. The COVID-19 pandemic has dramatically underscored the necessity of strengthening One Health implementation, with particular emphasis on connections to the environment [1].
The Digital One Health (DOH) framework has been proposed to address the significant gaps in current data integration practices [84]. This framework is structured around five key pillars: (a) harmonization of standards to establish trust, (b) automation of data capture to enhance quality and efficiency, (c) integration of data at point of capture to limit bureaucracy, (d) onboard data analysis to articulate utility, and (e) archiving and governance to safeguard the OH data resource [84]. These pillars align closely with FAIR principles and provide a practical implementation pathway for One Health data integration.
Current implementations of One Health often focus on multi-sectoral collaboration but frequently overlook opportunities to integrate contextual and pathogen-related data into a unified data resource [84]. This lack of integration hampers effective, data-driven decision-making in One Health activities [84]. The DOH framework aims to leverage technology to create data as a shared resource, overcoming not only current structural barriers but also addressing prevailing ethical and legal concerns [84].
The integration of genomic data into clinical and public health workflows presents distinctive challenges within the One Health framework. A 2025 systematic review highlighted that fragmented data ecosystems disrupt interoperability, complicate patient-centered care, and present significant challenges for incorporating genomic data into clinical workflows [86]. Key challenges identified include:
The systematic review, which analyzed 161 studies, found that interoperability remains a cornerstone challenge for modern healthcare systems, requiring well-defined strategies for structuring, managing, and securely exchanging diverse health data [86]. This is particularly critical for chronic disease management, where effective data sharing ensures timely, high-quality healthcare [86]. Without robust interoperability frameworks, fragmented systems lead to redundant efforts and delayed treatments, ultimately adversely affecting patient outcomes [86].
The FAIRification process refers to the practical implementation of FAIR principles onto data resources. A practical "how to" guidance to go FAIR can be found in the Three-point FAIRification Framework [82]. This process typically involves the following key stages:
Pre-FAIRification Assessment: Evaluate the current state of data resources against FAIR principles using maturity models or assessment tools. This establishes a baseline and identifies priority areas for improvement.
Data Model Alignment: Map existing data structures to formal models, ontologies, and vocabularies that comply with FAIR principles, particularly addressing the Interoperable and Reusable pillars.
Identifier Implementation: Assign globally unique and persistent identifiers to datasets and key metadata elements, supporting the Findability principles.
Metadata Enhancement: Enrich metadata descriptions to meet domain-relevant community standards, ensuring comprehensive coverage of the Reusable principles.
Infrastructure Deployment: Establish or leverage trusted repositories that support FAIR access patterns, including standardized protocols and authentication/authorization where required.
Provenance Tracking: Implement systems to record data origin, processing history, and transformations, addressing the Reusable principle R1.2.
License Clarification: Attach clear, machine-readable usage licenses to enable reuse while respecting ethical and legal constraints.
The European Union's open data portal provides an exemplary case study of FAIR implementation, demonstrating how data quality guidelines can be operationalized at scale [87]. Similarly, the AnaEE (Analysis and Experimentation on Ecosystems) Research Infrastructure offers insights into achieving semantic interoperability in ecosystem studies [88], a critical concern for the environmental health component of One Health.
The following diagram illustrates the integrated workflow for generating FAIR-compliant genomic data within a One Health context, connecting human, animal, and environmental data streams:
Diagram 1: FAIRification workflow for One Health genomic data integration
Achieving semantic interoperability across human, animal, and environmental genomic data requires a structured approach to vocabulary alignment and data modeling. The following diagram illustrates the semantic interoperability framework essential for FAIR-compliant One Health data:
Diagram 2: Semantic interoperability framework for cross-domain One Health data
This protocol outlines a standardized methodology for generating FAIR-compliant genomic data within a One Health research context, incorporating elements from the "Genomics Applied to One Health" training course [89] and best practices from genomic data repositories.
Objective: To collect human, animal, and environmental samples with comprehensive metadata capture ensuring traceability and future reuse.
Materials:
Procedure:
Objective: To generate genomic sequencing data with complete technical metadata following FAIR principles.
Materials:
Procedure:
Objective: To process raw sequencing data into analysis-ready formats with comprehensive quality assessment.
Computational Tools:
Procedure:
Objective: To add biological context to genomic data using standardized ontologies and vocabularies.
Resources:
Procedure:
Table 2: Essential research reagents and computational tools for FAIR-compliant One Health genomic research.
| Category | Item/Resource | Specification/Function | FAIR Alignment |
|---|---|---|---|
| Sample Collection | Barcoded Collection Kits | Unique sample identification at source | Supports F1 (persistent identifiers) and R1.2 (provenance) |
| Mobile Data Capture Apps | Field metadata documentation with offline capability | Enhances R1 (rich metadata) and A1 (accessible protocol) | |
| Sequencing | MinION Nanopore Platform | Portable sequencing for field deployment [89] | Enables A1.1 (open protocol) and I1 (standard formats) |
| Unique Dual Indexes | Multiplexing without sample cross-talk | Ensures R1.2 (provenance) and data quality for reuse | |
| Bioinformatics | EupathDB Databases | Specialized functional annotation [89] | Provides I2 (FAIR vocabularies) and R1.3 (community standards) |
| CF Conventions | Standardized environmental data representation [87] | Enables I1 (formal language) across environmental domains | |
| Data Repository | Dataverse Software | Repository platform for data stations [88] | Supports F4 (searchable resource) and A1 (access protocol) |
| Zenodo/Figshare | General-purpose FAIR-compliant repositories | Provides persistent identifiers (F1) and standardized access (A1) | |
| Ontology Services | OBO Foundry Ontologies | Interoperable biomedical ontologies | Delivers I2 (FAIR vocabularies) and I1 (formal language) |
| ROR Registry | Organization identifier service [88] | Enables F1 (persistent IDs) for institutional attribution |
Case studies from data-intensive scientific communities demonstrate the practical implementation and benefits of FAIR principles. The high-energy physics community has developed FAIR frameworks for AI-ready datasets, creating standardized resources such as the FAIR and AI-ready Higgs boson decay dataset [87]. This initiative exemplifies how FAIR principles can be applied to complex experimental data to enhance reuse and computational analysis.
Similarly, the Earth sciences community has implemented FAIR principles in critical resources like the IPCC WGI AR6 Atlas repository [87]. This implementation showcases the importance of standardized metadata and structured data formats for supporting international climate assessments. The use of CF (Climate and Forecast) metadata conventions demonstrates how community standards enable interoperability across diverse datasets and research groups [87].
The European Union's open data portal provides another instructive case study, having developed comprehensive data quality guidelines to enhance the FAIRness of its extensive data resources [87]. These guidelines operationalize FAIR principles at scale, addressing practical concerns of data management, documentation, and access.
Table 3: Quantitative metrics for assessing FAIR principle implementation in genomic data resources.
| FAIR Principle | Metric Category | Specific Measurable Indicators | Target Performance Level |
|---|---|---|---|
| Findable | Identifier Implementation | Percentage of datasets with persistent identifiers | 100% assignment at creation |
| Metadata Richness | Number of metadata elements per dataset | Minimum 20 core elements | |
| Search Performance | Search result relevance score | >80% precision in retrieval | |
| Accessible | Protocol Compliance | HTTP/S implementation success rate | 100% protocol adherence |
| Availability | Repository uptime percentage | >99.5% availability | |
| Authentication | Tiered access implementation capability | Support for multiple auth methods | |
| Interoperable | Standard Adoption | Use of community metadata standards | 100% alignment with domain schemas |
| Vocabulary Use | Percentage of annotations using controlled terms | >90% terminology standardization | |
| Format Compliance | Support for standard data formats | Minimum 3 standard formats | |
| Reusable | License Clarity | Presence of machine-readable license | 100% license assignment |
| Provenance Detail | Completeness of provenance documentation | Full chain from sample to data | |
| Citation Performance | Number of third-party citations | Increasing annual citation rate |
The implementation of FAIR principles within One Health genomic research represents a critical enabling strategy for addressing complex health challenges at the human-animal-environment interface. As demonstrated by the frameworks, protocols, and case studies presented in this technical guide, the systematic application of FAIR principles enhances data discovery, integration, and reuse across traditional disciplinary boundaries. The Digital One Health framework [84] provides a particularly promising approach for operationalizing FAIR principles at scale, addressing both technical and governance challenges in multi-sectoral data sharing.
Future developments in this field will likely focus on several key areas:
The integration of FAIR principles with One Health approaches in genomic sciences will continue to evolve, driven by technological advancements and increasingly urgent needs for coordinated responses to global health challenges. By adopting the methodologies, standards, and best practices outlined in this guide, researchers can contribute to building more resilient, integrated, and effective data ecosystems for protecting and promoting health across species and ecosystems.
The One Health approach, which recognizes the interconnectedness of human, animal, and environmental health, is increasingly critical for managing disease threats in an era of globalization and climate change [20]. Genomic surveillance has become a foundational tool within this approach, enabling precise pathogen identification, real-time outbreak tracking, and insights into host-pathogen co-evolution [20]. Robust pipelines and stringent quality assurance are essential to transform raw genomic data into actionable intelligence for researchers, scientists, and drug development professionals. This technical guide outlines the core components, methodologies, and quality frameworks required to develop such pipelines, emphasizing their application across human, animal, and plant health domains to support a cohesive biosecurity strategy.
A comprehensive genomic surveillance pipeline integrates wet-lab and computational processes, supported by cross-sectoral collaboration. The key technical domains that cut across all pipeline activities are Sampling and Sequencing, Data Processing and Analysis, Quality Assurance & Validation, and Evidence-to-Policy translation [20]. The table below summarizes the quantitative standards for the final data package.
Table 1: Quality Control Standards for Genomic Surveillance Data Submission [90]
| Component | Standard or Requirement | Purpose |
|---|---|---|
| Genome Sequence Data | Quality control (QC) assessment | Ensures data reliability for downstream analysis |
| Contextual Data | Validation of metadata | Confirms accuracy of associated sample data |
| Data Submission | Submission to public repository (e.g., via INSDC) | Facilitates data sharing and global surveillance |
| Data Access | Availability through custom dashboards (e.g., NCBI Pathogen Detection) | Enables cluster analysis and genotyping for public health decision-making |
| Data Curation | Ongoing maintenance of public data | Ensures long-term relevance and accuracy of shared data |
The initial phase focuses on generating high-quality genomic data. This involves systematic sample collection from humans, animals, plants, and the environment, reflecting the One Health scope [20]. High-throughput sequencing (HTS) technologies are deployed across various contexts, from central labs to field applications [20]. Specific protocols, such as those for bacterial whole-genome sequencing, involve genomic DNA isolation, quantification and purity measurement, DNA library preparation, and quality control of libraries prior to sequencing [91].
Following sequencing, data processing and analysis extract meaningful biological insights. This stage applies bioinformatics pipelines and computational tools for tasks like genome assembly, variant calling, and phylogenetic analysis [20]. A One Health approach requires integrating these genomic findings with epidemiological and environmental data, which presents challenges in data dispersion, heterogeneous collection methods, and semantic interoperability [36]. Successful integration allows for co-analysis, leading to new hypotheses and improved early warning systems for health events [36].
Quality assurance is a cross-cutting technical domain that ensures data reliability, reproducibility, and compliance with accreditation frameworks [20]. This extends from technical data quality to the integrity of the entire data package submitted to public repositories.
Contributing to global surveillance requires assembling a standard data package. The process involves five key protocols [90]:
The following table details key reagents and materials essential for wet-lab sequencing workflows in genomic surveillance.
Table 2: Key Research Reagent Solutions for Genomic Sequencing Workflows
| Item / Reagent | Function | Example Use Case |
|---|---|---|
| Nexttec 1-step Kit | Genomic DNA isolation from bacterial samples | Preparation of pure DNA template for sequencing [91] |
| Qubit Fluorometer | Accurate quantification of genomic DNA concentration | Ensures precise loading of DNA into sequencing library preparations [91] |
| Nanodrop Spectrophotometer | Measurement of genomic DNA purity (e.g., ratio of A260/A280) | Assesses sample quality and detects contaminants like salts or proteins [91] |
| NEBNext Ultra II FS DNA Kit | Preparation of DNA libraries for Illumina sequencing | Fragments, end-repairs, and adds adapters to DNA for sequencing [91] |
| Illumina MiSeq | Performing high-throughput whole-genome sequencing | Generates the raw sequence data (FASTQ files) for analysis [91] |
Moving from single-sector surveillance to an integrated One Health system requires multi-sector coordination at every stage, from sample collection to dissemination of results [36]. Initiatives like the Genomics for Animal and Plant Disease Consortium (GAP-DC) exemplify this by fostering collaboration between government agencies and academic institutions to tackle shared technological and policy challenges [20]. GAP-DC's work packages address critical areas such as frontline pathogen detection at borders, pathogen spillover at the wildlife-domestic interface, and management of both outbreak and endemic diseases [20]. This operational model emphasizes that beyond technical integration, successful One Health genomic surveillance requires complex partner identification, sustained engagement, co-development of system scope, and joint data analysis and interpretation across sectors [36].
Developing robust pipelines and quality assurance for genomic surveillance is a multidisciplinary endeavor fundamental to the One Health mission. It requires the integration of advanced technological capabilities, rigorous and standardized protocols, and a collaborative framework that transcends traditional sectoral boundaries. By adhering to stringent quality controls from sample to policy decision and leveraging integrated data systems, the scientific community can enhance disease detection, elucidate transmission dynamics, and ultimately strengthen global health security.
The increasing frequency and impact of emerging infectious diseases and antimicrobial resistance (AMR) have exposed critical vulnerabilities in global health security. Traditional disease surveillance systems, characterized by their sectoral independence and disciplinary isolation, struggle to provide the holistic insights needed to address complex health threats at the human-animal-environment interface [92]. The One Health approach recognizes that the health of humans, domestic animals, wildlife, and ecosystems are profoundly interconnected, and that effective public health responses require integrated, multisectoral coordination [93].
This paradigm shift is particularly evident in genomic sciences, where pathogen genome sequencing has transformed our ability to track disease transmission, understand pathogen evolution, and investigate outbreaks. One Health genomics represents a fundamental departure from traditional surveillance by integrating genomic data across human, animal, and environmental sectors, enabling researchers to unravel complex transmission dynamics that would otherwise remain invisible within disciplinary silos [11]. This technical guide provides a comprehensive comparative analysis of these contrasting approaches, examining their conceptual foundations, methodological frameworks, practical implementations, and impacts on public health decision-making.
Traditional surveillance systems operate within well-established but isolated sectoral boundaries, with distinct data collection, analysis, and reporting mechanisms for human health, animal health, and environmental monitoring. This approach is characterized by independent data systems, sector-specific analysis platforms, and disciplinary visualizations that limit cross-sectoral insight [92].
The fundamental workflow in traditional surveillance follows a linear pathway within each sector: (1) sample or data collection; (2) data storage and aggregation; (3) data analysis and interpretation; and (4) dissemination or outcome communication [92]. While this model has demonstrated effectiveness for addressing sector-specific health concerns, it creates significant blind spots for understanding cross-sectoral health threats.
Key limitations of this approach include:
One Health genomics represents a transformative approach that leverages pathogen genomic data as a universal language to track health threats across species and ecosystems. This methodology is grounded in the understanding that pathogen genomic data is host-agnostic, enabling researchers to reconstruct transmission networks and identify reservoirs without the constraints of sectoral boundaries [6].
The conceptual advancement of One Health genomics lies in its ability to connect seemingly unrelated health events through shared pathogen genomes, creating a unified understanding of disease ecology. This approach has been most successfully applied in foodborne disease surveillance through systems such as PulseNet, GenomeTrakr, and the European One Health Whole Genome Sequencing System [92].
The core innovation of One Health genomics is its capacity for phylogenetic analysis across sectors, which allows for the assessment of transmission dynamics at the human-animal-environment interface, enabling proactive prevention of One Health threats through early outbreak detection and reservoir identification [92].
Table 1: Fundamental Characteristics of Surveillance Approaches
| Characteristic | Traditional Siloed Surveillance | One Health Genomics |
|---|---|---|
| Conceptual Foundation | Sector-specific health protection | Interconnected health across domains |
| Data Structure | Independent, sector-specific databases | Integrated, cross-sectoral data platforms |
| Primary Focus | Human health OR animal health OR environmental health | Interface events and cross-domain transmission |
| Pathogen Understanding | Within-sector transmission dynamics | Cross-species transmission and evolution |
| Governance Model | Sector-specific mandates and policies | Cross-sectoral coordination mechanisms |
Implementing effective One Health genomic surveillance requires sophisticated data integration frameworks that can accommodate heterogeneous data sources while addressing complex governance challenges. Recent research has identified several promising approaches for achieving this integration [92].
A systematic framework developed for Washington State government emphasizes that successful One Health data integration must consider common challenges of limited resource settings, including lack of informatics support during planning phases, and the need to move beyond scoping and planning to actual system development, production, and joint analyses [92]. This framework highlights that One Health integration requires complex partner identification, engagement and co-development of system scope, complex data governance, and joint data analysis across sectors.
Table 2: Approaches to One Health Genomic Data Integration
| Approach | Key Features | Implementation Examples |
|---|---|---|
| Shared Secure Surveillance Platform | ⢠Human, animal, environmental, and food isolates⢠Sequence data with associated metadata⢠Controlled data access⢠Automated data sharing with international repositories | Swiss surveillance platform between human and veterinary medicine [94] |
| Shared Bioinformatics Platform | ⢠Common workflow system⢠Quality control processes⢠Integrated analysis of human, animal, food, and environmental isolates | Italian multisectoral data collection and bioinformatic analysis platform [94] |
| Federated Database Ecosystem | ⢠Distributed databases connected through APIs⢠Maintained data sovereignty⢠Cross-database querying capabilities | Proposed federated solutions for sharing genomic data across jurisdictions [94] |
| Open Access Databases with Standardized Practices | ⢠Store sequences with standardized metadata⢠Inclusion of specimens from all One Health domains⢠Clear processes for data correction and updating | Best practices for One Health contributions to international sequence databases [94] |
The implementation of One Health genomic surveillance requires a structured workflow that encompasses sample collection, sequencing, data integration, and joint analysis across sectors. The following Graphviz diagram illustrates this integrated workflow:
One Health Genomics Workflow
The computational backbone of One Health genomics requires robust bioinformatic infrastructure capable of handling massive genomic datasets while enabling cross-sectoral data integration. The following architecture supports this integration:
Data Integration Architecture
The transition from traditional siloed surveillance to integrated One Health genomics produces measurable differences across multiple performance dimensions. The table below summarizes key comparative metrics based on documented implementations and case studies:
Table 3: Performance Comparison of Surveillance Approaches
| Performance Metric | Traditional Siloed Surveillance | One Health Genomics |
|---|---|---|
| Outbreak Detection Speed | Delayed, often after significant human transmission | Early warning through integrated environmental and animal data |
| Transmission Insight | Limited to within-sector understanding | Cross-species transmission routes and reservoirs identified |
| Data Completeness | Fragmented, sector-specific data | Holistic view of health threats across domains |
| Resource Efficiency | Duplicative testing and analysis across sectors | Shared resources and infrastructure |
| Antimicrobial Resistance Tracking | Sector-specific resistance patterns | Comprehensive tracking from hospitals to livestock to environment |
| Response Coordination | Sector-specific response actions | Integrated, cross-sectoral public health interventions |
| Pandemic Preparedness | Reactive to human case emergence | Proactive detection in animal and environmental reservoirs |
The global challenge of antimicrobial resistance (AMR) exemplifies the critical importance of One Health genomics. Between 2010 and 2015, total global antibiotic consumption increased by 65% from 21.1 to 34.8 billion defined daily doses, with the greatest increases in emerging economies [6]. The use of antimicrobials in livestock production comprises 73% of entire global consumption, currently estimated at 131,000 tons annually and predicted to rise to 200,000 tons by 2030 [6].
One Health genomics reveals that AMR is not confined to clinical settings but operates as a connected ecosystem. Whole genome sequencing enables researchers to track the movement of resistance genes between bacterial species and across human, animal, and environmental domains, identifying high-risk clones and mobile genetic elements that drive resistance dissemination [6]. This comprehensive understanding is essential for developing effective containment strategies.
Wastewater surveillance has emerged as a powerful application of One Health genomics, particularly during the COVID-19 pandemic. Wastewater contains a broad range of chemicals and biota from human populations, providing a community-wide health snapshot [95]. The "one sample many analyses" (OSMA) approach allows for simultaneous detection of pathogens, AMR genes, and pharmaceutical residues, creating a cost-effective sentinel system for public health threats [95].
This approach enables non-invasive detection of community infection trends, including for non-notifiable pathogens, while also monitoring antimicrobial usage patterns through analytical chemistry techniques [95]. Wastewater's status as both a pollutant and data source makes it an ideal sentinel within the One Health toolkit, connecting human health with environmental monitoring.
The implementation of One Health genomic surveillance faces significant computational challenges due to the enormous volume of data generated by modern sequencing technologies. It has been estimated that the annual acquisition of raw genomic data worldwide would exceed one zettabyte (one trillion GB) by 2025 [6]. This data deluge requires scalable computational infrastructures and sophisticated bioinformatic pipelines.
Key solutions to these challenges include:
Tools like ASA3P (Automatic Bacterial Isolate Assembly, Annotation and Analyses Pipeline) represent the kind of automated, standardized bioinformatic solutions needed to make One Health genomics operationally feasible across diverse settings [6].
Implementing effective One Health genomic surveillance requires specific technical reagents and computational tools. The following table details essential components and their functions:
Table 4: Essential Research Reagents and Tools for One Health Genomics
| Reagent/Tool Category | Specific Examples | Function in One Health Surveillance |
|---|---|---|
| Sequencing Technologies | Oxford Nanopore, Illumina | Generate genomic data from human, animal, and environmental samples |
| Bioinformatic Pipelines | ASA3P, Custom workflows | Process raw sequence data, perform assembly, annotation, and analysis |
| Database Infrastructure | APIs, Federated databases, Cloud storage | Enable cross-sectoral data sharing and integration |
| Quality Control Tools | QC thresholds, Standardization protocols | Ensure data quality and interoperability across diverse sample types |
| Computational Resources | Cloud computing, High-performance computing | Handle massive datasets and complex phylogenetic analyses |
| Visualization Platforms | Dashboards, Interactive tools | Present integrated data to support public health decision-making |
Based on the documented experiences and case studies of One Health genomic implementation, several key recommendations emerge for researchers and public health agencies:
Develop Modular, Scalable Bioinformatics Pipelines Implementation of open-source, accessible bioinformatics pipelines that can be adapted to local needs and resources is essential. These pipelines should be modular for flexibility, reproducible across computing environments, and capable of integrating diverse data types [94].
Strengthen Cross-Sectoral Data Governance Frameworks Successful One Health genomics requires navigating complex data governance challenges. Establishing clear agreements on data sharing protocols, access controls, and intellectual property while maintaining appropriate privacy protections is fundamental to building trust across sectors [92].
Build Technical Workforce Capacity The specialized skills required for One Health genomicsâincluding bioinformatics, genomic epidemiology, and cross-sectoral collaborationârepresent a critical capacity gap in many settings. Strategic investment in training programs, workforce development, and knowledge transfer is needed at national and global levels [94].
Implement Federated Data Systems Rather than centralizing all data, federated systems that connect distributed databases through APIs offer a practical approach to data integration while respecting jurisdictional boundaries and data sovereignty concerns [94].
Adopt "One Sample Many Analyses" Approaches Maximizing the value of surveillance samples through coordinated testing for multiple pathogens, AMR markers, and chemical indicators provides greater efficiency and more comprehensive health intelligence [95].
The transformative potential of One Health genomics lies in its ability to move beyond sectoral limitations and create a unified understanding of health threats. As genomic technologies become more accessible and bioinformatic tools more sophisticated, the implementation of integrated surveillance systems will be crucial for effective pandemic preparedness, antimicrobial resistance containment, and proactive management of emerging health threats at the human-animal-environment interface.
The One Health approach is an integrated, unifying framework that aims to sustainably balance and optimize the health of people, animals, and ecosystems, recognizing their close interdependence [1]. In the context of genomic sciences, this approach leverages advanced genomic surveillance technologies to mitigate threats posed by endemic and emerging diseases across agricultural, public health, and biosecurity domains [20]. The rationale for quantifying outcomes within this framework stems from the need to objectively evaluate performance in outbreak detection and response, strategically inform policies, and optimize resource allocation for maximum public health impact [96]. As the world continues to confront the aftermath of the COVID-19 pandemic, which led to the loss of millions of lives and trillions of dollars from the global economy, the imperative to strengthen One Health approaches with greater emphasis on genomic surveillance has never been clearer [1]. This technical guide provides researchers and drug development professionals with methodologies and metrics for rigorously evaluating the effectiveness, cost-benefit, and public health impact of One Health initiatives in genomic research.
A critical framework for quantifying the effectiveness of One Health systems involves analyzing timeliness metrics across multisectoral public health emergencies. These metrics objectively measure the time between key outbreak milestones, allowing for quantitative assessment of detection and response performance [96]. The One Health timeliness framework emphasizes cross-sectoral coordination and community engagement, analyzing metrics related to predictive alerts of outbreaks and preventive responses.
Table 1: One Health Timeliness Metrics and Milestones
| Metric Phase | Start Milestone | End Milestone | Performance Indicator |
|---|---|---|---|
| Detection | Outbreak Start | First Detection | Speed of initial identification |
| Notification | First Detection | Official Notification | Reporting efficiency |
| Verification | Official Notification | Verification | Case confirmation timeliness |
| Response Initiation | Verification | Response Implementation | Reaction rapidity |
Research from Uganda (2018-2022) demonstrates the practical application of these metrics, revealing that the greatest predictors of improved timeliness are (1) frequent past experience with similar disease outbreaks and (2) whether an outbreak is a viral hemorrhagic fever due to heightened perceived threat and pre-existing preparedness measures [96]. Diagnostic and laboratory considerations, along with contextual influences such as One Health collaborations, were also described as relevant to timeliness.
Within the UK's Genomics for Animal and Plant Disease Consortium (GAP-DC), effectiveness is quantified through six interconnected work packages, each addressing specific aspects of genomic surveillance effectiveness [20]:
The GAP-DC initiative employs a cross-cutting technical approach across four domains to ensure robust quantification: (1) Sampling and Sequencing, (2) Data Processing and Analysis, (3) Quality Assurance - Validation and Accreditation, and (4) Evidence-to-Policy translation [20].
Objective: To quantify timeliness metrics for multisectoral outbreak detection and response within a One Health framework.
Materials and Data Collection:
Statistical Analysis:
Objective: To evaluate the effectiveness of genomic surveillance systems in detecting and characterizing pathogens across One Health sectors.
Sample Collection and Sequencing:
Data Processing and Analysis:
Quality Assurance and Validation:
Analysis of 81 outbreaks in Uganda between 2018-2022 revealed distinct patterns in timeliness performance across different types of health threats [96]. The quantitative findings demonstrate that:
Table 2: Timeliness Performance by Outbreak Characteristic
| Outbreak Characteristic | Median Detection Time (Days) | Median Response Time (Days) | Key Influencing Factors |
|---|---|---|---|
| Frequent Past Experience | Significantly faster | Significantly faster | Established protocols, familiar pathogens |
| Viral Hemorrhagic Fevers | Faster | Faster | Perceived high threat, pre-existing preparedness |
| Novel/Unknown Pathogens | Significantly slower | Significantly slower | Lack of diagnostic tools, unfamiliar clinical presentation |
| Strong One Health Collaboration | Faster | Faster | Integrated reporting, shared resources |
The data reveals that while teams respond quickly to known diseases, novel diseases that the health system is unfamiliar with take longer to detect and address. Enhanced coordination between animal, human, and environmental health sectors demonstrates measurable improvements in timeliness metrics [96].
While specific financial data requires comprehensive economic analysis, the GAP-DC initiative identifies several critical cost-benefit considerations [20]:
The UK's approach to biological hazards, underpinned by its biological security strategy's four foundational pillars (understand, prevent, detect, and respond), emphasizes surveillance as an early warning tool and first line of defense against endemic and emerging pathogens and pests [20].
The following diagram illustrates the integrated workflow for One Health genomic surveillance and outcome quantification:
One Health Genomic Surveillance Pipeline
The following diagram details the sequence of milestones in the One Health timeliness metrics framework:
Outbreak Timeliness Metric Milestones
Table 3: Essential Research Reagents for One Health Genomic Surveillance
| Reagent/Material | Function | Application Context |
|---|---|---|
| High-Throughput Sequencing Kits | Generate comprehensive genomic data from diverse sample types | Pathogen characterization, antimicrobial resistance monitoring |
| Metagenomic Sequencing Reagents | Enable detection of unknown pathogens without prior targeting | Surveillance of novel disease threats, environmental sampling |
| Cross-Sector Sample Collection Kits | Standardize specimen collection across human, animal, environmental domains | Integrated surveillance, outbreak investigation |
| Bioinformatics Pipelines | Process and analyze genomic data for pathogen detection | Transmission tracking, virulence marker identification |
| Quality Control Standards | Ensure data reliability and reproducibility across laboratories | Method validation, inter-laboratory comparisons |
| One Health Data Integration Platforms | Harmonize data from multiple sectors for comprehensive analysis | Timeliness metric calculation, outcome quantification |
The quantification of outcomes in One Health genomic sciences requires ongoing refinement of metrics and methodologies. Current evidence demonstrates that the effectiveness of One Health initiatives is significantly enhanced through cross-sectoral collaboration, standardized protocols, and integrated data systems [20] [96]. The timeliness metrics framework offers a validated approach for quantitative assessment of outbreak detection and response performance, enabling objective evaluation of system strengths and weaknesses.
Implementation of these quantitative frameworks faces several challenges, including the need for standardized data collection across sectors, interoperability of reporting systems, and resource allocation for sustained genomic surveillance capacity [96]. Furthermore, analyses of National One Health Strategic Plans have revealed varying levels of alignment with contemporary One Health principles, with specific actions inconsistently anticipated across plans [97]. Disparities in addressing issues such as climate change, anthropogenic drivers, and non-communicable diseases were also evident, highlighting areas for future development.
For researchers and drug development professionals, adopting these quantitative frameworks enables more rigorous evaluation of One Health interventions, strengthens evidence-based policy development, and optimizes resource allocation for maximum public health impact. Future directions should focus on expanding surveillance networks, enhancing bioinformatics capabilities, and integrating innovative technologies to address evolving challenges, including building predictive capabilities to better anticipate and mitigate disease impact [20].
The One Health approach recognizes that the health of humans, animals, and ecosystems are interdependent, pushing scientific inquiry beyond traditional siloed approaches to coordination and integration [36]. In genomic sciences, this approach enables researchers to address complex health challenges at the human-animal-environment interface through cross-species genomic analyses that can identify novel hypotheses and improve early warning systems for impending health events [36]. The emerging discipline of cross-species genomic prediction represents a paradigm shift in how we understand population health across species boundaries, leveraging advanced computational tools and large-scale genomic datasets to translate findings between biological kingdoms.
This technical guide examines the validation frameworks, methodological considerations, and practical implementations for cross-species genomic predictions. By leveraging genomic tools and methods across human, animal, and plant genetics, researchers can address a significant gap in current scientific practice, where these fields have historically developed in isolation despite using similar underlying models and approaches [10]. The validation of these cross-species predictions requires sophisticated statistical frameworks and integrated data systems that can accommodate the complexity of multi-species genomic data while maintaining scientific rigor and biological relevance.
Genomic prediction is an effective method for shortening breeding cycles and accelerating genetic gains, with traditional approaches focusing on estimating 'additive' breeding values for individual genotypes [98]. For comprehensive cross-species prediction, models must incorporate both additive and non-additive genetic effects to account for the full spectrum of genetic contributions to complex traits. The genomic predicted cross-performance (GPCP) tool represents one advanced implementation, utilizing a mixed linear model based on additive and directional dominance effects [98]. This approach is particularly valuable for traits with significant dominance effects and for clonally propagated crops where inbreeding depression and heterosis are prevalent.
The fundamental GPCP model can be represented as:
y = Xb + Fα + Za + Wd + e [98]
Where:
This model effectively partitions genetic variance into its constituent components, allowing for more accurate prediction of cross-performance in breeding programs and potentially in cross-species applications.
Validating cross-species genomic predictions requires rigorous statistical frameworks to ensure predictive accuracy and biological relevance. The following comparative data illustrate key metrics and approaches:
Table 1: Performance Metrics for Genomic Prediction Models Across Species Boundaries
| Metric | Human Genetics Applications | Animal Breeding Applications | Plant Genetics Applications |
|---|---|---|---|
| Prediction Accuracy Range | 0.15-0.45 for complex behavioral traits | 0.35-0.75 for production traits | 0.40-0.80 for yield traits |
| Typical Dataset Size | 10,000-1,000,000 individuals | 1,000-100,000 individuals | 1,000-50,000 individuals |
| Key Methodological Approaches | Polygenic risk scores, Genome-wide association studies | Genomic estimated breeding values, Selection indices | Genomic selection, Hybrid performance prediction |
| Primary Validation Method | Hold-out validation, External cohort replication | Cross-validation, Progeny testing | Cross-validation, Field trials |
| Data Integration Challenges | Ethical constraints, Phenotypic heterogeneity | Multi-trait evaluation, Genotype by environment interactions | High-throughput phenotyping, Environmental covariates |
Table 2: Quantitative Comparison of Experimental Results for Method Validation
| Validation Method | Implementation Requirements | Strengths | Limitations |
|---|---|---|---|
| t-test Comparison | Two sample groups with calculated means, standard deviations, and sample sizes [99] | Determines if difference between two means is statistically significant; Simple to implement | Limited to comparison of two groups; Requires normal distribution assumption |
| F-test for Variances | Two data sets to compare spread of values [99] | Assesses equality of variances before t-test; Important for model assumptions | Must be â¥1 (s1² ⥠s2²); Limited to variance comparison only |
| Cross-species Prediction Accuracy | Large datasets from multiple species; Genomic and phenotypic records [10] | Directly measures transferability of predictions across species; Most relevant validation | Requires substantial genomic resources across multiple species; Complex interpretation |
For determining statistical significance between experimental results, the t-test provides a fundamental validation tool, with the formula:
t = (xÌâ - xÌâ) / [sâ(1/nâ + 1/nâ)] [99]
Where xÌâ and xÌâ are the sample means, s is the pooled estimate of standard deviation, and nâ and nâ are the sample sizes. The number of degrees of freedom (df) is (nâ + nâ) - 2. The null hypothesis (Hâ) assumes no difference between means, while the alternative hypothesis (Hâ) proposes a significant difference exists [99]. Rejection of Hâ occurs when the absolute t-value exceeds the critical value at a specified significance level (typically α = 0.05), indicating statistically significant differences.
The validation of cross-species genomic predictions requires carefully designed experimental workflows that ensure methodological rigor while accommodating the unique challenges of multi-species data integration. The following diagram illustrates a comprehensive framework:
Implementing cross-species genomic predictions requires robust data integration frameworks that can accommodate the diverse data sources inherent to One Health approaches. The development of such frameworks involves systematic processes:
Table 3: One Health Data Integration Framework Development
| Development Phase | Key Activities | Outputs | Stakeholder Engagement |
|---|---|---|---|
| Conceptualization | Hypothesis generation; Study aim definition; Relationship mapping between exposure sources and outcomes [100] | Directed Acyclic Graphs; Logic models; Multi-pathway visualization matrices | Multidisciplinary team assembly; Community member involvement |
| Planning | Study design determination; Data source identification; Analytical method selection [100] | Mixed methodology protocols; Power and sample size computations; Data collection strategies | Specialist consultation; Resource aggregation; Capacity assessment |
| Implementation | System development; Data collection; Integration pipeline construction | Functional data systems; Automated collection protocols; Analysis platforms | Cross-sector training; Joint development; Governance establishment |
| Operationalization | Joint data analysis; Interpretation; Dissemination; System refinement | One Health surveillance outputs; Early warning systems; Intervention strategies | Ongoing collaborative relationships; Communication protocols |
Successful implementation of One Health data integration requires moving beyond simple data sharing to true co-analysis of integrated datasets, which presents both technical and governance challenges. Key considerations include:
The experimental validation of cross-species genomic predictions requires specific research reagents and computational resources:
Table 4: Research Reagent Solutions for Cross-Species Genomic Studies
| Category | Specific Examples | Function/Application | Implementation Considerations |
|---|---|---|---|
| Genomic Data Resources | UK Biobank; EnsemblPlants; Livestock Research Data Ecosystems [10] | Provide large-scale phenotypic and genomic data for model training and validation | Data use agreements; Ethical considerations; Computational infrastructure |
| Analysis Software & Packages | BreedBase; R packages (sommer); AlphaSimR [98] | Implement genomic prediction models; Perform simulations; Statistical analysis | Computational resource requirements; Technical expertise; Customization needs |
| Statistical Validation Tools | XLMiner ToolPak; Analysis ToolPak; Custom scripts for t-tests and F-tests [99] | Compare experimental results; Determine statistical significance; Validate predictive models | Appropriate test selection; Significance threshold setting; Interpretation frameworks |
| Pathogen Genomic Resources | PulseNet; GenomeTrakr; NCBI databases [36] | Support integrated genomic surveillance; Enable phylogenetic analysis across species | Laboratory capacity for sequence generation; Bioinformatics expertise; Data standardization |
This protocol outlines the steps for validating genomic predictions across species boundaries, adapted from methodologies successfully implemented in both animal and plant genetics [98] [10].
Step 1: Data Collection and Harmonization
Step 2: Model Training and Parameter Estimation
Step 3: Cross-Species Prediction
Step 4: Statistical Validation
This protocol describes the implementation of integrated genomic surveillance for pathogen tracking across human, animal, and environmental domains [36].
Step 1: Sample Collection and Sequencing
Step 2: Data Integration and Management
Step 3: Phylogenetic Analysis and Interpretation
Step 4: Joint Analysis and Response
Validated cross-species genomic predictions enable numerous practical applications within the One Health framework:
Infectious Disease Surveillance and Control Integrated genomic surveillance systems allow for early outbreak detection and improved understanding of pathogen reservoirs, evolution, and modes of transmission across species boundaries [36]. The implementation of phylogenetic analysis enables assessment of transmission dynamics at the human-animal-environment interface, providing critical information for targeted interventions.
Agricultural Resilience and Food Security Cross-species genomic prediction methods developed in animal genetics can be applied to improve predictions of complex traits in humans, such as resilience, disease resistance, or responses to environmental stressors [10]. Similarly, approaches from human genetics may inform animal and plant breeding programs aimed at enhancing productivity and sustainability.
Environmental Health Monitoring By integrating genomic data from multiple species with environmental monitoring data, researchers can develop early warning systems for ecosystem health threats and identify potential zoonotic disease emergence hotspots before they lead to widespread outbreaks.
The implementation of cross-species genomic prediction systems faces several significant challenges:
Table 5: Implementation Challenges and Mitigation Strategies
| Challenge Category | Specific Challenges | Potential Mitigation Strategies |
|---|---|---|
| Technical Barriers | Data heterogeneity; Lack of semantic interoperability; Computational infrastructure requirements [36] | Develop common data standards; Implement middleware solutions; Cloud-based computing resources |
| Governance Issues | Data jurisdiction; Organizational mandates; Privacy and ethical concerns [36] | Establish clear data sharing agreements; Develop ethical frameworks; Engage legal experts early |
| Resource Constraints | Funding limitations; Technical capacity gaps; Aging data infrastructure [36] | Pursue cross-sector funding opportunities; Implement training programs; Phase modernization efforts |
| Analytical Complexities | Statistical methods for cross-species prediction; Validation frameworks; Interpretation challenges | Develop specialized statistical tools; Establish validation standards; Create interdisciplinary analysis teams |
The pathway toward operationalizing cross-species genomic predictions requires coordinated effort across multiple domains, but offers substantial potential benefits for improving health outcomes across species boundaries. As these methods mature and validation frameworks become more robust, the integration of genomic predictions into routine One Health practice will enhance our ability to predict, prevent, and respond to complex health challenges at the human-animal-environment interface.
The integration of genomic sciences into public health practice represents a transformative frontier for the One Health approach, which recognizes the interconnectedness of human, animal, and environmental health. This technical guide explores how the systematic evaluation of One Health surveillance systemsâspecifically through the OH-EpiCap toolâcan identify critical pathways for effectively incorporating genomic data into integrated health monitoring. As pathogen genomics generates increasingly large and complex datasets, the organizational and operational frameworks of our surveillance systems must evolve to translate this data into actionable public health insights. The lessons drawn from multi-country applications of OH-EpiCap provide a evidence-based foundation for strengthening the epidemiological capabilities that underpin genomic research applications across health domains.
The OH-EpiCap tool was developed to address a recognized gap in evaluating the multi-sectoral collaboration essential for effective One Health surveillance [101]. Its development responded to the reality that while international health agencies strongly promote One Health approaches, most surveillance systems remain functionally compartmentalized with limited cross-sectoral collaboration [101]. This evaluation framework enables institutes and governments to systematically characterize, assess, and monitor their One Health epidemiological surveillance capacities and capabilities, creating a foundation upon which advanced genomic technologies can be effectively leveraged.
OH-EpiCap is organized around three primary dimensions, each divided into four targets that contain four specific indicators, creating a comprehensive evaluation framework of 48 indicators total [101] [102]. This structure enables a systematic assessment of One Health surveillance systems across organizational, operational, and impact aspects.
Table 1: OH-EpiCap Evaluation Framework Structure
| Dimension | Targets | Key Indicator Examples |
|---|---|---|
| Organization | 1.1 Formalization | Common aim, support documentation, coordination roles, leadership |
| 1.2 Coverage & Transdisciplinarity | Relevant sectors, disciplines, geography, populations, hazards | |
| 1.3 Resources | Financial & human resources, shared operational resources, training | |
| 1.4 Evaluation & Resilience | Internal/external evaluations, corrective measures, adaptability | |
| Operational Activities | 2.1 Data Collection & Methods Sharing | Collaborative surveillance design, harmonized laboratory techniques |
| 2.2 Data Sharing | Data sharing agreements, FAIR principle compliance, data quality | |
| 2.3 Data Analysis & Interpretation | Multi-sectoral data analysis, shared statistical techniques | |
| 2.4 Communication | Internal/external communication, dissemination to decision-makers | |
| Impact | 3.1 Technical Outputs | Timely emergence detection, knowledge improvement, cost reduction |
| 3.2 Collaborative Added Value | Strengthened networks, international collaboration, common strategy | |
| 3.3 Immediate & Intermediate Outcomes | Advocacy, awareness, preparedness, evidence-based interventions | |
| 3.4 Ultimate Outcomes | Research opportunities, policy changes, better health outcomes |
The OH-EpiCap employs a semi-quantitative scoring methodology with four levels of compliance for each indicator, where higher values indicate better adherence to One Health principles [101] [102]. The implementation protocol involves:
The tool includes "Not Applicable" options for indicators irrelevant to specific surveillance contexts, enhancing its adaptability across different systems and hazards [102]. This consistent yet flexible approach enables meaningful cross-system comparisons while respecting contextual differences.
Figure 1: OH-EpiCap evaluation framework structure showing the three dimensions, twelve targets, and forty-eight indicators.
The OH-EpiCap tool was rigorously tested through eleven case studies conducted in 2022, focusing on diverse foodborne hazards and antimicrobial resistance surveillance systems across multiple European countries [101] [103]. These evaluations targeted:
The evaluation methodology involved multi-stakeholder workshops bringing together representatives from human, animal, and environmental health sectors to reach consensus on scoring. The number of assessors per case study ranged from one to five, with evaluation durations spanning 2-8 hours [105]. This collaborative scoring approach helped mitigate individual subjectivity while fostering shared understanding across sectors.
The case study evaluations revealed consistent strengths and weaknesses across the surveillance systems, providing actionable insights for system improvement:
Table 2: Common Strengths and Weaknesses Identified in Multi-Country Case Studies
| Aspect | Strengths | Weaknesses |
|---|---|---|
| Organization | Sector coverage, coordination during crises | Lack of operational leadership, formal governance |
| Operational Activities | Data collection protocols, information sharing during alerts | Poor FAIR data principle compliance, limited technique sharing |
| Impact | Timely detection, knowledge improvement | Unmeasured cost-effectiveness, unassessed health outcomes |
Specific challenges identified included:
These findings highlight critical gaps between technical capabilities and operational implementation that must be addressed to fully leverage genomic technologies in One Health surveillance.
Genomic technologies and bioinformatics provide powerful tools for decoding complex biological data, enabling comprehensive insights into pathogen evolution, transmission dynamics, and host-pathogen interactions across species and ecosystems [11]. When effectively integrated into One Health surveillance systems, these technologies enable:
The targeted application of genomic technologies is particularly valuable in tropical regions, which face disproportionate burdens of infectious diseases but have historically been underrepresented in genomic research initiatives [7]. Portable sequencing technologies and bespoke bioinformatics solutions offer promising approaches to address these regional disparities.
Successful integration of genomic data into One Health surveillance requires addressing several critical infrastructure challenges:
The OH-EpiCap evaluations identified that systems scoring higher on operational targets related to data sharing and analysis were better positioned to effectively incorporate genomic technologies, highlighting the interdependence between surveillance fundamentals and advanced technological capabilities.
Based on the OH-EpiCap case study findings, the following strategic approaches can enhance the integration of genomic sciences into One Health surveillance:
Table 3: Essential Research Reagents and Tools for One Health Genomic Surveillance
| Reagent/Tool Category | Specific Examples | Function in One Health Surveillance |
|---|---|---|
| Sequencing Platforms | Oxford Nanopore Technologies, Illumina systems | Enable pathogen whole genome sequencing and metagenomic analysis |
| Bioinformatics Pipelines | Genome assemblers, variant callers, phylogenetic tools | Support analysis of genomic data for outbreak investigation and pathogen characterization |
| Reference Databases | Custom-built databases for local flora/fauna, antimicrobial resistance genes | Provide context for interpreting genomic findings in local ecosystems |
| Data Integration Tools | Interoperable data platforms, shared analysis environments | Facilitate combination of genomic, epidemiological, and environmental data |
| Quality Control Materials | Standardized controls, validation panels | Ensure reliability and comparability of genomic data across sectors and time |
These research reagents and tools collectively enable the generation, analysis, and interpretation of genomic data within a One Health context, addressing the operational gaps identified through OH-EpiCap evaluations.
Figure 2: Implementation pathway from identified challenges through solutions to enhanced capabilities for One Health genomic surveillance.
The application of OH-EpiCap across diverse surveillance systems and geographical contexts provides compelling evidence that systematic evaluation of One Health capacities is both feasible and valuable for identifying concrete improvements. The tool's structured assessment of organization, operational activities, and impacts creates a comprehensive framework for strengthening the foundational elements necessary for effective genomic surveillance. As genomic technologies continue to transform disease detection and tracking, the integration challenges identified through these real-world evaluationsâparticularly regarding governance, data sharing, and impact measurementâmust be addressed to fully realize the potential of genomics within a One Health approach.
The lessons from OH-EpiCap implementation demonstrate that technical capabilities alone are insufficient without corresponding advancements in collaborative structures and processes. Future efforts should focus on developing and validating specific indicators for genomic data integration within the OH-EpiCap framework, enabling more targeted assessment of this critical capability. By building on these evaluation findings and addressing the identified gaps, the global health community can accelerate progress toward genuinely integrated surveillance systems that leverage genomic sciences to protect health across human, animal, and environmental domains.
The Genomics for Animal and Plant Disease Consortium (GAP-DC) represents a pioneering initiative that positions the United Kingdom at the forefront of applying genomic surveillance to national biosecurity challenges. Launched in July 2023 and backed by £10 million in government funding, GAP-DC operationalizes the One Health principle by integrating surveillance efforts across terrestrial and aquatic animals, plants, and their shared environments [107] [108] [20]. This in-depth technical guide examines the consortium's structure, methodological frameworks, and technical workflows, presenting it as a replicable model for leveraging genomic technologies to protect agriculture, trade, and public health against emerging biological threats.
The One Health approach is a collaborative, multisectoral, and transdisciplinary framework that recognizes the inextricable interconnection between the health of people, animals, plants, and their shared environment [9] [1]. In an era characterized by globalization, climate change, and increasing antimicrobial resistance, this approach is critical for addressing complex health challenges that transcend traditional sectoral boundaries [11] [67]. Zoonotic diseasesâpathogens that can transfer between animals and humansâaccount for a significant proportion of emerging infectious diseases, underscoring the necessity of integrated surveillance systems [9] [67].
Genomic surveillance has emerged as a powerful tool within the One Health arsenal. By decoding the complete genetic material of pathogens, researchers can achieve precise identification, track transmission dynamics in real-time, trace outbreaks to their source, and gain insights into pathogen evolution and host-pathogen interactions [11] [20]. The strategic application of high-throughput sequencing (HTS) technologies, supported by robust bioinformatics, enables a proactive shift from disease response to prediction and prevention [20] [7].
GAP-DC unites the UK's leading organizations in pathogen detection and genomics to create a cohesive national biosecurity front. The consortium is funded by £10 million from the Department for Environment, Food and Rural Affairs (Defra) and UK Research and Innovation (UKRI) [108]. Its governance structure integrates expertise from key partners, creating a unified framework for tackling established and emerging pathogens across ecosystems [107] [20].
Table 1: GAP-DC Core Consortium Partners and Their Specializations
| Organization | Area of Expertise | Role in GAP-DC |
|---|---|---|
| Animal and Plant Health Agency (APHA) | Animal health, disease diagnostics | Lead agency, overall project coordination [108] |
| Pirbright Institute (TPI) | Viral diseases of livestock | Pathogen genomics, vaccine research [107] |
| Royal Veterinary College (RVC) | Veterinary medicine, epidemiology | Infectious disease genomics, animal health [108] |
| Centre for Environment, Fisheries and Aquaculture Science (CEFAS) | Aquatic animal health | Health of marine and freshwater organisms [108] [20] |
| Fera Science Ltd | Plant health, agri-technology | Plant pathogen detection, food security [107] [20] |
| Forest Research | Forest science, tree health | Monitoring and controlling tree diseases [108] |
GAP-DC is strategically aligned with the UK Biological Security Strategy (2023), which is built on four foundational pillars: understand, prevent, detect, and respond [107] [20]. The initiative serves as a direct implementation vehicle for the "detect" pillar, enhancing the UK's early warning surveillance capabilities [20]. Furthermore, GAP-DC actively engages with global One Health initiatives such as the European Partnership for Animal Health and Welfare and the African Field Epidemiology Network, fostering methodological alignment and data sharing to address transboundary disease threats [20].
The operational scope of GAP-DC is defined by six interconnected work packages (WPs), each targeting a critical aspect of the genomic surveillance pipeline [20].
WP1: Frontline Pathogen Detection at Borders
WP2: Pathogen Spillover at Wildlife-Domestic Interfaces
WP3: Syndromic and Complex Disease Elucidation
WP4: Outbreak Management of New and Re-Emerging Diseases
WP5: Mitigation of Endemic Diseases
WP6: Stakeholder Coordination and Policy Translation
The following diagram illustrates the logical flow and integration between these work packages and the cross-cutting technical domains that support them.
Four key technical domains provide a unified foundation across all work packages, ensuring data robustness and operational coherence [20]:
The experimental protocols within GAP-DC rely on a suite of advanced genomic technologies and reagents. The following table details key solutions essential for work in this field.
Table 2: Key Research Reagent Solutions for Genomic Surveillance
| Reagent / Technology | Function / Application | Example Use in GAP-DC |
|---|---|---|
| Portable Sequencers (e.g., Oxford Nanopore) | Enables real-time, long-read sequencing in field laboratories or at borders [7]. | Rapid detection of pathogens at high-risk locations (WP1) [20]. |
| Metagenomic Sequencing Kits | Allows for unbiased sequencing of all nucleic acids in a complex sample (e.g., tissue, soil, water) [7]. | Discovery of novel pathogens and analysis of complex disease etiologies (WP2, WP3) [20]. |
| Whole Genome Amplification Kits | Amplifies minute quantities of DNA from low-biomass or difficult-to-culture samples. | Generating sufficient material for sequencing from environmental or clinical swabs [20]. |
| Targeted Enrichment Panels | Probes designed to capture and enrich specific genomic regions (e.g., for a virus family or AMR genes). | Cost-effective deep sequencing of specific pathogens during outbreaks (WP4) [20]. |
| Standardized Nucleic Acid Extraction Kits | Ensures high-quality, inhibitor-free DNA/RNA from diverse sample matrices (plant, animal, environmental). | Maintains consistency and reproducibility across different consortium partners and sample types [20]. |
| Bioinformatics Pipelines (e.g., BWA, GATK, IQ-TREE) | Open-source software for sequence alignment, variant calling, and phylogenetic reconstruction. | Core data processing and analysis for all work packages; essential for tracking pathogen evolution [11] [20]. |
The economic imperative for robust biosecurity initiatives like GAP-DC is compelling. The UK faces significant and growing economic threats from pests and diseases affecting its agriculture, livestock, and natural environment [108]. The following table summarizes the economic impact of key diseases, underscoring the need for advanced surveillance.
Table 3: Economic Impact of Select Animal and Plant Diseases in the UK
| Disease / Pest | Affected Sector | Economic Impact / Cost |
|---|---|---|
| Ash Dieback | Forestry, Environment | Projected to cost £15 billion over coming decades [108]. |
| Avian Influenza | Poultry Agriculture | Cost the poultry meat sector over £100 million in a two-year period [108]. |
| Invasive Species | Multiple Sectors | Estimated to cost the UK economy £4 billion annually [108]. |
Investments in preventive genomic surveillance, as exemplified by GAP-DC's £10 million funding, are economically rational when measured against the potential billions in losses from uncontrolled outbreaks [108] [67]. The consortium aims to mitigate these costs by enabling earlier detection, more precise control measures, and more resilient agricultural and food systems.
The UK's GAP-DC initiative presents a sophisticated, operational model for integrating genomic surveillance within a national One Health biosecurity strategy. By uniting diverse institutions under a common framework, implementing a structured work package methodology, and leveraging cutting-edge genomic technologies and bioinformatics, GAP-DC enhances the UK's capacity to safeguard its animal, plant, and economic health. This initiative demonstrates the transformative potential of a coordinated, cross-sectoral approach to disease surveillance, offering a valuable template for other nations seeking to build resilient and proactive biosecurity systems in an interconnected world. The continued success of such programs relies on sustained investment, interdisciplinary collaboration, and the ongoing integration of technological advancements into public health and agricultural policy.
The integration of genomic sciences within the One Health framework provides an indispensable, unified approach to tackling complex health challenges that transcend species and ecosystems. The key takeaways confirm that genomic surveillance enables predictive insights into pandemics and AMR, cross-species genomic analyses reveal critical transmission pathways of pathogens, and overcoming data integration and collaborative barriers is essential for success. Furthermore, evidence demonstrates that One Health genomic systems offer superior outcomes in outbreak management and biosecurity compared to traditional methods. Future efforts must focus on building equitable genomic capacities, particularly in tropical regions, standardizing global data-sharing protocols, and deepening the integration of AI and multi-omics data to fully realize the potential of One Health genomics in advancing biomedical research and precision public health.