Skip to content

Latest commit

 

History

History
50 lines (38 loc) · 3 KB

README.md

File metadata and controls

50 lines (38 loc) · 3 KB

salmonellaml

Attributing the Source of Salmonella Enterica: A Machine Learning Approach to Understanding Pathogen Surveillance in Africa

Table of Contents

Introduction

Salmonella enterica is a pathogen that can be transmitted from animals to humans through food and is responsible for causing a considerable number of foodborne illnesses worldwide. These illnesses can manifest as gastrointestinal symptoms, fever, abdominal cramps, and, in severe cases, may lead to hospitalization and even deaths. While developed countries have made significant progress in reducing Salmonella infections through improved food safety practices and public health measures, it continues to pose a significant problem in low and middle-income countries. To address these challenges in resource-constrained settings, this project aims to utilize machine learning techniques to predict the sources of Salmonella outbreaks, with a specific focus on Africa. By combining data from various sources, including genomic data, epidemiological records, and environmental factors, the goal is to develop robust predictive models capable of identifying potential sources of Salmonella contamination. This approach could enable targeted interventions, leading to more effective prevention and control strategies.

Objectives

  1. Develop machine learning models to predict the source of Salmonella Enterica based on whole genome sequencing data
  2. Identify SNPs determinants associated with source attributions.

Data

The data that was used in this study was obtained from NCBI isolate brower database. The dataset used in the study is in accessions folder

Methods

workflow

Results

results results results results results results

Reproducibility

The reproducibility documentation of this project can be found here link

Team

  • Bonface Onyango
  • Brenda Muthoni
  • Asime Oba
  • Oscar Nyangiri
  • Olaitan Awe