A metagenome-derived artificial intelligence modeling framework advances the predictive diagnosis and interpretation of petroleum-polluted groundwater

Jonathan Wijaya, Joonhong Park, Yuyi Yang, Sharf Ilahi Siddiqui, Seungdae Oh

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Groundwater (GW) quality monitoring is vital for sustainable water resource management. The present study introduced a metagenome-derived machine learning (ML) model aimed at enhancing the predictive understanding and diagnostic interpretation of GW pollution associated with petroleum. In this framework, taxonomic and metabolic profiles derived from GW metagenomes were combined for use as the input dataset. By employing strategies that optimized data integration, model selection, and parameter tuning, we achieved a significant increase in diagnostic accuracy for petroleum-polluted GW. Explanatory artificial intelligence techniques identified petroleum degradation pathways and Rhodocyclaceae as strong predictors of a pollution diagnosis. Metagenomic analysis corroborated the presence of gene operons encoding aminobenzoate and xylene biodegradation within the de novo assembled genome of Rhodocyclaceae. Our genome-centric metagenomic analysis thus clarified the ecological interactions associated with microbiomes in breaking down petroleum contaminants, validating the ML-based diagnostic results. This metagenome-derived ML framework not only enhances the predictive diagnosis of petroleum pollution but also offers interpretable insights into the interaction between microbiomes and petroleum.

Original languageEnglish
Article number134513
JournalJournal of Hazardous Materials
Volume472
DOIs
Publication statusPublished - 2024 Jul 5

Bibliographical note

Publisher Copyright:
© 2024 Elsevier B.V.

All Science Journal Classification (ASJC) codes

  • Environmental Engineering
  • Environmental Chemistry
  • Waste Management and Disposal
  • Pollution
  • Health, Toxicology and Mutagenesis

Fingerprint

Dive into the research topics of 'A metagenome-derived artificial intelligence modeling framework advances the predictive diagnosis and interpretation of petroleum-polluted groundwater'. Together they form a unique fingerprint.

Cite this