Decision-Tree-based data mining and rule induction for predicting and mapping soil bacterial diversity

Kangsuk Kim, Keunje Yoo, Dongwon Ki, Il Suh Son, Kyong Joo Oh, Joonhong Park

Research output: Contribution to journal › Article › peer-review

11 Citations (Scopus)


Soil microbial ecology plays a significant role in global ecosystems. Nevertheless, methods for model-based prediction and mapping have yet to be established for soil microbial ecology. The present study was undertaken to develop a framework integrating artificial intelligence and a geographical information system (GIS) for predicting and mapping soil bacterial diversity using pre-existing environmental geospatial database information, and to further evaluate the applicability of soil bacterial diversity mapping to planning the construction of eco-friendly roads. Using stratified random sampling, soil bacterial diversity was measured in 196 soil samples from a forest area where construction of an eco-friendly road was planned. Model accuracy, coherence, and tree analyses were systematically performed, and a four-class discretized decision tree (DT) with ordinary pair-wise partitioning (OPP) was selected as the optimal model among the five DT model variants tested. GIS-based simulations of the optimal DT model with varying weights assigned to soil ecological quality showed that including soil ecology among the environmental components considered in environmental impact assessment significantly affects the spatial distribution of overall environmental quality values as well as the determination of an environmentally optimized road route. This work suggests a guideline for using systematic accuracy, coherence, and tree analyses to select an optimal DT model from multiple candidate model variants, and demonstrates the applicability of the OPP-improved DT integrated with GIS in rule induction for mapping bacterial diversity. These findings also point to the significance of soil microbial ecology in environmental impact assessment and eco-friendly construction planning.
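As a rough illustration of the weighting idea in the abstract, the sketch below shows how changing the weight given to soil ecological quality can change which candidate road route looks environmentally preferable. It is not the authors' GIS model: the cell scores, the route names, the two-component weighted sum, and the rule "prefer the route that crosses the least overall quality" are all assumptions made for this toy example.

```python
# Hypothetical illustration only: every value, name, and scoring rule here
# is an assumption, not data or a method taken from the study.

# Each grid cell carries a class-coded quality score (1 = low, 4 = high) for
# soil ecology (e.g. a predicted diversity class) and for one other
# environmental component.
cells = {
    "A": {"soil": 4, "other": 1},
    "B": {"soil": 1, "other": 3},
    "C": {"soil": 2, "other": 2},
    "D": {"soil": 4, "other": 2},
}

# Two hypothetical candidate routes, each listed as the cells it crosses.
routes = {"north": ["A", "D"], "south": ["B", "C"]}


def overall_quality(cell, w_soil):
    """Weighted sum of the two component scores; the weights sum to 1."""
    return w_soil * cell["soil"] + (1.0 - w_soil) * cell["other"]


def route_impact(route, w_soil):
    """Assumed impact measure: total overall quality of the crossed cells."""
    return sum(overall_quality(cells[name], w_soil) for name in route)


def best_route(w_soil):
    """Return the name of the route that crosses the least overall quality."""
    return min(routes, key=lambda name: route_impact(routes[name], w_soil))


if __name__ == "__main__":
    for w in (0.0, 0.5):
        print(f"soil weight {w}: preferred route = {best_route(w)}")
```

In this toy setup the preferred route switches from "north" to "south" once soil ecology receives half of the weight, which mirrors, in miniature, the kind of sensitivity the abstract reports.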

Original language: English
Pages (from-to): 595-610
Number of pages: 16
Journal: Environmental Monitoring and Assessment
Issue number: 1-4
Publication status: Published - 2011 Jul

Bibliographical note

Funding Information:
Acknowledgement: This research was supported by the Korea Ministry of Environment via the GAIA project (grant number: 141-081-034). In addition, this research was supported by the WCU (World Class University) program through the National Research Foundation of Korea, funded by the Ministry of Education, Science and Technology (R33-10076).

All Science Journal Classification (ASJC) codes

  • Environmental Science(all)
  • Pollution
  • Management, Monitoring, Policy and Law

