Improving generalization capability of neural networks based on simulated annealing

Yeejin Lee, Jong Seok Lee, Sun Young Lee, Cheol Hoon Park

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

This paper presents a single-objective and a multiobjective stochastic optimization algorithms for global training of neural networks based on simulated annealing. The algorithms overcome the limitation of local optimization by the conventional gradient-based training methods and perform global optimization of the weights of the neural networks. Especially, the multiobjective training algorithm is designed to enhance generalization capability of the trained networks by minimizing the training error and the dynamic range of the network weights simultaneously. For fast convergence and good solution quality of the algorithms, we suggest the hybrid simulated annealing algorithm with the gradient-based local optimization method. Experimental results show that the performance of the trained networks by the proposed methods is better than that by the gradient-based local training algorithm and, moreover, the generalization capability of the networks is significantly improved by preventing overfitting phenomena.

Original languageEnglish
Title of host publication2007 IEEE Congress on Evolutionary Computation, CEC 2007
Pages3447-3453
Number of pages7
DOIs
Publication statusPublished - 2007
Event2007 IEEE Congress on Evolutionary Computation, CEC 2007 - , Singapore
Duration: 2007 Sept 252007 Sept 28

Publication series

Name2007 IEEE Congress on Evolutionary Computation, CEC 2007

Other

Other2007 IEEE Congress on Evolutionary Computation, CEC 2007
Country/TerritorySingapore
Period07/9/2507/9/28

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Software
  • Theoretical Computer Science

Fingerprint

Dive into the research topics of 'Improving generalization capability of neural networks based on simulated annealing'. Together they form a unique fingerprint.

Cite this