HDLSS discrimination with adaptive data piling

Myung Hee Lee, Jeongyoun Ahn, Yongho Jeon

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

We propose new discrimination methods for classification of high dimension, low sample size (HDLSS) data that regularize the degree of data piling. The within-class scatter of the HDLSS data, when projected onto a low-dimensional discriminant subspace, can be selected to be arbitrarily small. Using this fact, we develop two different ways of tuning the amount of within-class scatter, or equivalently, the degree of data piling. In the first approach,we consider a linear path connecting the maximal data piling and the least data piling directions. We also formulate a problem of finding the optimal classifier under a constraint on data piling. The data piling regularization methods are extended to multicategory problems. Simulated and real data examples show competitive performances of the proposed classification methods. Supplementary materials for this article are available online on the journal web site.

Original languageEnglish
Pages (from-to)433-451
Number of pages19
JournalJournal of Computational and Graphical Statistics
Volume22
Issue number2
DOIs
Publication statusPublished - 2013

Bibliographical note

Funding Information:
Ahn’s research was partly supported by the NSF grant DMS-0805758 and NIH grant 1R21CA152460-01A1. The authors are grateful to an associate editor and the reviewers for helpful comments.

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Discrete Mathematics and Combinatorics
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'HDLSS discrimination with adaptive data piling'. Together they form a unique fingerprint.

Cite this