Open Access
December 2015 “Virus hunting” using radial distance weighted discrimination
Jie Xiong, D. P. Dittmer, J. S. Marron
Ann. Appl. Stat. 9(4): 2090-2109 (December 2015). DOI: 10.1214/15-AOAS869


Motivated by the challenge of using DNA-seq data to identify viruses in human blood samples, we propose a novel classification algorithm called “Radial Distance Weighted Discrimination” (or Radial DWD). This classifier is designed for binary classification, assuming one class is surrounded by the other class in very diverse radial directions, which is seen to be typical for our virus detection data. This separation of the 2 classes in multiple radial directions naturally motivates the development of Radial DWD. While classical machine learning methods such as the Support Vector Machine and linear Distance Weighted Discrimination can sometimes give reasonable answers for a given data set, their generalizability is severely compromised because of the linear separating boundary. Radial DWD addresses this challenge by using a more appropriate (in this particular case) spherical separating boundary. Simulations show that for appropriate radial contexts, this gives much better generalizability than linear methods, and also much better than conventional kernel based (nonlinear) Support Vector Machines, because the latter methods essentially use much of the information in the data for determining the shape of the separating boundary. The effectiveness of Radial DWD is demonstrated for real virus detection.


Download Citation

Jie Xiong. D. P. Dittmer. J. S. Marron. "“Virus hunting” using radial distance weighted discrimination." Ann. Appl. Stat. 9 (4) 2090 - 2109, December 2015.


Received: 1 May 2014; Revised: 1 August 2015; Published: December 2015
First available in Project Euclid: 28 January 2016

zbMATH: 06560823
MathSciNet: MR3456367
Digital Object Identifier: 10.1214/15-AOAS869

Keywords: DNA sequencing , high-dimension low-sample size data analysis , nonlinear classification , Virus hunting

Rights: Copyright © 2015 Institute of Mathematical Statistics

Vol.9 • No. 4 • December 2015
Back to Top