TY - GEN
T1 - Logistic ensembles for random spherical linear oracles
AU - Peterson, Leif E.
AU - Coleman, Matthew A.
PY - 2007
Y1 - 2007
N2 - A random spherical linear oracle (RSLO) ensemble classifier for DNA microarray gene expression data is proposed. The oracle assigns different training(testing) samples to 2 sub-classifiers of the same type using hyperplane splits in order to increase the diversity of voting results since errors are not shared across sub-classifiers. Eleven classifiers were evaluated for performance as the base classifier including k nearest neighbor (kNN), naïve Bayes classifier (NBC), linear discriminant analysis (LDA), learning vector quantization (LVQ1), polytomous logistic regression (PLOG), artificial neural networks (ANN), constricted particle swarm optimization (CPSO), kernel regression (KREG), radial basis function networks (RBFN), gradient descent support vector machines (SVMGD), and least squares support vector machines (SVMLS). Logistic ensembles (PLOG) resulted in the best performance when used as a base classifier for RSLO. Random hyperplane splits used in RSLO resulted in degeneration of performance at the greatest levels of CV-fold and iteration number when compared with hyperplane splits in principal direction linear oracle (PDLO), which increased with increasing CV-fold and iteration number.
AB - A random spherical linear oracle (RSLO) ensemble classifier for DNA microarray gene expression data is proposed. The oracle assigns different training(testing) samples to 2 sub-classifiers of the same type using hyperplane splits in order to increase the diversity of voting results since errors are not shared across sub-classifiers. Eleven classifiers were evaluated for performance as the base classifier including k nearest neighbor (kNN), naïve Bayes classifier (NBC), linear discriminant analysis (LDA), learning vector quantization (LVQ1), polytomous logistic regression (PLOG), artificial neural networks (ANN), constricted particle swarm optimization (CPSO), kernel regression (KREG), radial basis function networks (RBFN), gradient descent support vector machines (SVMGD), and least squares support vector machines (SVMLS). Logistic ensembles (PLOG) resulted in the best performance when used as a base classifier for RSLO. Random hyperplane splits used in RSLO resulted in degeneration of performance at the greatest levels of CV-fold and iteration number when compared with hyperplane splits in principal direction linear oracle (PDLO), which increased with increasing CV-fold and iteration number.
UR - http://www.scopus.com/inward/record.url?scp=47349092176&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=47349092176&partnerID=8YFLogxK
U2 - 10.1109/ICMLA.2007.67
DO - 10.1109/ICMLA.2007.67
M3 - Conference contribution
AN - SCOPUS:47349092176
SN - 0769530699
SN - 9780769530697
T3 - Proceedings - 6th International Conference on Machine Learning and Applications, ICMLA 2007
SP - 618
EP - 623
BT - Proceedings - 6th International Conference on Machine Learning and Applications, ICMLA 2007
T2 - 6th International Conference on Machine Learning and Applications, ICMLA 2007
Y2 - 13 December 2007 through 15 December 2007
ER -