Creator: Cao, L. J. - LINDAT/CLARIAH-CZ Catalog Search Results


1. An empirical study of dimensionality reduction in support vector machine

Creator:: Cao, L. J., JingQing, Zhang, Zongwu, Cai, and Guan, Liam Kian
Format:: unmediated and volume
Type:: model:article and TEXT
Subject:: Support vector machines, principal component analysis, kernel principal component analysis, and independent component analysis
Language:: English
Description:: Recently, the support vector machine (SVM) has become a popular tool in time series forecasting. In developing a successful SVM forecaster, the first step is feature extraction. This paper proposes applying principal component analysis (PCA), kernel principal component analysis (KPCA) and independent component analysis (ICA) to SVM for feature extraction. PCA linearly transforms the original inputs into new uncorrelated features. KPCA is a nonlinear PCA developed by using the kernel method. In ICA, the original inputs are linearly transformed into features that are mutually statistically independent. Experiments on the sunspot data, Santa Fe data set A and five real futures contracts show that an SVM with feature extraction using PCA, KPCA or ICA can perform better than one without feature extraction. Furthermore, among the three methods, KPCA feature extraction performs best, followed by ICA feature extraction.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
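
The abstract above states that PCA linearly transforms the inputs into uncorrelated features. The following is a minimal sketch of that one property on 2-D toy data, using only the Python standard library. It is not the paper's implementation; the function names `pca_2d` and `covariance` and the closed-form 2x2 rotation are assumptions made for illustration.

```python
import math

def pca_2d(points):
    """Project 2-D points onto their principal axes (toy PCA, no libraries)."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    centered = [(x - mx, y - my) for x, y in points]
    # Entries of the sample covariance matrix [[a, b], [b, c]].
    a = sum(x * x for x, _ in centered) / (n - 1)
    b = sum(x * y for x, y in centered) / (n - 1)
    c = sum(y * y for _, y in centered) / (n - 1)
    # Rotation angle that diagonalises a symmetric 2x2 matrix.
    theta = 0.5 * math.atan2(2 * b, a - c)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # First coordinate lies along the direction of largest variance.
    return [(cos_t * x + sin_t * y, -sin_t * x + cos_t * y) for x, y in centered]

def covariance(u, v):
    """Sample covariance of two equally long sequences."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    return sum((s - mu) * (t - mv) for s, t in zip(u, v)) / (n - 1)
```

After the projection, the covariance between the two new coordinates is zero up to rounding, which is the sense in which the extracted features are uncorrelated. In a forecasting pipeline the projected features would then be passed to the SVM in place of the raw inputs.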

2. An empirical study of feature selection in support vector machines

Creator:: Cao, L. J. and Jingqing, Zhang
Format:: unmediated and volume
Type:: model:article and TEXT
Subject:: Feature selection, support vector machines, structural risk minimization principle, saliency analysis, and genetic algorithm
Language:: English
Description:: Recently, the support vector machine (SVM) has been receiving increasing attention in the field of regression estimation due to its remarkable characteristics, such as good generalization performance, the absence of local minima and sparse representation of the solution. However, within the SVM framework, there are very few established approaches for identifying important features. Selecting significant features from all candidate features is the first step in regression estimation, and this procedure can improve the network performance, reduce the network complexity, and speed up the training of the network. This paper investigates the use of saliency analysis (SA) and a genetic algorithm (GA) in SVMs for selecting important features in the context of regression estimation. SA measures the importance of features by evaluating the sensitivity of the network output with respect to the feature input. The derivation of the sensitivity of the network output to the feature input in terms of the partial derivative in SVMs is presented, and a systematic approach to removing irrelevant features based on the sensitivity is developed. GA is an efficient search method based on the mechanics of natural selection and population genetics. A simple GA is used in which all features are mapped into binary chromosomes, with a bit of "1" representing the inclusion of the feature and a bit of "0" representing its absence. The performance of SA and GA is tested using two simulated non-linear time series and five real financial time series. The experiments show that with the simulated data, GA and SA detect the same true feature set from the redundant feature set, and that SA is also insensitive to the choice of kernel function. With the real financial data, GA and SA select different subsets of the features. Both selected feature sets achieve higher generalization performance in SVMs than the full feature set. In addition, the generalization performance of the feature sets selected by GA and SA is similar. All the results demonstrate that both SA and GA are effective in SVMs for identifying important features.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
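
The binary-chromosome encoding described in the abstract above (bit "1" includes a feature, bit "0" leaves it out) can be sketched as a small genetic algorithm. This is an illustrative sketch only, not the paper's procedure: `toy_fitness` is a hypothetical stand-in for the SVM validation performance that would score a feature subset in practice, and the tournament selection, one-point crossover, elitism, and all parameter values are assumptions.

```python
import random

def decode(chromosome):
    """Indices of the features whose bit is 1 (feature included)."""
    return [i for i, bit in enumerate(chromosome) if bit == 1]

def toy_fitness(chromosome, relevant=(0, 2)):
    """Hypothetical score: reward relevant features, penalise extras."""
    chosen = set(decode(chromosome))
    hits = len(chosen & set(relevant))
    extras = len(chosen - set(relevant))
    return hits - 0.5 * extras

def simple_ga(n_features, fitness, generations=30, pop_size=20,
              mutation_rate=0.05, seed=0):
    """Evolve bit-string chromosomes and return the fittest one found."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_features)]
           for _ in range(pop_size)]
    for _ in range(generations):
        # Elitism: carry the current best chromosome over unchanged.
        children = [max(pop, key=fitness)[:]]
        while len(children) < pop_size:
            # Tournament selection of two parents.
            p1 = max(rng.sample(pop, 3), key=fitness)
            p2 = max(rng.sample(pop, 3), key=fitness)
            # One-point crossover followed by bit-flip mutation.
            cut = rng.randint(1, n_features - 1)
            child = p1[:cut] + p2[cut:]
            child = [1 - b if rng.random() < mutation_rate else b
                     for b in child]
            children.append(child)
        pop = children
    return max(pop, key=fitness)
```

Calling `decode(simple_ga(6, toy_fitness))` returns the indices of the selected features. Replacing `toy_fitness` with a function that trains a regressor on the decoded subset and returns its validation score would bring the sketch closer to the setting the abstract describes.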

3. Saliency analysis of Support Vector Machines for feature selection

Creator:: Tay, Francis E. H. and Cao, L. J.
Format:: unmediated and volume
Type:: model:article and TEXT
Subject:: saliency analysis, feature selection, support vector machines, and structural risk minimization principle
Language:: English
Description:: This paper deals with the application of saliency analysis to Support Vector Machines (SVMs) for feature selection. The importance of each feature is ranked by evaluating the sensitivity of the network output to the feature input in terms of the partial derivative. A systematic approach to removing irrelevant features based on the sensitivity is developed. Two simulated non-linear time series and five real financial time series are examined in the experiment. Based on the simulation results, it is shown that saliency analysis is effective in SVMs for identifying important features.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
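
The sensitivity ranking described in the abstract above can be illustrated with a short sketch. It assumes a kernel expansion of the form f(x) = sum_i c_i K(s_i, x) + b with an RBF kernel K(s, x) = exp(-gamma * ||x - s||^2), and it summarises each feature by the mean absolute partial derivative over a set of points. The kernel choice, the averaging rule, and the names `rbf`, `gradient`, and `saliency` are assumptions for illustration, not the derivation given in the paper.

```python
import math

def rbf(u, v, gamma):
    """RBF kernel value exp(-gamma * ||u - v||^2)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v)))

def gradient(x, support_vectors, coeffs, gamma):
    """Partial derivatives of f(x) = sum_i coeffs[i] * rbf(sv_i, x) + b."""
    grad = [0.0] * len(x)
    for sv, c in zip(support_vectors, coeffs):
        k = rbf(sv, x, gamma)
        for j in range(len(x)):
            # d/dx_j of exp(-gamma * ||x - sv||^2) = -2 * gamma * (x_j - sv_j) * k
            grad[j] += c * k * (-2.0 * gamma) * (x[j] - sv[j])
    return grad

def saliency(points, support_vectors, coeffs, gamma=1.0):
    """Mean absolute partial derivative per feature over the given points."""
    totals = [0.0] * len(points[0])
    for x in points:
        for j, g in enumerate(gradient(x, support_vectors, coeffs, gamma)):
            totals[j] += abs(g)
    return [t / len(points) for t in totals]
```

Features with the smallest saliency values are the candidates for removal. In practice the support vectors and coefficients would come from a trained SVM rather than being supplied by hand.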
