[pymvpa] Sensitivity analysis
Yaroslav Halchenko
debian at onerussian.com
Tue Sep 2 13:25:36 UTC 2014
On Tue, 02 Sep 2014, Dai Xijian wrote:
> Dear all,
> I faced with an embarrassing situation. I have completed some
> sensitivity analysis. When I used the script like below, I achieved a
> discouraging accuracy rate:0.51267.
> clf = LinearCSVMC()
> cv = CrossValidation(clf, NFoldPartitioner(), errorfx = lambda
> p,t:np.mean(p==t), enable_ca=['stats'],postproc = mean_sample())
> sensana = clf.get_sensitivity_analyzer()
> cv_sensana = RepeatedMeasure(sensana,
> ChainNode((NFoldPartitioner(), Splitter('partitions', attr_values=(1,)))))
> acc = cv(ds)
> sensmap_cv = cv_sensana(ds)
> Then I want to improve the accuracy. So I firstly select 20% features
> which contributed most for the predicted accuracy. However, I achieved a
> lower accuracy rate:0.40268.
> fsel =
> SensitivityBasedFeatureSelection(OneWayAnova(),FractionTailSelector(0.2,mode='select',tail='upper'))
> fclf = FeatureSelectionClassifier(clf, fsel)
> cv_sel = CrossValidation(fclf,NFoldPartitioner(), errorfx =
> lambda p,t:np.mean(p==t),enable_ca=['stats'],postproc = mean_sample())
> sensana_sel = fclf.get_sensitivity_analyzer()
> cv_sensana_sel = RepeatedMeasure(sensana_sel,
> ChainNode((NFoldPartitioner(), Splitter('partitions', attr_values=(1,)))))
> acc_sel = cv_sel(ds)
> sensmap_cv_sel = cv_sensana_sel(ds)
> What's wrong? Would you tell me?
most probably nothing is wrong -- just either that your data doesn't
have information about your targets, or information is strongly
non-univariate so univariate ANOVA based feature selection doesn't
improve results.
(if you are doing binary classification, 0.51 accuracy can well be just
due to a chance, so a slight augmentation of classifier just lead to
another chance level performance)
--
Yaroslav O. Halchenko, Ph.D.
http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org
Research Scientist, Psychological and Brain Sciences Dept.
Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755
Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419
WWW: http://www.linkedin.com/in/yarik
More information about the Pkg-ExpPsy-PyMVPA
mailing list