[pymvpa] New Elastic Net classifier
Per B. Sederberg
persed at princeton.edu
Tue Feb 24 18:35:03 UTC 2009
It's actually pretty slow and the time it takes scales (non-linearly)
with the number of features. I've had to do a feature selection step
first to get it in the range of acceptability. You are likely running
into memory issues, too (i.e., swapping quite a bit), but I could be
wrong. I'm not sure this speed issue is a problem of it being in R or
rather that the algorithm is slowish.
Can you try performing an easy feature selection of say 5-10K
features, first? One to try might be the new feature stability
measure I added that looks for the most stable features across runs
for each label (different from ANOVA):
stab_enet = FeatureSelectionClassifier(
ENET(lm=1.0,max_steps=500,trace=False,normalize=False),
SensitivityBasedFeatureSelection(
CorrStability(),
FixedNElementTailSelector(5000,mode='select',tail='upper')),
descr="ENET on 5K best(CorrStability) features")
Latros,
P
On Tue, Feb 24, 2009 at 1:20 PM, Michael Hanke <michael.hanke at gmail.com> wrote:
> Hi Per,
>
> On Wed, Feb 04, 2009 at 08:33:49AM +0100, Michael Hanke wrote:
>> On Tue, Feb 03, 2009 at 10:52:03PM -0500, Per B. Sederberg wrote:
>> > Hi Folks:
>> >
>> > I just wanted to let you know that I just added in a new
>> > regression-based classifier to PyMVPA that I think is going to be very
>> > effective: the elastic net (ENET). It's a variant on least angle
>> > regression (LARS), but it has both L1 and L2 priors so that it
>> > sparsifies the features AND keeps redundant features, which helps with
>> > both generalizability and with sensitivity analysis.
>>
>> Sounds great -- I will take a look if it is friendly to my datasets ;-)
>
> Started playing (or intended to). I started a simple cross-validation
> (5-fold) on a 80000-feature dataset (not much useful signal in it). I
> was curious to see how ENET performs. However, I started it on Monday
> and it is still not done. What is the perceived speed of the R
> implementation in comparison to other classifiers?
>
>
> Cheers,
>
> Michael
>
>
> --
> GPG key: 1024D/3144BE0F Michael Hanke
> http://apsy.gse.uni-magdeburg.de/hanke
> ICQ: 48230050
>
> _______________________________________________
> Pkg-ExpPsy-PyMVPA mailing list
> Pkg-ExpPsy-PyMVPA at lists.alioth.debian.org
> http://lists.alioth.debian.org/mailman/listinfo/pkg-exppsy-pymvpa
>
More information about the Pkg-ExpPsy-PyMVPA
mailing list