[Neurodebian-users] FSL and Condor
Bertram Walter
Bertram.Walter at psychol.uni-giessen.de
Sat Mar 31 11:47:05 UTC 2012
Hi,
here are my experiences with FSL and Condor:
On a Ubuntu 11.10 platform I have a working FSL-4.1.9-2-nd60+1
installation and fsl-selftest (feeds 4.1.9.1) works fine.
/etc/fsl/fsl.sh is sourced in .bashrc
- I installed condor as described in
http://neuro.debian.net/blog/2012/2012-03-09_parallelize_fsl_with_condor.html.
- condor_status gives now:
Name OpSys Arch State Activity LoadAv Mem
ActvtyTime
slot1 at bion05.uni-g LINUX X86_64 Owner Idle 0.370 3960
0+00:00:04
slot2 at bion05.uni-g LINUX X86_64 Owner Idle 0.000 3960
0+00:00:05
Total Owner Claimed Unclaimed Matched Preempting
Backfill
X86_64/LINUX 2 2 0 0 0
0 0
Total 2 2 0 0 0
0 0
- I started time fsl-selftest feat without FSLPARALLEL and got
...
start time = Fr 30. Mär 16:32:41 CEST 2012
hostname = bion05
os = Linux bion05 3.0.0-17-generic #30-Ubuntu SMP Thu Mar 8 20:45:39 UTC
2012 x86_64 x86_64 x86_64 GNU/Linux
Starting FEAT at Fr 30. Mär 16:32:41 CEST 2012
...
end time = Fr 30. Mär 16:36:28 CEST 2012
...
real 3m46.880s
user 3m39.774s
sys 0m4.216s
- I added the line
export FSLPARALLEL=condor
to .bashrc
- I started time fsl-selftest feat in a newly opened terminal
condor_q showed
-- Submitter: : <127.0.0.1:50413> : bion05.uni-giessen.de
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
12.0 walter 3/30 16:43 0+00:00:05 R 0 1.0 bash
13.0 walter 3/30 16:43 0+00:00:00 H 0 1.0 bash
14.0 walter 3/30 16:43 0+00:00:00 I 0 0.0
cluster13_sentinel
15.0 walter 3/30 16:43 0+00:00:00 H 0 1.0 bash
16.0 walter 3/30 16:43 0+00:00:00 I 0 0.0
cluster15_sentinel
17.0 walter 3/30 16:43 0+00:00:00 H 0 1.0 bash
18.0 walter 3/30 16:43 0+00:00:00 I 0 0.0
cluster17_sentinel
19.0 walter 3/30 16:43 0+00:00:00 H 0 1.0 bash
20.0 walter 3/30 16:43 0+00:00:00 I 0 0.0
cluster19_sentinel
21.0 walter 3/30 16:43 0+00:00:00 H 0 1.0 bash
22.0 walter 3/30 16:43 0+00:00:00 I 0 0.0
cluster21_sentinel
11 jobs; 0 completed, 0 removed, 5 idle, 1 running, 5 held, 0 suspended
fsl-seftest resulted in
...
start time = Fr 30. Mär 16:43:48 CEST 2012
hostname = bion05
os = Linux bion05 3.0.0-17-generic #30-Ubuntu SMP Thu Mar 8 20:45:39 UTC
2012 x86_64 x86_64 x86_64 GNU/Linux
...
Starting FEAT at Fr 30. Mär 16:43:48 CEST 2012
...
end time = Fr 30. Mär 16:49:01 CEST 2012
...
real 5m13.042s
user 0m6.588s
sys 0m0.712s
Oooh, much slower than without Condor
- I changed in /etc/condor/config.d/00debconf the lines
CONDOR_HOST = 127.0.0.1
ALLOW_WRITE = 127.0.0.1
as described in Neurodebian-users Digest, Vol 19, Issue 10
and restarted ubuntu for restarting condor completely.
- I started time fsl-selftest feat again.
After 15 minutes I checked condor_q:
-- Submitter: : <127.0.0.1:45824> : bion05.uni-giessen.de
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
23.0 walter 3/30 17:05 0+00:00:00 I 0 1.0 bash
24.0 walter 3/30 17:05 0+00:00:00 H 0 1.0 bash
25.0 walter 3/30 17:05 0+00:15:00 R 0 0.0
cluster24_sentinel
26.0 walter 3/30 17:05 0+00:00:00 H 0 1.0 bash
27.0 walter 3/30 17:05 0+00:15:00 R 0 0.0
cluster26_sentinel
28.0 walter 3/30 17:05 0+00:00:00 H 0 1.0 bash
29.0 walter 3/30 17:05 0+00:15:00 R 0 0.0
cluster28_sentinel
30.0 walter 3/30 17:05 0+00:00:00 H 0 1.0 bash
31.0 walter 3/30 17:05 0+00:15:00 R 0 0.0
cluster30_sentinel
32.0 walter 3/30 17:05 0+00:00:00 H 0 1.0 bash
33.0 walter 3/30 17:05 0+00:15:00 R 0 0.0
cluster32_sentinel
11 jobs; 0 completed, 0 removed, 1 idle, 5 running, 5 held, 0 suspended
- Quite disappointed I killed fsl-selftest.
What was going wrong?
Bertram
--
Dr. Bertram Walter
Bender Institute of Neuroimaging
University of Giessen
Otto-Behaghel-Str. 10H
35394 Giessen
Germany
Phone +49 (641) 99-26307
or +49 (641) 99-26331 (Secretary)
Fax +49 (641) 99-26309
www.bion.de
More information about the Neurodebian-users
mailing list