[Trilinos-Users] zoltan tests.
Jason Slemons
slemons at cray.com
Thu Oct 15 12:39:39 MDT 2009
I had a number of tests fail for the same reason in zoltan(see the output of ctest -VV in the attachment), it appears that ctest_zoltan.pl is in the wrong place. I am using pgi 9.0.4, and I typed ctest from my build directory(pgi.BUILD; which is a child of trilinos-10.0-Source dir). Here's is the error:
test 24
Start 24: Zoltan_ch_simple_zoltan_parallel
24: Test command: /opt/cmake/2.8.0-rc3/bin/cmake -DTEST_CONFIG= -P /lus/nid00008/slemo\
ns/trilinos/trilinos-10.0-Source/pgi.BUILD/packages/zoltan/test/ch_simple/Zoltan_ch_si\
mple_zoltan_parallel.cmake
24: Test timeout computed to be: 60
24:
24: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
24:
24: Advanced Test: Zoltan_ch_simple_zoltan_parallel
24:
24: Running test commands: TEST_0
24:
24: ================================================================================
24:
24: TEST_0
24:
24: Running: "/usr/bin/perl" "../ctest_zoltan.pl" "4" "Zoltan" "1" "/usr/bin/aprun"
24:
24: --------------------------------------------------------------------------------
24:
24: Can't open perl script "../ctest_zoltan.pl": No such file or directory
24:
24: --------------------------------------------------------------------------------
24:
24: TEST_0: Return code = 2
24: TEST_0: Pass criteria = Return code
24: TEST_0: Result = FAILED
24:
24: ================================================================================
24:
24: OVERALL FINAL RESULT: TEST FAILED
24:
24: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
24:
24/39 Test #24: Zoltan_ch_simple_zoltan_parallel ........***Failed Error regular expr\
ession found in output. Regex=[FAILED] 0.10 sec
So I put a copy of ctest_zoltan.pl in 'trilinos-10.0-Source/pgi.BUILD/packages/zoltan/test/' (as opposed to 'trilinos-10.0-Source/packages/zoltan/test/' which I believe is where it was) and now it 'passes'(though essentially fails I think) with a strange error about having passed unrecognized options 'mca' to aprun. The output is below:
24: Test command: /opt/cmake/2.8.0-rc3/bin/cmake -DTEST_CONFIG= -P /lus/nid00008/slemo\
ns/trilinos/trilinos-10.0-Source/pgi.BUILD/packages/zoltan/test/ch_simple/Zoltan_ch_si\
mple_zoltan_parallel.cmake
24: Test timeout computed to be: 60
24:
24: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
24:
24: Advanced Test: Zoltan_ch_simple_zoltan_parallel
24:
24: Running test commands: TEST_0
24:
24: ================================================================================
24:
24: TEST_0
24:
24: Running: "/usr/bin/perl" "../ctest_zoltan.pl" "4" "Zoltan" "1" "/usr/bin/aprun"
24:
24: --------------------------------------------------------------------------------
24:
24: CTEST_FULL_OUTPUT
24: 4Zoltan1/usr/bin/aprun
24: DEBUG: package Zoltan
24: /usr/bin/aprun: unrecognized option `--mca'
24: Usage: aprun [-D level][-L node_list][-N pes][-S pes][-T]
24: [-a arch][-b][-cc cpu_list][-cp file][-F flag][-q][-m size]
24: [-P wfd[,rfd]][-sl numa_node_list][-sn numa_nodes][-ss]
24: [-t sec][-d depth] -n width command [options...]
24: [: -n width cmd2 [: -n width cmd3 [: ...] ] ]
24:
24: -a arch Architecture type (e.g. xt or x2)
24: -b Bypass application transfer to compute node
24: -cc cpu_list Cpu binding list or keyword
24: ([cpu#[,cpu# | cpu1-cpu2] | x]...] | keyword)
24: -cp file Cpu binding placement filename
24: -D level Debug level (1, 2, 4, 8, and 15)
24: -d depth Depth of each processor (number of threads)
24: -F flag exclusive or share node resources flag
24: -L node_list Manual placement list (node[,node | node1-node2]...)
24: -m size Memory limit in megabytes
24: (default node memory/number of processors)
24: K|M|G suffix supported (16 == 16M == 16 megabytes)
24: Add an 'h' suffix to request per PE huge page memory
24: (XT only)
24: Add an 's' to the 'h' suffix to make the per PE huge
24: page memory size strict (required) (XT only)
24: -N pes PEs per node
24: -n width Number of processors needed
24: -P pipes Write[,read] pipes (not applicable for general use)
24: -q Quiet mode; suppress aprun non-fatal messages
24: -S pes PEs per NUMA node
24: -sl numa_node_list List of NUMA nodes to use
24: (numa_node[,numa_node | numa_node1-numa_node2]...)
24: -sn numa_nodes Number of NUMA nodes needed per compute node
24: -ss Strict memory containment per NUMA node
24: -T Use synchronous TTY
24: -t sec Time limit in seconds (default unlimited)
24: : Separate binaries for MPMD mode
24: (Multiple Program, Multiple Data)
24: DEBUG: mpiexec /usr/bin/aprun
24: DEBUG Dir /lus/nid00008/slemons/trilinos/trilinos-10.0-Source/pgi.BUILD/packages/\
zoltan/test/ch_simple dirname simple
24: DEBUG Outfilebase: ; Dropbase:
24: Test simple: 0 out of 0 tests PASSED.
24:
24: --------------------------------------------------------------------------------
24:
24: TEST_0: Return code = 0
24: TEST_0: Pass criteria = Return code
24: TEST_0: Result = PASSED
24: ================================================================================
24:
24: OVERALL FINAL RESULT: TEST PASSED
24:
24: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
24:
24/39 Test #24: Zoltan_ch_simple_zoltan_parallel ........ Passed 0.19 sec
Like I said these are my only zoltan failures. I think ctest_zoltan should just be moved to the right place, but I also have no idea where the 'mca' option is coming from to aprun.
Has anyone seen this or know how to fix the 'mca' problem?
Best,
-Jason
-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://software.sandia.gov/pipermail/trilinos-users/attachments/20091015/2a418b62/attachment-0001.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ctest.PGI.1
Type: application/octet-stream
Size: 462239 bytes
Desc: ctest.PGI.1
Url : https://software.sandia.gov/pipermail/trilinos-users/attachments/20091015/2a418b62/attachment-0001.obj
More information about the Trilinos-Users
mailing list