[Trilinos-Users] zoltan tests.

Jason Slemons slemons at cray.com
Thu Oct 15 12:39:39 MDT 2009


I had a number of Zoltan tests fail for the same reason (see the output of ctest -VV in the attachment); it appears that ctest_zoltan.pl is in the wrong place. I am using PGI 9.0.4, and I ran ctest from my build directory (pgi.BUILD, which is a child of the trilinos-10.0-Source directory). Here is the error:

test 24
      Start 24: Zoltan_ch_simple_zoltan_parallel
24: Test command: /opt/cmake/2.8.0-rc3/bin/cmake -DTEST_CONFIG= -P /lus/nid00008/slemons/trilinos/trilinos-10.0-Source/pgi.BUILD/packages/zoltan/test/ch_simple/Zoltan_ch_simple_zoltan_parallel.cmake
24: Test timeout computed to be: 60
24:
24: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
24:
24: Advanced Test: Zoltan_ch_simple_zoltan_parallel
24:
24: Running test commands: TEST_0
24:
24: ================================================================================
24:
24: TEST_0
24:
24: Running: "/usr/bin/perl" "../ctest_zoltan.pl" "4" "Zoltan" "1" "/usr/bin/aprun"
24:
24: --------------------------------------------------------------------------------
24:
24: Can't open perl script "../ctest_zoltan.pl": No such file or directory
24:
24: --------------------------------------------------------------------------------
24:
24: TEST_0: Return code = 2
24: TEST_0: Pass criteria = Return code
24: TEST_0: Result = FAILED
24:
24: ================================================================================
24:
24: OVERALL FINAL RESULT: TEST FAILED
24:
24: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
24:
24/39 Test #24: Zoltan_ch_simple_zoltan_parallel ........***Failed  Error regular expression found in output. Regex=[FAILED]  0.10 sec


So I put a copy of ctest_zoltan.pl in 'trilinos-10.0-Source/pgi.BUILD/packages/zoltan/test/' (as opposed to 'trilinos-10.0-Source/packages/zoltan/test/', which I believe is where it was), and now the test 'passes' (though I think it essentially fails, since it reports "0 out of 0 tests PASSED") with a strange error about an unrecognized option '--mca' being passed to aprun. The output is below:
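
For reference, the manual copy described above can be sketched as a small shell helper. This is only a sketch under assumptions: the function name stage_ctest_script is made up for illustration, and the relative path packages/zoltan/test/ctest_zoltan.pl is the location I believe the script has in the source tree. It works around the missing file; it does not explain why the build did not place the script in the build tree.

```shell
#!/bin/sh
# Hypothetical helper (not part of Trilinos): copy ctest_zoltan.pl from the
# source tree into the matching directory of the build tree if it is missing.
stage_ctest_script() {
    src_root=$1     # e.g. .../trilinos-10.0-Source
    build_root=$2   # e.g. .../trilinos-10.0-Source/pgi.BUILD
    # Assumed location of the script relative to either root.
    rel=packages/zoltan/test/ctest_zoltan.pl

    if [ -f "$build_root/$rel" ]; then
        echo "already present: $build_root/$rel"
        return 0
    fi
    if [ ! -f "$src_root/$rel" ]; then
        echo "not found in source tree: $src_root/$rel" >&2
        return 1
    fi
    mkdir -p "$build_root/packages/zoltan/test"
    cp "$src_root/$rel" "$build_root/$rel"
    echo "copied to: $build_root/$rel"
}
```

From the build directory this would be called as `stage_ctest_script .. .`, since pgi.BUILD is a child of the source directory.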

24: Test command: /opt/cmake/2.8.0-rc3/bin/cmake -DTEST_CONFIG= -P /lus/nid00008/slemons/trilinos/trilinos-10.0-Source/pgi.BUILD/packages/zoltan/test/ch_simple/Zoltan_ch_simple_zoltan_parallel.cmake
24: Test timeout computed to be: 60
24:
24: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
24:
24: Advanced Test: Zoltan_ch_simple_zoltan_parallel
24:
24: Running test commands: TEST_0
24:
24: ================================================================================
24:
24: TEST_0
24:
24: Running: "/usr/bin/perl" "../ctest_zoltan.pl" "4" "Zoltan" "1" "/usr/bin/aprun"
24:
24: --------------------------------------------------------------------------------
24:
24: CTEST_FULL_OUTPUT
24: 4Zoltan1/usr/bin/aprun
24: DEBUG:  package Zoltan
24: /usr/bin/aprun: unrecognized option `--mca'
24: Usage: aprun [-D level][-L node_list][-N pes][-S pes][-T]
24:           [-a arch][-b][-cc cpu_list][-cp file][-F flag][-q][-m size]
24:           [-P wfd[,rfd]][-sl numa_node_list][-sn numa_nodes][-ss]
24:           [-t sec][-d depth] -n width command [options...]
24:           [: -n width cmd2 [: -n width cmd3 [: ...] ] ]
24:
24:       -a arch        Architecture type (e.g. xt or x2)
24:       -b             Bypass application transfer to compute node
24:       -cc cpu_list   Cpu binding list or keyword
24:                        ([cpu#[,cpu# | cpu1-cpu2] | x]...] | keyword)
24:       -cp file       Cpu binding placement filename
24:       -D level       Debug level (1, 2, 4, 8, and 15)
24:       -d depth       Depth of each processor (number of threads)
24:       -F flag        exclusive or share node resources flag
24:       -L node_list   Manual placement list (node[,node | node1-node2]...)
24:       -m size        Memory limit in megabytes
24:                      (default node memory/number of processors)
24:                      K|M|G suffix supported (16 == 16M == 16 megabytes)
24:                      Add an 'h' suffix to request per PE huge page memory
24:                        (XT only)
24:                      Add an 's' to the 'h' suffix to make the per PE huge
24:                        page memory size strict (required) (XT only)
24:       -N pes         PEs per node
24:       -n width       Number of processors needed
24:       -P pipes       Write[,read] pipes (not applicable for general use)
24:       -q             Quiet mode; suppress aprun non-fatal messages
24:       -S pes         PEs per NUMA node
24:       -sl numa_node_list  List of NUMA nodes to use
24:                       (numa_node[,numa_node | numa_node1-numa_node2]...)
24:       -sn numa_nodes Number of NUMA nodes needed per compute node
24:       -ss            Strict memory containment per NUMA node
24:       -T             Use synchronous TTY
24:       -t sec         Time limit in seconds (default unlimited)
24:       :              Separate binaries for MPMD mode
24:                      (Multiple Program, Multiple Data)
24: DEBUG:  mpiexec /usr/bin/aprun
24: DEBUG  Dir /lus/nid00008/slemons/trilinos/trilinos-10.0-Source/pgi.BUILD/packages/zoltan/test/ch_simple dirname simple
24: DEBUG  Outfilebase: ;  Dropbase:
24: Test simple:  0 out of 0 tests PASSED.
24:
24: --------------------------------------------------------------------------------
24:
24: TEST_0: Return code = 0
24: TEST_0: Pass criteria = Return code
24: TEST_0: Result = PASSED
24: ================================================================================
24:
24: OVERALL FINAL RESULT: TEST PASSED
24:
24: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
24:
24/39 Test #24: Zoltan_ch_simple_zoltan_parallel ........   Passed    0.19 sec

As I said, these are my only Zoltan failures. I think ctest_zoltan.pl should just be moved to the right place, but I also have no idea where the '--mca' option being passed to aprun is coming from.
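
One way to narrow down where '--mca' comes from is to search the test script and the generated test files for that literal string. `--mca` is an option of Open MPI's mpirun, which aprun does not accept, so something is presumably building an Open MPI-style command line. The sketch below assumes the option is hard-coded somewhere under the Zoltan test directories, which is not confirmed; the function name find_mca_refs is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: print every line under a directory that contains the
# literal string "--mca", prefixed with file name and line number.
find_mca_refs() {
    dir=$1
    # -r: recurse, -n: show line numbers, -F: treat the pattern as a fixed
    # string, -e: needed because the pattern itself starts with dashes.
    grep -rnF -e '--mca' -- "$dir"
}
```

For example, `find_mca_refs packages/zoltan/test` could be run in both the source tree and the build tree.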

Has anyone seen this, or does anyone know how to fix the '--mca' problem?
Best,
-Jason

-------------- next part --------------
A non-text attachment was scrubbed...
Name: ctest.PGI.1
Type: application/octet-stream
Size: 462239 bytes
Desc: ctest.PGI.1
Url : https://software.sandia.gov/pipermail/trilinos-users/attachments/20091015/2a418b62/attachment-0001.obj 

