[Trilinos-Users] Tpetra+Anasazi performance

David Hysom hysom1 at llnl.gov
Wed May 14 11:58:40 MDT 2014

 >MPI should give better scaling than this

absolutely! But we have customers (funding) who want our application
to run in a shared memory environment. There are also aspects of our
code (e.g, connected-components) that are non-trivial to port, esp. with
good performance, to MPI.


On 05/14/2014 10:57 AM, Erik Boman wrote:
> David,
> MPI should give better scaling than this :)
> Others might be better positioned to comment on the performance of 
> Tpetra/Kokkos.
> Erik
> David Hysom wrote:
>> Hi,
>> Please see the attached for an example of what we're seeing for
>> strong scaling for LOBPCG+Tpetra, in a shared memory environment.
>> Bottom line is, this example shows a speedup of 3.0
>> We've run many problems with varying parameters/matrices, and typically
>> only see speedups between 2.0 and 3.0
>> Is this expected? Is there anything wrt upcoming trilinos development
>> that might increase scalability?
>> Stats are for trilinos-11.6.1
>> A second question: we've tested with OpenMP, Pthreads, and TBB.
>> We always find that OpenMP gives the best results (shortest execution
>> time), although Pthreads and TBB are reasonably close. Do you know
>> of circumstances (not limited to Anasazi) where Pthreads or TBB
>> outperform OpenMP?
>> thanks, David
>> ------------------------------------------------------------------------
>> _______________________________________________
>> Trilinos-Users mailing list
>> Trilinos-Users at software.sandia.gov
>> https://software.sandia.gov/mailman/listinfo/trilinos-users

More information about the Trilinos-Users mailing list