
WeeklyTelcon_20160705

Geoff Paulsen edited this page Jul 5, 2016 · 4 revisions

Open MPI Weekly Telcon


  • Dialup Info: (Do not post to public mailing list or public wiki)

Attendees

  • Geoff Paulsen
  • Jeff Squyres
  • Howard Pritchard
  • Josh Hursey
  • Arm Patinyasakdikul
  • Joshua Ladd
  • Nathan Hjelm
  • Nysal
  • Ralph
  • Ryan Grant
  • Sylvain Jeaugey
  • Todd Kordenbrock

Agenda

Review 1.10

Review 2.0.x

Review Master MTT testing (https://mtt.open-mpi.org/)

  • Has improved: 233 failures, currently on Cisco.
    • Cisco: many of the Cisco failures are local cluster issues. Art is working on cleaning them up.
  • Jeff put a patch into MTT to allow thread hangs to be marked as hangs.
  • NVIDIA failures are all PMIx failures.
    • Gilles found a race condition in PMIx 2.0.
  • v2.x failures on Comm_spawn_loop.
  • Overall, not too bad.

MTT Dev status:

Status Updates:


Status Update Rotation

  1. Cisco, ORNL, UTK, NVIDIA
  2. Mellanox, Sandia, Intel
  3. LANL, Houston, IBM

