
2015 09 02

Andre Merzky edited this page Sep 2, 2015 · 4 revisions
  • Agenda:
    • open TODOs:
      • WIP AM: review communication model with MS
      • WIP AM/MS: prepare action/support plan for activities on BW
        • objectives, challenges, timelines, phase 1
      • DONE IP: anaconda support on client side?
        • client side seems to 'just work', which is good
        • agent side is expected to fail, and does

      • HOLD AM: check if we can switch to HeartbeatMonitor for pilot health checks
      • HOLD AM: suggest alternatives for PTY layer resource consumption
      • HOLD MS: Anaconda/SuperMUC (October)
      • HOLD MS: add NAMD examples eventually? (Tom Bishop)
      • HOLD AM: set up example on how to use synapse as RP workload
      • HOLD AM: check documentation of state diagram in released docs
      • HOLD MT: move semantic elements of tools into RP.utils
      • HOLD AM: proposal for JSON export to persistent storage
      • HOLD MS: proposal for persistent experimental data storage
    • Development Progress:
      • release plan:
        • 0.36: mid September
          • 1 week merging of branches (agent split, profiling)
          • 1 week of testing
        • 0.37: end September
          • documentation, examples, tutorials
        • 0.38: end October
          • module refactor
          • final state model
      • testing:
        • TODO AT:
          • move to RADICAL-Jenkins (with one fixture)
          • this week
      • Yarn:
        • merge with agent_split
        • DONE IP: toward dynamic single node
        • TODO IP: toward dynamic multi node (lower priority)
        • TODO AM: daemon startup over LMs?
        • WIP IP: check what (non)queue system is used on chameleon(?) cloud
        • DONE IP: check target application sizes
          • current data sizes are considered small; "large" is only one order of magnitude larger
      • Spark
        • TODO GC: compare to Yarn integration
      • BW/OSG:
        • MS: transition of users is pending scale profiling and MPI support
        • spreading out to other Crays
        • SJ: OTP token for our allocation is still pending
        • AIMES wants to test OSG towards demo scenario
          • TODO MT: use feature/osgng branch for now for AIMES
          • TODO AM: ponder over missing state notifications
    • Data Roadmap:
    • Experiments:
      • Profiling III:
        • next week -- apologies
    • Publications:
      • --
    • AOB:
      • CECAM Tutorial
        • online documentation vs. online tutorial
        • begin to work on interactive examples (which involve user activity)
          • how to submit n tasks of size A and m tasks of size B, toward hosts X and Y
          • TODO AT: simple repex example
            • TODO AT: check with SJ about suitable example / exercise mode
          • TODO VB: simple MD example
          • TODO AM: simple RP example
        • execution env, software stack, applications/libraries
        • handle as documentation tickets
      • SC15 Tutorial
  • Notes:
    • what is up with comet for Antons?
    • TODO AM: how much does the AWS tutorial backup cost?
    • DONE AM: add Nikhil, George to lists