Skip to content
This repository has been archived by the owner on Jun 6, 2024. It is now read-only.

v0.13.0: June 2019 Release

Compare
Choose a tag to compare
@abuccts abuccts released this 04 Jun 09:19
· 1347 commits to master since this release

Release v0.13.0

New Features

  • OpenPAI protocol:

  • Web portal:

    • Add login page for guests (#2544)

    • Add user home page (#2614)

      • Job Status
      • My virtual clusters
      • Available GPU nodes (whole cluster)
      • My recent jobs

      home

    • Add new user management page (#2726, #2796)

    • User Management UX refactoring with new layout and themes (#2726, #2796)

Improvements

  • OpenPAI protocol:

    • Update example jobs in marketplace v2 for OpenPAI protocol (#2827)
  • Web portal:

    • Refine styles in job pages (#2829, #2856, #2858, #2862)
    • Refine alert message in job pages (#2698)
    • Reduce the build bundle size to improve webportal performance (#2715)
  • Rest server:

    • Add job v1 config to v2 converter (#2756)
    • Check default runtime before starting Docker (#2754)
  • Framework launcher:

    • Upgrade to Hadoop 2.9.0 (#2704)
  • Job exporter:

    • Change triggering rule for exporter hangs (#2766)
    • Add GPU temperature detection (#2757)
  • Watchdog:

    • Use /api/v1/pods to get all pods (#2750)
  • Deployement:

    • Allow user to use Backspace in paictl input (#2769)
    • Disable InfiniBand driver installation by default (#2595)

Documentation

  • Refine document of VS Code extension (#2707)
  • Add document for PAI storage (#2822)
  • OpenPAI protocol specification document (#2260)
  • Job submission v2 plugin document (#2820)
  • Update RESTful API document for API v2 (#2816)
  • Fix typos in document (#2818)

Bug Fixes

  • Web portal:

    • Fix text broken when create or edit user (#2849)
    • Fix token authentication bug (#2843)
    • Fix retry count's margin-top (#2845)
    • Fix job clone bug (#2836)
    • Fix home page's responsive layout (#2805)
    • Fix job list page filter bug (#2787)
    • Fix home page failed to load virtual cluster list bug (#2774)
  • Rest server:

    • Check duplicate job in submission v2 (#2837)
  • Hadoop:

    • Increase YARN kill container timeout (#2778)
    • Remove cross origin in resource manager (#2758)
    • Fix Haddoop AI matching nvidia-smi regex (#2681)

Known Issues

  • Deployments issues on NVIDIA DGX2 (#2742)