Skip to content

Tensorboard dev #96

@elleobrien

Description

@elleobrien

One of the use cases we want to show for the CML site is producing a tensorboard link in the report. For example:

View your TensorBoard live at: https://tensorboard.dev/experiment/E31QaKfWTQaQuEUR6H03UA/

I'm running into some trouble using the tensorboard dev command inside my GH runner. This is my workflow (project repo is here):

name: train-my-model

on: [push]

jobs:
  run:
    runs-on: [ubuntu-latest]
    container: docker://dvcorg/cml-py3:latest

    steps:
      - uses: actions/checkout@v2

      - name: dvc_cml_run
        env:
          repo_token: ${{ secrets.GITHUB_TOKEN }}
        run: |
          pip3 install -r requirements.txt
          python train.py
          
          tensorboard dev upload --logdir logs

I've confirmed the commands work on my local machine. On the runner, I'm getting this error message:

Traceback (most recent call last):
  File "/usr/local/bin/tensorboard", line 8, in <module>
    sys.exit(run_main())
  File "/usr/local/lib/python3.6/dist-packages/tensorboard/main.py", line 75, in run_main
    app.run(tensorboard.main, flags_parser=tensorboard.configure)
  File "/usr/local/lib/python3.6/dist-packages/absl/app.py", line 299, in run
    _run_main(main, args)
  File "/usr/local/lib/python3.6/dist-packages/absl/app.py", line 250, in _run_main
    sys.exit(main(argv))
  File "/usr/local/lib/python3.6/dist-packages/tensorboard/program.py", line 289, in main
    return runner(self.flags) or 0
  File "/usr/local/lib/python3.6/dist-packages/tensorboard/uploader/uploader_main.py", line 633, in run
    return _run(flags)
  File "/usr/local/lib/python3.6/dist-packages/tensorboard/uploader/uploader_main.py", line 120, in _run
    _prompt_for_user_ack(intent)
  File "/usr/local/lib/python3.6/dist-packages/tensorboard/uploader/uploader_main.py", line 76, in _prompt_for_user_ack
    response = six.moves.input("Continue? (yes/NO) ")
EOFError: EOF when reading a line
Continue? (yes/NO) 
##[error]Process completed with exit code 1.

I tried passing in a "yes" with

echo "yes" | tensorboard dev upload --logdir logs but got the same error. Does this make any sense given the setup of the runner?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions