New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
wandb sync not logging in while running wandb local #1239
Comments
Issue-Label Bot is automatically applying the label Links: app homepage, dashboard and code for this bot. |
@aclifton314 what email address did you use for your account? Can you try logging in to the local instance from an incognito window? |
I used the same email address as in #1222. I'm not sure if that would be causing a conflict. Either way it is a gmail address. I used firefox to open a private window (not sure if that is the same as incognito) and went to the local instance. I was taken to a page that says "Developer tools for deep learning" and clicked the login button. I get prompted to enter my email and password, and that sends me to my dashboard. I then went to the command line and had the following error:
|
Does your api key start with |
Yes, the api key starts with
I am indeed using wandb 0.9.7 |
@aclifton314 I'm at a loss for what's happening. You either have an incorrect api_key configured, or the entity / project you're trying to log to doesn't belong to the user you created. You can also try running your script with an api key set in your environment. Goto http://localhost:8080/settings and copy your api key. Then run your script with:
|
@vanpelt The dryrun I created was actually created on another computing cluster that has wandb but not docker. I wasn't able to view the wandb results because it doesn't have docker installed. So I moved the dryrun folder to my local machine to view the results. Do you think this could have something to do with it? The fact that the dryrun was created on one machine (with one certain api key) and I'm trying to view it on a different machine (with a different api key)? |
It could be looking at the |
It just says If that isn't helpful, is there a way to:
|
The only files wandb writes are
|
Ok. Here is what is in my
Also, here is the result of the python console commands: user@pop-os:~$ python
Python 3.7.6 (default, Jan 8 2020, 19:59:22)
[GCC 7.3.0] :: Anaconda, Inc. on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import wandb
>>> wandb.init(project='local_test')
Retry attempt failed:
Traceback (most recent call last):
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/connection.py", line 157, in _new_conn
(self._dns_host, self.port), self.timeout, **extra_kw
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/util/connection.py", line 84, in create_connection
raise err
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/util/connection.py", line 74, in create_connection
sock.connect(sa)
ConnectionRefusedError: [Errno 111] Connection refused
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/connectionpool.py", line 672, in urlopen
chunked=chunked,
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/connectionpool.py", line 387, in _make_request
conn.request(method, url, **httplib_request_kw)
File "/home/user/anaconda3/lib/python3.7/http/client.py", line 1252, in request
self._send_request(method, url, body, headers, encode_chunked)
File "/home/user/anaconda3/lib/python3.7/http/client.py", line 1298, in _send_request
self.endheaders(body, encode_chunked=encode_chunked)
File "/home/user/anaconda3/lib/python3.7/http/client.py", line 1247, in endheaders
self._send_output(message_body, encode_chunked=encode_chunked)
File "/home/user/anaconda3/lib/python3.7/http/client.py", line 1026, in _send_output
self.send(msg)
File "/home/user/anaconda3/lib/python3.7/http/client.py", line 966, in send
self.connect()
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/connection.py", line 184, in connect
conn = self._new_conn()
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/connection.py", line 169, in _new_conn
self, "Failed to establish a new connection: %s" % e
urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7f2b4e2b2b50>: Failed to establish a new connection: [Errno 111] Connection refused
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/user/anaconda3/lib/python3.7/site-packages/requests/adapters.py", line 449, in send
timeout=timeout
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/connectionpool.py", line 720, in urlopen
method, url, error=e, _pool=self, _stacktrace=sys.exc_info()[2]
File "/home/user/anaconda3/lib/python3.7/site-packages/urllib3/util/retry.py", line 436, in increment
raise MaxRetryError(_pool, url, error or ResponseError(cause))
urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='localhost', port=8080): Max retries exceeded with url: /graphql (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f2b4e2b2b50>: Failed to establish a new connection: [Errno 111] Connection refused'))
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/user/anaconda3/lib/python3.7/site-packages/wandb/retry.py", line 95, in __call__
result = self._call_fn(*args, **kwargs)
File "/home/user/anaconda3/lib/python3.7/site-packages/wandb/apis/internal.py", line 108, in execute
return self.client.execute(*args, **kwargs)
File "/home/user/anaconda3/lib/python3.7/site-packages/gql/client.py", line 52, in execute
result = self._get_result(document, *args, **kwargs)
File "/home/user/anaconda3/lib/python3.7/site-packages/gql/client.py", line 60, in _get_result
return self.transport.execute(document, *args, **kwargs)
File "/home/user/anaconda3/lib/python3.7/site-packages/gql/transport/requests.py", line 38, in execute
request = requests.post(self.url, **post_args)
File "/home/user/anaconda3/lib/python3.7/site-packages/requests/api.py", line 116, in post
return request('post', url, data=data, json=json, **kwargs)
File "/home/user/anaconda3/lib/python3.7/site-packages/requests/api.py", line 60, in request
return session.request(method=method, url=url, **kwargs)
File "/home/user/anaconda3/lib/python3.7/site-packages/requests/sessions.py", line 533, in request
resp = self.send(prep, **send_kwargs)
File "/home/user/anaconda3/lib/python3.7/site-packages/requests/sessions.py", line 646, in send
r = adapter.send(request, **kwargs)
File "/home/user/anaconda3/lib/python3.7/site-packages/requests/adapters.py", line 516, in send
raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPConnectionPool(host='localhost', port=8080): Max retries exceeded with url: /graphql (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f2b4e2b2b50>: Failed to establish a new connection: [Errno 111] Connection refused'))
wandb: Network error (ConnectionError), entering retry loop. See /tmp/wandb-debug.log for full traceback.
wandb: Tracking run with wandb version 0.9.7
wandb: Network error (ConnectionError), entering retry loop. See /home/user/wandb/debug.log for full traceback.
wandb: Run data is saved locally in wandb/run-20200917_205542-3vven7ou
wandb: Network error (ConnectionError), entering retry loop. See /home/user/wandb/debug.log for full traceback.
W&B Error: Can't connect to network to query entity from API key
>>> wandb: Network error (ConnectionError), entering retry loop. See /home/user/wandb/debug.log for full traceback.
wandb: Ctrl-c pressed. |
That error means we can't connect to |
On the same machine that has the container running, I did the following:
and was able to access |
From the same terminal you were running your python session in, can you run |
@vanpelt As always, thank you for your patience and cooperation working through this. Here is the output of that curl command:
|
It looks like you can't connect to localhost due to firewall or networking configuration issues. If you can connect via your browser it may be that the browser has a proper proxy configuration that allows it. You can use the HTTP_PROXY variable if you do indeed need to connect to localhost through a proxy, but that's unlikely. |
I have the HTTP_PROXY (and http_proxy) env variable set in my bashrc. Maybe it's not being read properly? I spoke with someone on my team familiar with docker and he suggested setting the |
You can run the docker command manually instead of using
|
Does |
Nope, I believe the "host" network is the default so I doubt this will fix it. The mystery is how can you're browser connect to http://localhost:8080 but your shell can't. |
Made some progress, but I don't know how, hahaha.
I'm not exactly sure how it ended up working, but it seems to be working fine now. Is there anything I can provide you that might help if this is a bug? |
Thanks goodness! Looks like it was actually the |
Just curious, does that |
@vanpelt ? |
This issue is stale because it has been open 60 days with no activity. |
I keep getting this error when I run this:
And when I run:
I am able to login in to wandb portal but model training is not synced. Kindly help me with this. |
We'll need the debug bundle. You can access it at http://13.92.184.80/system-admin from the menu in the upper right corner. You can email this to vanpelt@wandb.com |
This issue is stale because it has been open 60 days with no activity. |
For the record - I ran into this exact same issue because I accidentally ran wandb from a different conda environment than the one it was originally set up in. |
Closing this ticket for the lack of activity. |
System Info
wandb: 0.9.7
python: 3.7.6
OS: Pop!_OS 20.04 LTS
Description
I installed wandb 0.9.7, ran
wandb local
, navigated to http://localhost:8080, the webpage gave me a "Application Error" page. I clicked the refresh button provided on the page, created an account, it then asked me to change my password. When I entered in my new password, the page reloads and asks me repeatedly to change the password. However, if I click the icon in the upper right I can get to the profile page.I tried to run
wandb sync MY_DRYRUN
and was asked to runwandb login
. I ranwandb login
and a webpage briefly appeared with the local API key, but then quickly switched to the Change Password prompt. However, I was able to get the local API key from when I clicked on the icon in the upper right corner before.I paste the local API key into the command line and login successfully. I try to run the
wandb sync
command again and get the following:I'm not sure if this workaround has messed something up, but I cannot sync using
wandb local
. I know this is similar to #1222. The main difference is that now I am behind a company proxy. I did some work to try and fix that fordocker
and am able to rundocker run hello-world
successfully.Any thoughts about what might be going on?
The text was updated successfully, but these errors were encountered: