Resnet18 example doesn't run(Window 10) #1149

Closed
JonathanSum opened this issue Jul 2, 2021 · 4 comments

@JonathanSum
Contributor

JonathanSum commented Jul 2, 2021

$ curl http://127.0.0.1:8080/predictions/resnet-18 -T kitten.jpg
2021-07-02 20:05:53,575 [DEBUG] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerThread - System state is : WORKER_STARTED
2021-07-02 20:05:53,575 [DEBUG] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerThread - Backend worker monitoring thread interrupted or backend worker process died.
java.lang.InterruptedException
    at java.base/java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:1668)
    at java.base/java.util.concurrent.ArrayBlockingQueue.poll(ArrayBlockingQueue.java:435)
    at org.pytorch.serve.wlm.WorkerThread.run(WorkerThread.java:188)
    at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1130)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:630)
    at java.base/java.lang.Thread.run(Thread.java:832)
2021-07-02 20:05:53,576 [WARN ] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.BatchAggregator - Load model failed: resnet-18, error: Worker died.
2021-07-02 20:05:53,577 [DEBUG] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerThread - W-9000-resnet-18_1.0 State change WORKER_STARTED -> WORKER_STOPPED
2021-07-02 20:05:53,577 [WARN ] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerLifeCycle - terminateIOStreams() threadName=W-9000-resnet-18_1.0-stderr
2021-07-02 20:05:53,577 [WARN ] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerLifeCycle - terminateIOStreams() threadName=W-9000-resnet-18_1.0-stdout
2021-07-02 20:05:53,578 [INFO ] W-9000-resnet-18_1.0-stdout org.pytorch.serve.wlm.WorkerLifeCycle - Stopped Scanner - W-9000-resnet-18_1.0-stdout
2021-07-02 20:05:53,578 [INFO ] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerThread - Retry worker: 9000 in 1 seconds.
2021-07-02 20:05:53,579 [INFO ] W-9000-resnet-18_1.0-stderr org.pytorch.serve.wlm.WorkerLifeCycle - Stopped Scanner - W-9000-resnet-18_1.0-stderr
2021-07-02 20:05:55,284 [DEBUG] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerThread - W-9000-resnet-18_1.0 State change WORKER_STOPPED -> WORKER_STARTED
2021-07-02 20:05:55,284 [INFO ] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerThread - Connecting to: /127.0.0.1:9000
2021-07-02 20:05:55,431 [INFO ] nioEventLoopGroup-5-2 org.pytorch.serve.wlm.WorkerThread - 9000 Worker disconnected. WORKER_STARTED
2021-07-02 20:05:55,431 [DEBUG] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerThread - System state is : WORKER_STARTED
2021-07-02 20:05:55,431 [DEBUG] W-9000-resnet-18_1.0 org.pytorch.serve.wlm.WorkerThread - Backend worker monitoring thread interrupted or backend worker process died.
java.lang.InterruptedException
    at java.base/java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:1668)
    at java.base/java.util.concurrent.ArrayBlockingQueue.poll(ArrayBlockingQueue.java:435)
    at org.pytorch.serve.wlm.WorkerThread.run(WorkerThread.java:188)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1130)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:630)
    at java.base/java.lang.Thread.run(Thread.java:832)
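
For anyone skimming a long log like the one above, here is a minimal Python sketch that pulls out only the "Load model failed" warnings. The regular expression is an assumption based on the single warning line shown in this issue, not a documented TorchServe log format, and `find_load_failures` is a hypothetical helper name.

```python
import re

# Pattern modelled on the one "Load model failed" line in the log above.
# This is an assumption about the line layout, not an official format.
LOAD_FAILED = re.compile(
    r"^(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) "
    r"\[WARN \] \S+ \S+ - Load model failed: "
    r"(?P<model>[^,]+), error: (?P<error>.+)$"
)


def find_load_failures(log_text):
    """Return (timestamp, model, error) for each 'Load model failed' line."""
    failures = []
    for line in log_text.splitlines():
        match = LOAD_FAILED.match(line.strip())
        if match:
            failures.append(
                (match["timestamp"], match["model"], match["error"])
            )
    return failures


# Two lines copied from the log above: one warning, one unrelated INFO line.
sample = (
    "2021-07-02 20:05:53,576 [WARN ] W-9000-resnet-18_1.0 "
    "org.pytorch.serve.wlm.BatchAggregator - "
    "Load model failed: resnet-18, error: Worker died.\n"
    "2021-07-02 20:05:53,578 [INFO ] W-9000-resnet-18_1.0 "
    "org.pytorch.serve.wlm.WorkerThread - Retry worker: 9000 in 1 seconds."
)

print(find_load_failures(sample))
```

This only surfaces the frontend's summary of the failure ("Worker died."); it does not explain why the backend worker process exited.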

My environment:
Windows 10
pytorch 1.6.0
openjdk version "16.0.1" 2021-04-20

https://github.com/pytorch/serve/tree/master/examples/image_classifier/resnet_18
(image attached)

Issue 2: When the error logs above don't show up, the request just doesn't give any output at all. It is possible that the error above was only a file error, but JDK 16 is still not supported, so nothing was printed.

Update: I have downgraded to Java 15. It still doesn't work.

logs.zip
TorchServe Version is 0.4.0

@JonathanSum JonathanSum changed the title Resnet18 example doesn't run Resnet18 example doesn't run(JDK 16!) Jul 2, 2021
@JonathanSum JonathanSum changed the title Resnet18 example doesn't run(JDK 16!) Resnet18 example doesn't run(JDK 16) Jul 2, 2021
@JonathanSum JonathanSum changed the title Resnet18 example doesn't run(JDK 16) Resnet18 example doesn't run(Window 10) Jul 2, 2021
@msaroufim
Member

msaroufim commented Jul 6, 2021

Hi @JonathanSum, can you try Java 11? That's the one that's been most reliable for me.
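
Before retrying, it may help to confirm which Java version is actually picked up from the PATH, since a leftover JDK 15 or 16 install can shadow a newer install of Java 11. Below is a minimal Python sketch; `parse_java_major` and `installed_java_major` are hypothetical helper names, and the parsing assumes the usual `version "X.Y.Z"` banner like the one quoted in the environment above.

```python
import re
import subprocess


def parse_java_major(version_output):
    """Extract the major version from `java -version` output, or None."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if not match:
        return None
    major = int(match.group(1))
    # Legacy scheme: Java 8 and older report themselves as "1.8.0_...".
    if major == 1 and match.group(2):
        return int(match.group(2))
    return major


def installed_java_major():
    """Run `java -version`; return the major version, or None if not found."""
    try:
        result = subprocess.run(
            ["java", "-version"], capture_output=True, text=True
        )
    except FileNotFoundError:
        return None  # no `java` executable on the PATH
    # `java -version` writes its banner to stderr, not stdout.
    return parse_java_major(result.stderr or result.stdout)


# The banner reported in this issue's environment section.
print(parse_java_major('openjdk version "16.0.1" 2021-04-20'))
```

Calling `installed_java_major()` in the same shell that launches TorchServe shows which JDK that shell resolves; if it does not return 11, the PATH or `JAVA_HOME` is likely still pointing at the other install.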

@jonathan-conder-sm
Contributor

uh think you tagged the wrong person 😄

@JonathanSum
Contributor Author

(two images attached)

@JonathanSum
Contributor Author

JonathanSum commented Jul 7, 2021

Problem solved. I will continue with the React example.
