Training hub not working #645

Closed
1 task done
ilanb opened this issue Apr 16, 2024 · 8 comments
Labels: bug (Something isn't working), fixed (Bug is resolved)

Comments

@ilanb

ilanb commented Apr 16, 2024

Search before asking

  • I have searched the HUB issues and found no similar bug report.

HUB Component

Training

Bug

I tried multiple times to start training, but nothing happens. After a few minutes the page reloads and the Start Training button appears again.

Preparing your cloud instance... Hold tight!
This could take up to 10 minutes. Thank you for waiting!

After some time the message disappears and nothing happens; the Start Training button simply appears again.

Environment

No response

Minimal Reproducible Example

No response

Additional

No response

@ilanb ilanb added the bug Something isn't working label Apr 16, 2024

👋 Hello @ilanb, thank you for raising an issue about Ultralytics HUB 🚀! Please visit our HUB Docs to learn more:

  • Quickstart. Start training and deploying YOLO models with HUB in seconds.
  • Datasets: Preparing and Uploading. Learn how to prepare and upload your datasets to HUB in YOLO format.
  • Projects: Creating and Managing. Group your models into projects for improved organization.
  • Models: Training and Exporting. Train YOLOv5 and YOLOv8 models on your custom datasets and export them to various formats for deployment.
  • Integrations. Explore different integration options for your trained models, such as TensorFlow, ONNX, OpenVINO, CoreML, and PaddlePaddle.
  • Ultralytics HUB App. Learn about the Ultralytics App for iOS and Android, which allows you to run models directly on your mobile device.
    • iOS. Learn about YOLO CoreML models accelerated on Apple's Neural Engine on iPhones and iPads.
    • Android. Explore TFLite acceleration on mobile devices.
  • Inference API. Understand how to use the Inference API for running your trained models in the cloud to generate predictions.

If this is a 🐛 Bug Report, please provide screenshots and steps to reproduce your problem to help us get started working on a fix.

If this is a ❓ Question, please provide as much information as possible, including dataset, model, environment details etc. so that we might provide the most helpful response.

We try to respond to all issues as promptly as possible. Thank you for your patience!

@ultralytics ultralytics deleted a comment from pderrenger Apr 17, 2024
@ilanb
Author

ilanb commented Apr 17, 2024 via email

@sergiuwaxmann
Member

Hello @ilanb!
Thank you for reaching out and bringing this to our attention.
The message you're encountering is indeed expected behavior as part of the initialization process for your Cloud Training instance. This process involves spinning up a dedicated instance equipped with GPU resources, which can sometimes take a while depending on the current demand and availability of GPU resources.

Could you try starting the process again and share the model ID with us so that we can analyze the logs of your Cloud Training instance? You can find the model ID in the URL of the model page.
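
As a rough illustration of reading the model ID from the model page URL, here is a minimal Python sketch. It assumes the URL has the form `https://hub.ultralytics.com/models/<MODEL_ID>`; that layout, the function name `extract_model_id`, and the sample ID are illustrative assumptions, not something confirmed in this thread.

```python
from urllib.parse import urlparse


def extract_model_id(model_url: str) -> str:
    """Return the path segment that follows 'models' in a model page URL.

    Assumption: the URL looks like https://hub.ultralytics.com/models/<MODEL_ID>.
    """
    # Split the URL path into non-empty segments; the query string is not part of .path.
    segments = [s for s in urlparse(model_url).path.split("/") if s]
    if "models" not in segments:
        raise ValueError(f"No 'models' segment found in URL: {model_url}")
    idx = segments.index("models")
    if idx + 1 >= len(segments):
        raise ValueError(f"No model ID found after 'models' in URL: {model_url}")
    return segments[idx + 1]


if __name__ == "__main__":
    # Hypothetical model ID, used only for demonstration.
    print(extract_model_id("https://hub.ultralytics.com/models/abc123"))
```

Copying the ID by hand from the browser address bar works just as well; the sketch only makes explicit which part of the URL is meant.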

@sergiuwaxmann sergiuwaxmann self-assigned this Apr 17, 2024
@ilanb
Author

ilanb commented Apr 17, 2024 via email

@sergiuwaxmann
Member

@ilanb Maybe the Roboflow dataset wasn't exported from Roboflow properly (issue on their end). If you try starting the Cloud Training again, I can monitor your instance and see if there are any errors.

@ilanb
Author

ilanb commented Apr 17, 2024 via email

@sergiuwaxmann
Member

@ilanb I just checked the logs on your instance and indeed, we have an issue in our logic when we download the Roboflow dataset. Please accept our apologies for the inconvenience caused. Our team is investigating this issue, and we'll update you as soon as we implement a solution.
We appreciate your patience and understanding.

@sergiuwaxmann
Member

Hello @ilanb!
We just released a new version that fixes the issue you had.
Once again, apologies for the inconvenience caused.

@sergiuwaxmann sergiuwaxmann added the fixed Bug is resolved label Apr 17, 2024

2 participants