Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

training ends up disconnected. The systeem says that the training ha snot ended. #782

Open
1 task done
laulenne opened this issue Jul 27, 2024 · 7 comments
Open
1 task done
Labels
app Issue related to Ultralytics HUB App question A HUB question that does not involve a bug

Comments

@laulenne
Copy link

Search before asking

Question

I have just bought a license for Ultralytics hub hoping to train a model. But the training does not ends properly without any indication. It indicates disconnected. I cannot find any info what is going on. I tried several time with low epochs but it remains the same.

Additional

No response

@laulenne laulenne added the question A HUB question that does not involve a bug label Jul 27, 2024
@UltralyticsAssistant UltralyticsAssistant added the app Issue related to Ultralytics HUB App label Jul 27, 2024
Copy link

👋 Hello @laulenne, thank you for raising an issue about Ultralytics HUB 🚀! Please visit our HUB Docs to learn more:

  • Quickstart. Start training and deploying YOLO models with HUB in seconds.
  • Datasets: Preparing and Uploading. Learn how to prepare and upload your datasets to HUB in YOLO format.
  • Projects: Creating and Managing. Group your models into projects for improved organization.
  • Models: Training and Exporting. Train YOLOv5 and YOLOv8 models on your custom datasets and export them to various formats for deployment.
  • Integrations. Explore different integration options for your trained models, such as TensorFlow, ONNX, OpenVINO, CoreML, and PaddlePaddle.
  • Ultralytics HUB App. Learn about the Ultralytics App for iOS and Android, which allows you to run models directly on your mobile device.
    • iOS. Learn about YOLO CoreML models accelerated on Apple's Neural Engine on iPhones and iPads.
    • Android. Explore TFLite acceleration on mobile devices.
  • Inference API. Understand how to use the Inference API for running your trained models in the cloud to generate predictions.

If this is a 🐛 Bug Report, please provide screenshots and steps to reproduce your problem to help us get started working on a fix.

If this is a ❓ Question, please provide as much information as possible, including dataset, model, environment details etc. so that we might provide the most helpful response.

We try to respond to all issues as promptly as possible. Thank you for your patience!

@pderrenger
Copy link
Member

Hi there,

Thank you for reaching out and for your support of Ultralytics HUB! I'm sorry to hear that you're experiencing issues with your training sessions disconnecting. Let's work through this together.

First, please ensure that you are using the latest version of the Ultralytics HUB and related packages. Sometimes, updates include important bug fixes that might resolve your issue.

Here are a few steps you can follow to troubleshoot the problem:

  1. Check Internet Connection: Ensure that your internet connection is stable throughout the training process. An unstable connection can sometimes cause disconnections.

  2. Review Logs: Check the logs for any error messages or warnings that might give us more insight into why the training is disconnecting. You can find the logs in the HUB interface under the specific training session.

  3. Resource Allocation: Verify that your system has enough resources (CPU, GPU, RAM) allocated for the training process. Insufficient resources can sometimes cause unexpected disconnections.

  4. Try a Different Dataset: If possible, try training with a different dataset to see if the issue persists. This can help determine if the problem is dataset-specific.

If the issue continues, please provide more details such as:

  • The specific model and dataset you are using.
  • Any error messages or logs that you can share.
  • Your system specifications (CPU, GPU, RAM).

This information will help us diagnose the problem more effectively.

Thank you for your patience and cooperation. We look forward to resolving this issue for you! 😊

@laulenne
Copy link
Author

laulenne commented Jul 27, 2024 via email

@pderrenger
Copy link
Member

@laulenne hi Laurent,

Thank you for providing detailed information about the issue you're experiencing. I'm sorry to hear that the training process isn't completing as expected. Let's try to resolve this together.

Firstly, please ensure that you are using the latest version of the Ultralytics HUB. Sometimes, updates include important bug fixes that might resolve your issue.

Here are a few steps to help troubleshoot the problem:

  1. Check Training Logs: Navigate to the training logs in the HUB interface to see if there are any error messages or warnings that might indicate why the training did not complete successfully. This can provide valuable insights.

  2. Resource Allocation: Ensure that your system has sufficient resources (CPU, GPU, RAM) allocated for the training process. Insufficient resources can sometimes cause the training to stop prematurely.

  3. Re-run Training with Fewer Epochs: Try running the training for a smaller number of epochs (e.g., 10 or 20) to see if the issue persists. This can help determine if the problem is related to the duration of the training.

  4. Dataset Integrity: Verify that your dataset is correctly formatted and uploaded. Sometimes, issues with the dataset can cause the training to fail. You can refer to our Datasets: Preparing and Uploading guide for more information.

If the issue continues, please provide the following additional details:

  • Any specific error messages or logs you can share.
  • Your system specifications (CPU, GPU, RAM).
  • The specific model you are training and any custom configurations you have applied.

This information will help us diagnose the problem more effectively.

Thank you for your patience and cooperation. We look forward to resolving this issue for you! 😊

@sergiuwaxmann
Copy link
Member

@laulenne Paula's suggestions are valid and I suggest focusing your attention on the dataset structure (see related issue).

@laulenne
Copy link
Author

I did fix the dataset and th etraining is working fine but i cannot download the model.
The interface looks like this but no download.
image

@sergiuwaxmann
Copy link
Member

@laulenne Can you share you model ID with me so I can investigate this further? The model ID is in the model's page URL.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
app Issue related to Ultralytics HUB App question A HUB question that does not involve a bug
Projects
None yet
Development

No branches or pull requests

4 participants