Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

exception after few hours of training with villain #1382

Closed
eassa opened this issue Apr 8, 2024 · 2 comments
Closed

exception after few hours of training with villain #1382

eassa opened this issue Apr 8, 2024 · 2 comments

Comments

@eassa
Copy link

eassa commented Apr 8, 2024

i am using ASUS TUF Gaming Radeon™ RX 7900 XT OC Edition 20GB GDDR6
windows 10
i followed these instruction
https://forum.faceswap.dev/app.php/faqpage?sid=47859b5acaac6c66cf49a85c70d6b1bd#f1r1
https://forum.faceswap.dev/viewtopic.php?t=20

DirectML installation

while training in villain with batch size of 20 , i am getting this error after few hours of training
, i have been getting this error multiple time already :

2024-04-08 03:15:23.769735: F tensorflow/c/logging.cc:43] HRESULT failed with 0x887a0005: chunk->resource->Map(0, nullptr, &upload_heap_data)
2024-04-08 03:15:23.769986: F tensorflow/c/logging.cc:43] HRESULT failed with 0x887

@eassa
Copy link
Author

eassa commented Apr 8, 2024

i always get the exception noted in the issue , but this time after 10 hours of training i got this exception as well
2024-04-08 13:32:47.839589: F tensorflow/c/logging.cc:43] HRESULT failed with 0x887a0001: dml_device_->GetDeviceRemovedReason()

@torzdf
Copy link
Collaborator

torzdf commented Apr 8, 2024

Unfortunately this issue is upstream from us and comes from a timeout within DirectML. See below for more information and potential mitigation steps:

https://forum.faceswap.dev/viewtopic.php?t=2567

@torzdf torzdf closed this as completed Apr 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants