-
Notifications
You must be signed in to change notification settings - Fork 77
Description
@jcace reported this issue in #boost-help in filecoin slack.
Summary
Retrieval attempts to Boost leveraging lotus client retrieve will sometimes hang indefinitely or until the client times out. This may require a restart of the process to get it unstuck, but the issue can recur after restarts.
Initial reported versions:
lotus version 1.19.0+mainnet+git.64059ca87.dirty
boostd version 1.5.0+git.dbedffb
Issue is still being seen on 1.5.1-rc3
Note from the user
I have a very strange retrieval issue. Consider these two different retrieval commands:
a) lotus client retrieve --provider f01963614 QmfB9Sz8UYnTd8o8BVHrmGcrRcqDgu8jM8vFXUnFsTGbGE --car
b) lotus client retrieve --provider f01886797 QmaQpw28KgCjS8PRCDwSmn5jaoScYN4RYpqWxERtmqEsnE --car
a) works perfect, transfer starts right away, but b) just hangs at PaymentChannelSkip (Ongoing) and never progresses.
It will eventually time out.
What’s strange about this is I control both SP nodes. In fact, they are both physically
running on the same hardware. They are on the same internet connection.
They are storing files on the same physical disks.
Both CIDs are in a sector with an unsealed copy, and I’ve verified that it exists.
The only difference is that node b) currently has all my sealing workers connected
to it, whereas node a) does not - thus, unsealing is not possible. However, looking into the code
it would seem that the IsUnsealed
function should be able to find the sector just fine. The lotus-miner instance has the storage directly attached to it in both instances.
Boost/Miner configs between the two nodes are almost identical - did not modify anything to do with retrievals.
What could possibly cause this?
References:
Metadata
Metadata
Assignees
Labels
Type
Projects
Status