Skip to content

Duplicated rows in actor states task when using CSV as storage #1104

Closed
@davidgasquez

Description

@davidgasquez

Describe the bug:

When running lily job run --storage CSV --task actor_states walk --from X --to Y, the actor_states.csv that is produced contains some duplicated rows.

Steps to Reproduce:

  1. Initialize Lily from any snapshot and run a walk covering 100 epochs storing the results in CSVs: lily job run --storage CSV --task actor_states walk --from X --to Y
  2. See duplicated lines with cat /tmp/actor_states.csv | sort | uniq -d

This is one of the dupicates I've found in case is helpful.

100300,bafy2bzacedwwkxgt3x4xo5bu2nxqmcc6b76qc3psoooszbu2qbyyqbzu3sxhi,bafkqaetgnfwc6mjpon2g64tbm5sw22lomvza,"{""Info"":{""/"":""bafy2bzacebnstv2ae4lufsgg65v4ezhzpqtcgyem2afswf76ix2ch4nj2hkyw""},""PreCommitDeposits"":""0"",""LockedFunds"":""0"",""VestingFunds"":{""/"":""bafy2bzacealbq6s7ptdud6gvpc2yv54opwotncjlqjxmzb2q2rnjxv753rwdc""},""InitialPledgeRequirement"":""0"",""PreCommittedSectors"":{""/"":""bafy2bzaceamp42wmmgr2g2ymg46euououzfyck7szknvfacqscohrvaikwfay""},""PreCommittedSectorsExpiry"":{""/"":""bafy2bzacedswlcz5ddgqnyo3sak3jmhmkxashisnlpq6ujgyhe4mlobzpnhs6""},""AllocatedSectors"":{""/"":""bafy2bzacea456askyutsf7uk4ta2q5aojrlcji4mhaqokbfalgvoq4ueeh4l2""},""Sectors"":{""/"":""bafy2bzacedswlcz5ddgqnyo3sak3jmhmkxashisnlpq6ujgyhe4mlobzpnhs6""},""ProvingPeriodStart"":100180,""CurrentDeadline"":2,""Deadlines"":{""/"":""bafy2bzaceakubxbluycf7ug7ve3o5yg2kwbhjaf65gtu5m4ethfrq2qxurtcy""},""EarlyTerminations"":[0]}"

Lily Version: v0.13.0

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

      Development

      Participants

      @davidgasquez

      Issue actions

        Duplicated rows in actor states task when using CSV as storage · Issue #1104 · filecoin-project/lily