Skip to content

issues Search Results · repo:apple/axlearn language:Python

Filter by

42 results
 (589 ms)

42 results

inapple/axlearn (press backspace or delete to remove)

Hi, I have a question regarding the dataset processing workflow. Currently, I know that in this codebase we shard by using TFDS shard options directly on single data files (potentially very large datasets). ...
  • carlesoctav
  • Opened 
    on Aug 5
  • #1329

Merged my working branch orbax-fuji-v2 with latest main and then started seeing this error. Posting here for now in case someone hit it and knows. I will revert back merge with main for now and investigate ...
stale
  • samos123
  • 3
  • Opened 
    on Aug 3
  • #1328

The output-uploader containers linger for 1 minute or so when the main container exits voluntarily. The 1 minute duration is likely a result of this code. It appears to me output-uploader should stop ...
stale
  • linxiulei
  • Opened 
    on Aug 1
  • #1327

Config: fuji-8B-v3-tiktoken-flash Getting the following error when running axlearn with following command command: python3 -m axlearn.cloud.gcp.jobs.launch run --nodry_run --instance_type=tpu-v6e-16 ...
stale
  • samos123
  • 2
  • Opened 
    on Jul 28
  • #1322

Related issue https://github.com/apple/axlearn/issues/892. I am trying to install the dependencies of axlearn repository on M1 air arm64 (macOS sequoia 15.5). I am following the setup instructions at ...
  • NarasimhanKovalai
  • 1
  • Opened 
    on Jul 26
  • #1319

The WandBWriter needs to be invoked only on the orchestrator process. To achieve this, the current implementation utilizes a processor_zero_only decorator, which determines if we re on the correct process ...
stale
  • dmarx
  • Opened 
    on Jul 8
  • #1287

Train a model, store the checkpoints, then load the checkpoints for inference using pathways and non-pathways. Relevant code: https://github.com/apple/axlearn/blob/8d4dedf352860795663dc4e34de435fd62dd49c8/axlearn/common/inference.py#L146 ...
stale
  • samos123
  • Opened 
    on May 20
  • #1192

I encountered the following error when trying to run the Fuji 3B model locally: Traceback (most recent call last): File fuji.py , line 339, in module run_main() File fuji.py , line 219, in ...
stale
  • yhtang
  • Opened 
    on May 2
  • #1151
Issue origami icon

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues
ProTip! 
Restrict your search to the title by using the in:title qualifier.
Issue origami icon

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues
ProTip! 
Restrict your search to the title by using the in:title qualifier.
Issue search results · GitHub