issues Search Results · repo:Lightning-AI/litgpt language:Python

846 results (66 ms)

Bug description Multiple redundant calls to generate_example() when using multiple GPUs Issue Description When training with multiple devices using Fabric, the generate_example() function is redundantly ...
bug
  • TheLukaDragar
  • Opened yesterday
  • #1957

Bug description When attempting to download Falcon3-1B-Base I get the below error, which is generated because the download package only includes a single model.safetensors file but the model.safetensors.index.json ...
bug
  • emunsing
  • 1
  • Opened 2 days ago
  • #1954

Bug description Hi! I just want to try litgpt and start with simple inference using the docs found here. So I start with a simple chat: litgpt chat microsoft/phi-2 And at first char not english language ...
bug
  • navr32
  • Opened 3 days ago
  • #1953

Bug description I am training watt 8b for a task. My training set is 11,000+ records. At first, I tried the instruction, input, output format, but I noticed during the validation step that it didn ...
bug
  • HarrisonUnifyAI
  • Opened 6 days ago
  • #1951

LitGPT is lit! Thank you very much for your great tool and your support! Is there a way to get the probability distribution of the next word in case of a completion? For instance, I would input Hello, ...
question
  • SeriousJ55
  • Opened 6 days ago
  • #1950

Phi-4-mini-instruct is out. Is there an ETA for supporting this model?
enhancement
  • andrea-tomassi
  • 3
  • Opened 11 days ago
  • #1948

https://github.com/Lightning-AI/litgpt/blob/3b7f5bb624e59b2eb7b57d5334a272a4506f03e2/litgpt/config.py#L752C9-L752C23 In the config here, the value of n_query_groups=16 for Llama-405B. However, the config.json ...
  • prajwal1210
  • Opened 17 days ago
  • #1942

Hello, I have a Grace Hopper Chip, and I would like to install and run LitGPT. I was planning to use this dockerfile to build the container: FROM nvcr.io/nvidia/pytorch:24.10-py3 ENV DEBIAN_FRONTEND=noninteractive ...
question
  • TommasoBendinelli
  • Opened 19 days ago
  • #1941

I performed lora finetuning on Qwen2.5-3B-Instruct model on my custom dataset, and it seems that when I generate content using the litgpt chat cli interface the finetuned model performs flawlessly. However, ...
question
  • asmitnayak
  • Opened 23 days ago
  • #1937

Hi, I pretrained the Qwen 2.5 0.5B base model with a single layer (on purpose). When I chat with the model, it works. However, when I try to evaluate the model, it fails: litgpt evaluate \ --tasks leaderboard ...
question
  • mtasic85
  • 3
  • Opened 27 days ago
  • #1936