issues Search Results · repo:Lightning-AI/litgpt language:Python
Filter by
846 results
(66 ms)846 results
inLightning-AI/litgpt (press backspace or delete to remove)Bug description
Multiple redundant calls to generate_example() when using multiple GPUs
Issue Description
When training with multiple devices using Fabric, the generate_example() function is redundantly ...
bug
TheLukaDragar
- Opened yesterday
- #1957
Bug description
When attempting to download Falcon3-1B-Base I get the below error, which is generated because the download package only
includes a single model.safetensors file but the model.safetensors.index.json ...
bug
emunsing
- 1
- Opened 2 days ago
- #1954
Bug description
Hi ! I want just try litgpt and start to just do simple inference with doc found here.
So i start with simple chat :
litgpt chat microsoft/phi-2
And at first char not english language ...
bug
navr32
- Opened 3 days ago
- #1953
Bug description
I am training watt 8b for a task. My training set is 11,000+ records.
At first, I tried the instruction , input , output format, but I noticed during the validation step that it didn ...
bug
HarrisonUnifyAI
- Opened 6 days ago
- #1951
LitGPT is lit! Thank you very much for your great tool and your support!
Is there a way to get the probability distribution of the next word in case of a completion?
For instance, I would input Hello, ...
question
SeriousJ55
- Opened 6 days ago
- #1950
Phi-4-mini-instruct is out. Is there an ETA for supporting this model?
enhancement
andrea-tomassi
- 3
- Opened 11 days ago
- #1948
https://github.com/Lightning-AI/litgpt/blob/3b7f5bb624e59b2eb7b57d5334a272a4506f03e2/litgpt/config.py#L752C9-L752C23
In the config here, the value of n_query_groups=16 for Llama-405B. However, the config.json ...
prajwal1210
- Opened 17 days ago
- #1942
Hello, I have a Grace Hopper Chip, and I would like to install and run LitGPT. I was planning to use this dockerfile to
build the container:
FROM nvcr.io/nvidia/pytorch:24.10-py3
ENV DEBIAN_FRONTEND=noninteractive ...
question
TommasoBendinelli
- Opened 19 days ago
- #1941
I performed lora finetuning on Qwen2.5-3B-Instruct model on my custom dataset, and it seems that when I generate content
using the litgpt chat cli interface the finetuned model performs flawlessly. However, ...
question
asmitnayak
- Opened 23 days ago
- #1937
Hi,
I pretrained Qwen 2.5 0.5B base model with single layer (on purpose), when I chat with model it works.
However when I try to evaluate model it fails:
litgpt evaluate \
--tasks leaderboard ...
question
mtasic85
- 3
- Opened 27 days ago
- #1936

Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Restrict your search to the title by using the in:title qualifier.
Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Press the /
key to activate the search input again and adjust your query.