- Configuring hyperparameters through the CLI (e.g. using a YAML/JSON config) [I like Makefiles]; see the config-loading sketch after this list
- A simple solution (can be any free library or service) for hyperparameter management and tracking
- Storing and visualizing the training loss
- A simple solution for profiling the training performance to identify bottlenecks in the model configuration; see the profiler sketch after this list
- Include as many training best practices as you know of to ensure the fastest possible performance (e.g. half precision, ...)
- Extension of this training function so that it scales to a multi-GPU or multi-node setting.
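
For the first point, a minimal sketch of loading hyperparameters from a YAML file passed on the CLI (the `config.yaml` filename and the parameter names are assumptions for illustration, not what main.py necessarily uses):

```python
# hparams_sketch.py -- hypothetical sketch: read hyperparameters from a YAML config given on the CLI
import argparse

import yaml


def parse_config() -> dict:
    parser = argparse.ArgumentParser(description="Training entry point")
    parser.add_argument("--config", type=str, default="config.yaml",
                        help="Path to a YAML file with hyperparameters")
    # CLI overrides for the most common knobs
    parser.add_argument("--lr", type=float, default=None)
    parser.add_argument("--batch-size", type=int, default=None)
    args = parser.parse_args()

    with open(args.config) as f:
        cfg = yaml.safe_load(f)

    # CLI values take precedence over the YAML defaults
    if args.lr is not None:
        cfg["lr"] = args.lr
    if args.batch_size is not None:
        cfg["batch_size"] = args.batch_size
    return cfg


if __name__ == "__main__":
    print(parse_config())
```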
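
For the profiling point, Lightning ships built-in profilers that report per-hook timings; a minimal sketch (the model/datamodule names are placeholders, not the actual classes in this repo):

```python
# profiler_sketch.py -- hypothetical sketch of profiling training with Lightning's SimpleProfiler
import lightning.pytorch as pl
from lightning.pytorch.profilers import SimpleProfiler

# Writes a per-hook timing report to ./perf_report.txt after fit() finishes
trainer = pl.Trainer(
    max_epochs=1,
    profiler=SimpleProfiler(dirpath=".", filename="perf_report"),
)
# MyModel and MyDataModule are placeholders for the repo's LightningModule / DataModule:
# trainer.fit(MyModel(), datamodule=MyDataModule())
```

For deeper, kernel-level analysis, `profiler="advanced"` or Lightning's `PyTorchProfiler` can be swapped in the same way.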
Run with `uv run main.py`,
or with `make experiment1` to keep track of experiments there (this makes handling artefacts, names, etc. easier later).
See the metrics with `uv run mlflow ui`.
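
For reference, a minimal sketch of how the training loss ends up in MLflow via Lightning's `MLFlowLogger` (the experiment name and tracking URI here are assumptions, not necessarily what main.py uses):

```python
# mlflow_logging_sketch.py -- hypothetical sketch of MLflow tracking with Lightning
import lightning.pytorch as pl
from lightning.pytorch.loggers import MLFlowLogger

# Params and metrics go to a local ./mlruns store, viewable with `uv run mlflow ui`
mlf_logger = MLFlowLogger(experiment_name="experiment1", tracking_uri="file:./mlruns")
trainer = pl.Trainer(max_epochs=1, logger=mlf_logger)

# Inside the LightningModule, the training loss would be recorded with:
#   self.log("train_loss", loss, on_step=True, on_epoch=True)
# trainer.fit(model, datamodule=dm)
```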
Mixed precision works out of the box with Lightning (uncomment it in main.py; easier than passing it as an argument), as does batch size tuning. See more at https://lightning.ai/docs/pytorch/stable/levels/intermediate_level_11.html
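
A minimal sketch of what those options boil down to (placeholder model/datamodule names; the exact flags in main.py may differ):

```python
# precision_and_tuning_sketch.py -- hypothetical sketch of mixed precision and batch size tuning
import lightning.pytorch as pl
from lightning.pytorch.tuner import Tuner

trainer = pl.Trainer(
    max_epochs=1,
    precision="16-mixed",  # automatic mixed precision (fp16) on supported GPUs
)

# Tuner.scale_batch_size searches for the largest batch size that fits in memory;
# it expects the LightningModule (or its datamodule) to expose a `batch_size` attribute.
# tuner = Tuner(trainer)
# tuner.scale_batch_size(model, mode="power")
# trainer.fit(model, datamodule=dm)
```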
The same applies to distributed GPUs. I have used Horovod before, but without access to several GPUs I cannot test it here. I have not worked with DeepSpeed before; I wanted to get it running, but again could not test it without multiple GPUs (https://lightning.ai/docs/pytorch/stable/advanced/model_parallel/deepspeed.html). I also tried to play around with multi-CPU DeepSpeed, but could not get it to work. You can try it with `uv run deepspeed --bind_cores_to_rank --num_accelerators 4 --num_nodes=2 modules/main.py --deepspeed --deepspeed_config ds_config.json`
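
For completeness, a minimal (and, for the same reason, untested) sketch of how the Lightning Trainer would be switched to a multi-GPU or DeepSpeed strategy; the device and node counts are illustrative assumptions:

```python
# distributed_sketch.py -- hypothetical, untested sketch of multi-GPU / DeepSpeed strategies
import lightning.pytorch as pl

# Plain data-parallel (DDP) training across 4 GPUs on one node
ddp_trainer = pl.Trainer(accelerator="gpu", devices=4, strategy="ddp")

# DeepSpeed ZeRO stage 2 across 2 nodes with 4 GPUs each (requires the deepspeed package
# and an actual multi-GPU setup; constructing this on a CPU-only machine will fail)
ds_trainer = pl.Trainer(
    accelerator="gpu",
    devices=4,
    num_nodes=2,
    strategy="deepspeed_stage_2",
    precision="16-mixed",
)
# trainer.fit(model, datamodule=dm)
```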