{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":728553883,"defaultBranch":"main","name":"llm-finetune","ownerLogin":"truefoundry","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2023-12-07T07:30:29.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/93512441?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1718372996.0","currentOid":""},"activityList":{"items":[{"before":"0c9c01a5826750c3131ed28525db398cd5f171e6","after":"2071597e4cf532755523902656e9e6b1422603c7","ref":"refs/heads/main","pushedAt":"2024-06-18T19:38:10.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Update deepspeed","shortMessageHtmlLink":"Update deepspeed"}},{"before":"33bb49da47957dc678885c44c259fe09e5084d66","after":"0c9c01a5826750c3131ed28525db398cd5f171e6","ref":"refs/heads/main","pushedAt":"2024-06-14T13:48:18.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Fix multi gpu lora merging","shortMessageHtmlLink":"Fix multi gpu lora merging"}},{"before":"872152d44e719ec3b001d58ffcce1878177c1ede","after":"33bb49da47957dc678885c44c259fe09e5084d66","ref":"refs/heads/main","pushedAt":"2024-06-13T14:52:25.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Default to using tokenizer chat template if available","shortMessageHtmlLink":"Default to using tokenizer chat template if available"}},{"before":"063d82821186ce3a828c42d51856546d66d2fd77","after":"872152d44e719ec3b001d58ffcce1878177c1ede","ref":"refs/heads/main","pushedAt":"2024-06-12T18:26:27.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"pass the chat_template value correctly","shortMessageHtmlLink":"pass the chat_template value correctly"}},{"before":"08c075787b94501f84bbd8198d4e128de33a8bee","after":"fae430edb1b80821d3a07a6d8934d19be2ea8f1d","ref":"refs/heads/dependabot/pip/jupyter-server-proxy-4.2.0","pushedAt":"2024-06-12T15:34:08.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"dependabot[bot]","name":null,"path":"/apps/dependabot","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/29110?s=80&v=4"},"commit":{"message":"Bump jupyter-server-proxy from 4.1.1 to 4.2.0\n\nBumps [jupyter-server-proxy](https://github.com/jupyterhub/jupyter-server-proxy) from 4.1.1 to 4.2.0.\n- [Release notes](https://github.com/jupyterhub/jupyter-server-proxy/releases)\n- [Changelog](https://github.com/jupyterhub/jupyter-server-proxy/blob/main/RELEASE.md)\n- [Commits](https://github.com/jupyterhub/jupyter-server-proxy/compare/v4.1.1...v4.2.0)\n\n---\nupdated-dependencies:\n- dependency-name: jupyter-server-proxy\n dependency-type: direct:production\n...\n\nSigned-off-by: dependabot[bot] ","shortMessageHtmlLink":"Bump jupyter-server-proxy from 4.1.1 to 4.2.0"}},{"before":"6a1333769990ec08f633975bfc356cca46e961dd","after":"063d82821186ce3a828c42d51856546d66d2fd77","ref":"refs/heads/main","pushedAt":"2024-06-12T15:33:27.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Update notebook for auto flash attention detection","shortMessageHtmlLink":"Update notebook for auto flash attention detection"}},{"before":"25890d704327b81c3f4daa349612942de54fe3d7","after":null,"ref":"refs/heads/cj_fix_params_logging","pushedAt":"2024-06-12T15:24:24.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"}},{"before":"906dbf009c1fe75444541e6db30ddcf22614c946","after":null,"ref":"refs/heads/cj_20240603","pushedAt":"2024-06-12T15:24:23.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"}},{"before":"beffd7e37473cdbaa8617cdb14fdcdc000d0e0e6","after":"6a1333769990ec08f633975bfc356cca46e961dd","ref":"refs/heads/main","pushedAt":"2024-06-12T15:23:56.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Bump axolotl, data validation warnings, bf16, flash-attn only on sm_80+, highlight error","shortMessageHtmlLink":"Bump axolotl, data validation warnings, bf16, flash-attn only on sm_8…"}},{"before":"ae2c6300091f5f934df121fa08af42dd3b745a01","after":"906dbf009c1fe75444541e6db30ddcf22614c946","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-12T15:21:40.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Bump axolotl, data validation warnings, bf16, flash-attn only on sm_80+, highlight error","shortMessageHtmlLink":"Bump axolotl, data validation warnings, bf16, flash-attn only on sm_8…"}},{"before":null,"after":"08c075787b94501f84bbd8198d4e128de33a8bee","ref":"refs/heads/dependabot/pip/jupyter-server-proxy-4.2.0","pushedAt":"2024-06-11T21:13:24.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"dependabot[bot]","name":null,"path":"/apps/dependabot","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/29110?s=80&v=4"},"commit":{"message":"Bump jupyter-server-proxy from 4.1.1 to 4.2.0\n\nBumps [jupyter-server-proxy](https://github.com/jupyterhub/jupyter-server-proxy) from 4.1.1 to 4.2.0.\n- [Release notes](https://github.com/jupyterhub/jupyter-server-proxy/releases)\n- [Changelog](https://github.com/jupyterhub/jupyter-server-proxy/blob/main/RELEASE.md)\n- [Commits](https://github.com/jupyterhub/jupyter-server-proxy/compare/v4.1.1...v4.2.0)\n\n---\nupdated-dependencies:\n- dependency-name: jupyter-server-proxy\n dependency-type: direct:production\n...\n\nSigned-off-by: dependabot[bot] ","shortMessageHtmlLink":"Bump jupyter-server-proxy from 4.1.1 to 4.2.0"}},{"before":"91f7fbe769657af4046cb2efde0033ed6025e943","after":"beffd7e37473cdbaa8617cdb14fdcdc000d0e0e6","ref":"refs/heads/main","pushedAt":"2024-06-11T12:42:15.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Update docstring","shortMessageHtmlLink":"Update docstring"}},{"before":"028db8e9a7f906e14c7d939ba9198ff429d8ba11","after":"91f7fbe769657af4046cb2efde0033ed6025e943","ref":"refs/heads/main","pushedAt":"2024-06-07T10:42:31.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Update axolotl for newer architectures\n\n* Update requirements, cleanup README\r\n\r\n* Add unsloth options to config\r\n\r\n* wip - try chat template\r\n\r\n* Update requirements and try unsloth\r\n\r\n* disable unsloth, it does not play well with deepspeed\r\n\r\n* Cleanup unnecessary code\r\n\r\n* Remove import\r\n\r\n* Add chat template selection logic\r\n\r\n* more fixes\r\n\r\n* Revert truefoundry lib to stable version\r\n\r\n* Update config, readme, sample run","shortMessageHtmlLink":"Update axolotl for newer architectures"}},{"before":"c6179ebfd685efddf9685c45c7e98b1bc6155d41","after":"ae2c6300091f5f934df121fa08af42dd3b745a01","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-07T09:21:02.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Update config, readme, sample run","shortMessageHtmlLink":"Update config, readme, sample run"}},{"before":"2e14a2960caf63083491ff8a457ae18b634c7942","after":"c6179ebfd685efddf9685c45c7e98b1bc6155d41","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-07T05:12:43.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Revert truefoundry lib to stable version","shortMessageHtmlLink":"Revert truefoundry lib to stable version"}},{"before":"5fe9b067c1e7f62346c88a4b389c3119b5c0c645","after":"2e14a2960caf63083491ff8a457ae18b634c7942","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-05T14:46:31.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"more fixes","shortMessageHtmlLink":"more fixes"}},{"before":"3df0b3679659fabc2bef479f1b455ef272ad948b","after":"5fe9b067c1e7f62346c88a4b389c3119b5c0c645","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-05T13:25:15.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Add chat template selection logic","shortMessageHtmlLink":"Add chat template selection logic"}},{"before":"7b0589b4a39c748fcf0f3befd3c90a06a73ac52b","after":"3df0b3679659fabc2bef479f1b455ef272ad948b","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-05T12:17:16.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Cleanup unnecessary code","shortMessageHtmlLink":"Cleanup unnecessary code"}},{"before":"3920257a6f77ad5881db0106de78f8f2da027716","after":"7b0589b4a39c748fcf0f3befd3c90a06a73ac52b","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-05T12:14:14.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"disable unsloth, it does not play well with deepspeed","shortMessageHtmlLink":"disable unsloth, it does not play well with deepspeed"}},{"before":"d0403470076cfac5e85ca7c01629557ce31ea17e","after":"3920257a6f77ad5881db0106de78f8f2da027716","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-05T09:45:37.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Update requirements and try unsloth","shortMessageHtmlLink":"Update requirements and try unsloth"}},{"before":"ecf4a6ba7a6a4eaf6985106bab08c678700fc434","after":"d0403470076cfac5e85ca7c01629557ce31ea17e","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-04T23:45:05.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"wip - try chat template","shortMessageHtmlLink":"wip - try chat template"}},{"before":"251bcfba0b4825c8131e5015dea05701656548b6","after":"ecf4a6ba7a6a4eaf6985106bab08c678700fc434","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-04T20:21:11.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Add unsloth options to config","shortMessageHtmlLink":"Add unsloth options to config"}},{"before":null,"after":"251bcfba0b4825c8131e5015dea05701656548b6","ref":"refs/heads/cj_20240603","pushedAt":"2024-06-03T14:43:02.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Update requirements, cleanup README","shortMessageHtmlLink":"Update requirements, cleanup README"}},{"before":"5a5d7f1cb1fb3c05e074074b293c4db0aa687024","after":"028db8e9a7f906e14c7d939ba9198ff429d8ba11","ref":"refs/heads/main","pushedAt":"2024-05-03T06:00:28.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Remove optimizer form deepspeed config and let engine be destroyed within train at end","shortMessageHtmlLink":"Remove optimizer form deepspeed config and let engine be destroyed wi…"}},{"before":"bdc362f27a1f8932a12fe5e16b20f0ff3c20990d","after":"5a5d7f1cb1fb3c05e074074b293c4db0aa687024","ref":"refs/heads/main","pushedAt":"2024-05-02T16:10:05.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Bump axolotl version - removes sigint handler","shortMessageHtmlLink":"Bump axolotl version - removes sigint handler"}},{"before":"367620f8de363fe6fe5083874d8074ecd79f5339","after":"bdc362f27a1f8932a12fe5e16b20f0ff3c20990d","ref":"refs/heads/main","pushedAt":"2024-04-29T11:32:10.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Fix memory leaks, model merging on gpus, missing pad tokens","shortMessageHtmlLink":"Fix memory leaks, model merging on gpus, missing pad tokens"}},{"before":"7fd789fdf5ce0e5b080a162a73c3a800f17f9e20","after":null,"ref":"refs/tags/v0.1.13","pushedAt":"2024-03-22T05:31:33.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"}},{"before":"6c57622739cdbbc96ca046cac6d485393757a352","after":"367620f8de363fe6fe5083874d8074ecd79f5339","ref":"refs/heads/main","pushedAt":"2024-03-22T05:26:57.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Fix axolotl bug with LlamaRotaryEmbedding","shortMessageHtmlLink":"Fix axolotl bug with LlamaRotaryEmbedding"}},{"before":"6005f80cc9cfbaf54a93e3f6aba79a502ae105c8","after":null,"ref":"refs/heads/dependabot/pip/jupyter-server-proxy-4.1.1","pushedAt":"2024-03-20T21:12:00.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"dependabot[bot]","name":null,"path":"/apps/dependabot","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/29110?s=80&v=4"}},{"before":"7fd789fdf5ce0e5b080a162a73c3a800f17f9e20","after":"6c57622739cdbbc96ca046cac6d485393757a352","ref":"refs/heads/main","pushedAt":"2024-03-20T21:11:52.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"chiragjn","name":"Chirag Jain","path":"/chiragjn","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10295418?s=80&v=4"},"commit":{"message":"Bump jupyter-server-proxy from 4.1.0 to 4.1.1 (#10)\n\nBumps [jupyter-server-proxy](https://github.com/jupyterhub/jupyter-server-proxy) from 4.1.0 to 4.1.1.\r\n- [Release notes](https://github.com/jupyterhub/jupyter-server-proxy/releases)\r\n- [Changelog](https://github.com/jupyterhub/jupyter-server-proxy/blob/main/RELEASE.md)\r\n- [Commits](https://github.com/jupyterhub/jupyter-server-proxy/compare/v4.1.0...v4.1.1)\r\n\r\n---\r\nupdated-dependencies:\r\n- dependency-name: jupyter-server-proxy\r\n dependency-type: direct:production\r\n...\r\n\r\nSigned-off-by: dependabot[bot] \r\nCo-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>","shortMessageHtmlLink":"Bump jupyter-server-proxy from 4.1.0 to 4.1.1 (#10)"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEaNYm_wA","startCursor":null,"endCursor":null}},"title":"Activity · truefoundry/llm-finetune"}