{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":58576874,"defaultBranch":"develop","name":"lbann","ownerLogin":"LLNL","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2016-05-11T20:04:20.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/5921419?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1715033044.0","currentOid":""},"activityList":{"items":[{"before":"0ddb93f0bc8a6c3fe547fdafc9d746cdaf5cfa3d","after":null,"ref":"refs/heads/ppad-fix","pushedAt":"2024-05-06T22:04:04.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"}},{"before":"eb84553f9c2a5e9de523d2f8f8760309fca750cc","after":"f9944787ed5fbb1a4499be4e14c332dd3b7cdbf0","ref":"refs/heads/develop","pushedAt":"2024-05-06T22:04:02.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Fix import issue in Periodic Padding test (#2446)","shortMessageHtmlLink":"Fix import issue in Periodic Padding test (#2446)"}},{"before":null,"after":"0ddb93f0bc8a6c3fe547fdafc9d746cdaf5cfa3d","ref":"refs/heads/ppad-fix","pushedAt":"2024-05-06T21:58:47.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Update test_unit_module_periodic_padding.py","shortMessageHtmlLink":"Update test_unit_module_periodic_padding.py"}},{"before":"3dd7b01c672fda113151de1f1cf099c2a8c760ad","after":"eb84553f9c2a5e9de523d2f8f8760309fca750cc","ref":"refs/heads/develop","pushedAt":"2024-05-06T21:30:27.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"benson31","name":"Tom Benson","path":"/benson31","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/30674819?s=80&v=4"},"commit":{"message":"'flux mini' -> 'flux' (#2445)","shortMessageHtmlLink":"'flux mini' -> 'flux' (#2445)"}},{"before":"7c6743fc50bc30e4e34f217b9d4c0e9f06713554","after":"3dd7b01c672fda113151de1f1cf099c2a8c760ad","ref":"refs/heads/develop","pushedAt":"2024-05-06T20:04:29.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"benson31","name":"Tom Benson","path":"/benson31","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/30674819?s=80&v=4"},"commit":{"message":"Accept any h2 version (#2444)\n\nAlternatively we could bump this to 0.4.0, check for either 0.3.0 or\r\n0.4.0, or do an extra explicit check for version >= 0.3.0 outside of\r\nthe find_package syntax.","shortMessageHtmlLink":"Accept any h2 version (#2444)"}},{"before":"b6188e31d6d7f65e2412e7d0b92c5056662a6ab7","after":"7c6743fc50bc30e4e34f217b9d4c0e9f06713554","ref":"refs/heads/develop","pushedAt":"2024-05-06T15:08:01.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"benson31","name":"Tom Benson","path":"/benson31","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/30674819?s=80&v=4"},"commit":{"message":"Fix existing-source builds of Clara (#2443)","shortMessageHtmlLink":"Fix existing-source builds of Clara (#2443)"}},{"before":"29b2dccb69ca17c53b48e5e054207d160bcd4dfd","after":"b6188e31d6d7f65e2412e7d0b92c5056662a6ab7","ref":"refs/heads/develop","pushedAt":"2024-05-01T18:55:12.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"fiedorowicz1","name":"Pier Fiedorowicz","path":"/fiedorowicz1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/117680821?s=80&v=4"},"commit":{"message":"Fixes In-Place Activation Reference Counting (#2442)\n\n* Fixes activation reference counting for in-place layers\r\n\r\n* Add in-place reference counting test","shortMessageHtmlLink":"Fixes In-Place Activation Reference Counting (#2442)"}},{"before":"7d1d17cf21f2b1ce5780ebea12e35d0e2e83a4e5","after":"29b2dccb69ca17c53b48e5e054207d160bcd4dfd","ref":"refs/heads/develop","pushedAt":"2024-04-29T21:39:27.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"benson31","name":"Tom Benson","path":"/benson31","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/30674819?s=80&v=4"},"commit":{"message":"Add NCCL to the superbuild (#2441)\n\n* Add NCCL to the superbuild\r\n\r\n* Update copyright year\r\n\r\n* Update Superbuild README","shortMessageHtmlLink":"Add NCCL to the superbuild (#2441)"}},{"before":"052c60241c92f66193bad66b4973eca2f1174a87","after":"7d1d17cf21f2b1ce5780ebea12e35d0e2e83a4e5","ref":"refs/heads/develop","pushedAt":"2024-04-29T21:14:31.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"bvanessen","name":"Brian Van Essen","path":"/bvanessen","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/6210171?s=80&v=4"},"commit":{"message":"Add aws-ofi-rccl plugin to the superbuild (#2440)\n\n* Add aws-ofi-rccl plugin to the superbuild\r\n\r\n* Minor adjustment to if/else blocks\r\n\r\n* Add aws-ofi-rccl build to the example rocm script\r\n\r\n* Add license statement to aws-ofi-rccl recipe\r\n\r\n* Remove a superfluous DEPENDS_ON","shortMessageHtmlLink":"Add aws-ofi-rccl plugin to the superbuild (#2440)"}},{"before":"b22d2e51e0100a83e4dc4c79001b631d04d757c2","after":"052c60241c92f66193bad66b4973eca2f1174a87","ref":"refs/heads/develop","pushedAt":"2024-04-18T18:41:33.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"fiedorowicz1","name":"Pier Fiedorowicz","path":"/fiedorowicz1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/117680821?s=80&v=4"},"commit":{"message":"Fix to PeriodicPadding to match pytorch circular padding (#2435)\n\n* Fix to PeriodicPadding to match pytorch circular padding\r\n\r\n* Minor changes to docstring and kwargs\r\n\r\nAs batch size is implicit, remove from docstring.\r\nRemove unused \"name\" kwarg in function.\r\n\r\n* Fix test by comparing against pytorch F.pad with mode=\"circular\"\r\n\r\n* Skip test if pytorch isn't available\r\n\r\n---------\r\n\r\nCo-authored-by: Pier Fiedorowicz ","shortMessageHtmlLink":"Fix to PeriodicPadding to match pytorch circular padding (#2435)"}},{"before":"1db91a2ba387c722c36cdc9a7f0ad325296c8965","after":"b22d2e51e0100a83e4dc4c79001b631d04d757c2","ref":"refs/heads/develop","pushedAt":"2024-04-17T16:05:33.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"fiedorowicz1","name":"Pier Fiedorowicz","path":"/fiedorowicz1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/117680821?s=80&v=4"},"commit":{"message":"Switch to hydrogen for MPI calls except for Spectrum MPI sends (#2436)","shortMessageHtmlLink":"Switch to hydrogen for MPI calls except for Spectrum MPI sends (#2436)"}},{"before":"811af60f3c210f94f474752003743302825ae6b8","after":"1db91a2ba387c722c36cdc9a7f0ad325296c8965","ref":"refs/heads/develop","pushedAt":"2024-04-04T01:41:10.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"fiedorowicz1","name":"Pier Fiedorowicz","path":"/fiedorowicz1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/117680821?s=80&v=4"},"commit":{"message":"Python Dataset Reader (#2414)\n\n* Add skeleton for new python data reader\r\n\r\n* Implement basic functionality\r\n\r\n* Fix initialization for distconv\r\n\r\n* Add support for labels\r\n\r\n* Add python library supporting classes\r\n\r\n* clang format\r\n\r\n* Raise exception if rank/io parts not set\r\n\r\n* Rename to python dataset\r\n\r\n* Add optional module dir argument to add to path\r\n\r\n* Add unit tests\r\n\r\n* Simplify naming\r\n\r\n* Add cosmoflow example and reader helper\r\n\r\n* Update release notes\r\n\r\n* Save dataset pickle in work dir\r\n\r\n* Overhaul new data reader to support prefetching multiple samples/batches\r\n\r\n* Fix worker index calculation\r\n\r\n* clang-format\r\n\r\n* Clarify proto comments\r\n\r\n* Throw error if file fails to open\r\n\r\n* Add docstrings and type hints\r\n\r\n* Update CosmoFlow example and enable parallel IO\r\n\r\n* Add basic sample size checking, remove label reconstruction, general clean up\r\n\r\n* Switch to multiprocessing pool\r\n\r\n* Implement response shuffling for distconv\r\n\r\n* fix typo\r\n\r\nCo-authored-by: Tal Ben-Nun \r\n\r\n---------\r\n\r\nCo-authored-by: Tal Ben-Nun ","shortMessageHtmlLink":"Python Dataset Reader (#2414)"}},{"before":"f3172acffa63f2a4a58f1d35b2d8bc6411863d96","after":"811af60f3c210f94f474752003743302825ae6b8","ref":"refs/heads/develop","pushedAt":"2024-03-21T18:44:49.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Implement multi-dimensional reduction and refactor cuTENSOR support (#2430)","shortMessageHtmlLink":"Implement multi-dimensional reduction and refactor cuTENSOR support (#…"}},{"before":"6011d0387b99fb6de9b12f464ff2aa2c102fc331","after":"f3172acffa63f2a4a58f1d35b2d8bc6411863d96","ref":"refs/heads/develop","pushedAt":"2024-02-22T00:25:12.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Support layer parallelism in transformer application (#2420)\n\nThis PR adds the capability to support layer parallelism in transformers, variable-length version of The Pile pretokenized dataset, updates to the LBANN graph visualizer script, and some minor tweaks to weights layer.","shortMessageHtmlLink":"Support layer parallelism in transformer application (#2420)"}},{"before":"2f691af1713981e43a376ea20433d6f14e148e14","after":"6011d0387b99fb6de9b12f464ff2aa2c102fc331","ref":"refs/heads/benchmarking","pushedAt":"2024-02-15T18:50:35.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"bvanessen","name":"Brian Van Essen","path":"/bvanessen","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/6210171?s=80&v=4"},"commit":{"message":"Fix CosmoFlow Double-Reading (#2425)\n\n* Fix double-read when not using datastore\r\n\r\n* clang-format","shortMessageHtmlLink":"Fix CosmoFlow Double-Reading (#2425)"}},{"before":"2f691af1713981e43a376ea20433d6f14e148e14","after":"6011d0387b99fb6de9b12f464ff2aa2c102fc331","ref":"refs/heads/develop","pushedAt":"2024-02-14T21:57:27.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"fiedorowicz1","name":"Pier Fiedorowicz","path":"/fiedorowicz1","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/117680821?s=80&v=4"},"commit":{"message":"Fix CosmoFlow Double-Reading (#2425)\n\n* Fix double-read when not using datastore\r\n\r\n* clang-format","shortMessageHtmlLink":"Fix CosmoFlow Double-Reading (#2425)"}},{"before":"c00fa685c9ba0bf3a7a2a6a1c3a4525bb86acdeb","after":"2f691af1713981e43a376ea20433d6f14e148e14","ref":"refs/heads/benchmarking","pushedAt":"2024-02-14T20:06:48.000Z","pushType":"push","commitsCount":34,"pusher":{"login":"bvanessen","name":"Brian Van Essen","path":"/bvanessen","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/6210171?s=80&v=4"},"commit":{"message":"FSDP: Enable limiting scope using trainer grid rows/columns (#2424)","shortMessageHtmlLink":"FSDP: Enable limiting scope using trainer grid rows/columns (#2424)"}},{"before":"3d611619007232f49da47e824a60fced5dc736d2","after":"2f691af1713981e43a376ea20433d6f14e148e14","ref":"refs/heads/develop","pushedAt":"2024-02-14T17:29:56.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"FSDP: Enable limiting scope using trainer grid rows/columns (#2424)","shortMessageHtmlLink":"FSDP: Enable limiting scope using trainer grid rows/columns (#2424)"}},{"before":"32a761bd11552478a1cdf6db3372d0aea4f0703b","after":"3d611619007232f49da47e824a60fced5dc736d2","ref":"refs/heads/develop","pushedAt":"2024-02-13T00:49:18.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"External metrics (#2412)\n\nThis PR adds the capability to run a Python script or executable during evaluation, in order to get external metrics.","shortMessageHtmlLink":"External metrics (#2412)"}},{"before":"3492fbde234e3fabb5457378851b0850c6f93ddb","after":"32a761bd11552478a1cdf6db3372d0aea4f0703b","ref":"refs/heads/develop","pushedAt":"2024-02-12T18:07:24.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Do not backpropagate through layers without gradient sources (#2423)","shortMessageHtmlLink":"Do not backpropagate through layers without gradient sources (#2423)"}},{"before":"835f79171e28ea350468272196e6edfd25a4840d","after":"3492fbde234e3fabb5457378851b0850c6f93ddb","ref":"refs/heads/develop","pushedAt":"2024-01-31T17:43:51.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"benson31","name":"Tom Benson","path":"/benson31","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/30674819?s=80&v=4"},"commit":{"message":"Fix typo in example superbuild script (#2422)\n\n* Fix typo in example superbuild script\r\n\r\n* Fix a bad variable name","shortMessageHtmlLink":"Fix typo in example superbuild script (#2422)"}},{"before":"4f62fab9682e43d04af76cfca96389f71d363963","after":"835f79171e28ea350468272196e6edfd25a4840d","ref":"refs/heads/develop","pushedAt":"2024-01-31T16:47:40.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"benson31","name":"Tom Benson","path":"/benson31","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/30674819?s=80&v=4"},"commit":{"message":"Restore the superbuild (#2421)\n\n* Restore the superbuild\r\n\r\n* Update copyright year","shortMessageHtmlLink":"Restore the superbuild (#2421)"}},{"before":"d7c5780cb30b7ff0be55071922527dd0aeaa3fd4","after":"4f62fab9682e43d04af76cfca96389f71d363963","ref":"refs/heads/develop","pushedAt":"2024-01-30T01:39:31.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"bvanessen","name":"Brian Van Essen","path":"/bvanessen","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/6210171?s=80&v=4"},"commit":{"message":"Cleaned up a few more instances of counters that should be converted to uint64_t in the data ingestion pipeline. (#2405)","shortMessageHtmlLink":"Cleaned up a few more instances of counters that should be converted …"}},{"before":"f9ea28b510efbe184217e09df782f57b1e486305","after":null,"ref":"refs/heads/cleanup-gradcheck","pushedAt":"2024-01-25T00:50:29.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"}},{"before":"cebacc2ea6beb21ee34c021c7641f31f3dea00ba","after":"d7c5780cb30b7ff0be55071922527dd0aeaa3fd4","ref":"refs/heads/develop","pushedAt":"2024-01-25T00:49:07.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Use model methods for gradient checking (#2374)","shortMessageHtmlLink":"Use model methods for gradient checking (#2374)"}},{"before":"46155282fa6efe0ddfcc7e4cc99070d63bc9a667","after":"f9ea28b510efbe184217e09df782f57b1e486305","ref":"refs/heads/cleanup-gradcheck","pushedAt":"2024-01-24T22:02:55.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Use model methods for gradient checking","shortMessageHtmlLink":"Use model methods for gradient checking"}},{"before":"275692006df1a16ee7a79897c52b13e85e189f75","after":"cebacc2ea6beb21ee34c021c7641f31f3dea00ba","ref":"refs/heads/develop","pushedAt":"2024-01-24T17:22:18.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"benson31","name":"Tom Benson","path":"/benson31","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/30674819?s=80&v=4"},"commit":{"message":"Initial layer parallelism (#2342)\n\n* Quick implementation of basic layer parallelism\r\n\r\n* Layer-parallel lenet! (probably remove before merge)\r\n\r\n* Address comments from review -- BROKEN\r\n\r\n* This just crashes, which is maybe better than a hang?\r\n\r\n* Revert lenet changes; add layer-parallel lenet driver\r\n\r\n* Fixes so layer-parallel lenet actually runs\r\n\r\n* Apply suggestions from code review\r\n\r\nCo-authored-by: Tal Ben-Nun \r\n\r\n* Address review concerns\r\n\r\n* Add comment to lenet_lp.py\r\n\r\n* Remove some questionable code\r\n\r\n* Remove obsolete method and fix incorrect callbacks\r\n\r\n* Fix misuse of rank API for matrix participation and avoid calling unnecessary collectives\r\n\r\n* clang-format\r\n\r\n* Fix a logic error in subgrid setup\r\n\r\n* Improve definition of is_participating\r\n\r\n* Remove incorrect early returns\r\n\r\n* Fix the grid tag logic\r\n\r\n* Fix is_participating for cross_grid_sum_slice\r\n\r\n---------\r\n\r\nCo-authored-by: Tal Ben-Nun \r\nCo-authored-by: Tal Ben-Nun ","shortMessageHtmlLink":"Initial layer parallelism (#2342)"}},{"before":"9c3343422ba40d1eba47a9e69e4cb83c493eadfc","after":"46155282fa6efe0ddfcc7e4cc99070d63bc9a667","ref":"refs/heads/cleanup-gradcheck","pushedAt":"2024-01-22T22:08:09.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Use model methods for gradient checking","shortMessageHtmlLink":"Use model methods for gradient checking"}},{"before":"196b0cb7fca800933c29c95ad421992c3c78fdf3","after":"275692006df1a16ee7a79897c52b13e85e189f75","ref":"refs/heads/develop","pushedAt":"2024-01-12T20:54:05.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"tbennun","name":"Tal Ben-Nun","path":"/tbennun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8348955?s=80&v=4"},"commit":{"message":"Add GitHub Actions action that builds LBANN (#2418)","shortMessageHtmlLink":"Add GitHub Actions action that builds LBANN (#2418)"}},{"before":"20dfbca2837ce83d7e9bd3de95b735e67ac2c9e5","after":"196b0cb7fca800933c29c95ad421992c3c78fdf3","ref":"refs/heads/develop","pushedAt":"2024-01-12T19:26:50.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"bvanessen","name":"Brian Van Essen","path":"/bvanessen","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/6210171?s=80&v=4"},"commit":{"message":"Removed the specific version of the PrgEnv module used on Cray systems. (#2416)\n\n* Removed the specific version of the PrgEnv module used on Cray systems.\r\n\r\n* Updated the version of cray-mpich being used.\r\n\r\n* Updated the cce version.\r\n\r\n* Fixed cray-mpich version.\r\n\r\n* Fixed version.","shortMessageHtmlLink":"Removed the specific version of the PrgEnv module used on Cray system…"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEQvpHQQA","startCursor":null,"endCursor":null}},"title":"Activity · LLNL/lbann"}