Skip to content

Commit d8f264d

Browse files
committed
Prepare for release 0.5.1
1 parent 9ba7621 commit d8f264d

File tree

6 files changed

+12
-11
lines changed

6 files changed

+12
-11
lines changed

CHANGES.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
## [0.5.1] -- 2024-12-31
1+
## [0.5.1] -- 2025-01-01
22

33
## Added
44

README.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,6 @@ IMPORTANT: due to potential bugs, debug logging from CUDA in complex settings cu
6464

6565
This is very tentative.
6666

67-
* 0.5.1: Automatic synchronization for transfers between host and devices where unambiguous.
6867
* 0.5.2: Apple Metal backend.
6968
* 0.6: Replicate the scaffolding from [llm.c](https://github.com/karpathy/llm.c) for training GPT-2.
7069
* More of primitive numeric operations.
@@ -96,11 +95,13 @@ This is very tentative.
9695

9796
For more details, see [CHANGES](CHANGES.md).
9897

99-
* **0.5: Stream-to-stream synchronization at the buffer level.**
100-
* Support for CUDA events, and `Condition`-based events for CPU backends.
101-
* Overhaul of the backend interfaces, both user-facing but especially internal: full code sharing.
102-
* Automatic stream-to-stream synchronization on a per-tensor-node basis.
103-
* **0.4.1 Half precision, mixed precision, CUDA virtual devices** (virtual devices renamed to streams in 0.4.2)
98+
* **0.5: Synchronization and automation at the buffer level.**
99+
* **0.5.1: Automatic synchronization for transfers between host and devices.**
100+
* **0.5.0: Stream-to-stream synchronization at the buffer level.**
101+
* Support for CUDA events, and `Condition`-based events for CPU backends.
102+
* Overhaul of the backend interfaces, both user-facing but especially internal: full code sharing.
103+
* Automatic stream-to-stream synchronization on a per-tensor-node basis.
104+
* **0.4.1 Half precision, mixed precision, CUDA virtual devices** (virtual devices renamed to streams in 0.5.0)
104105
* Half precision. Maybe improvements for mixed-precision computations.
105106
* Resolve remaining issues with the new scheduler.
106107
* Initial version of [lib/nn_blocks.ml](lib/nn_blocks.ml).

arrayjit.opam

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# This file is generated by dune, edit dune-project instead
22
opam-version: "2.0"
3-
version: "0.5.0"
3+
version: "0.5.1"
44
synopsis:
55
"An array language compiler with multiple backends (CPU, CUDA), staged compilation"
66
description:

dune-project

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44

55
(name ocannl)
66

7-
(version 0.5.0)
7+
(version 0.5.1)
88

99
(generate_opam_files true)
1010

neural_nets_lib.opam

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# This file is generated by dune, edit dune-project instead
22
opam-version: "2.0"
3-
version: "0.5.0"
3+
version: "0.5.1"
44
synopsis:
55
"A from-scratch Deep Learning framework with an optimizing compiler, shape inference, concise syntax"
66
description:

ocannl_npy.opam

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# This file is generated by dune, edit dune-project instead
22
opam-version: "2.0"
3-
version: "0.5.0"
3+
version: "0.5.1"
44
synopsis: "Numpy file format support for ocaml"
55
maintainer: ["Lukasz Stafiniak <lukstafi@gmail.com>"]
66
authors: ["Laurent Mazare"]

0 commit comments

Comments
 (0)