@@ -96,28 +96,28 @@ This is very tentative.
9696
9797For more details, see [ CHANGES] ( CHANGES.md ) .
9898
99- * 0.5: stream -to-stream synchronization at the buffer level.
99+ * ** 0.5: Stream -to-stream synchronization at the buffer level.**
100100 * Support for CUDA events, and ` Condition ` -based events for CPU backends.
101101 * Overhaul of the backend interfaces, both user-facing but especially internal: full code sharing.
102102 * Automatic stream-to-stream synchronization on a per-tensor-node basis.
103103* ** 0.4.1 Half precision, mixed precision, CUDA virtual devices** (virtual devices renamed to streams in 0.4.2)
104104 * Half precision. Maybe improvements for mixed-precision computations.
105105 * Resolve remaining issues with the new scheduler.
106106 * Initial version of [ lib/nn_blocks.ml] ( lib/nn_blocks.ml ) .
107- * ** v0.4 merge buffers, C-syntax backend builder** : a significant refactoring of the API.
108- * ** v0.3 shape inference, jitted routines** : a major rewrite of the whole project.
107+ * ** v0.4 Merge buffers, C-syntax backend builder** : a significant refactoring of the API.
108+ * ** v0.3 Shape inference, jitted routines** : a major rewrite of the whole project.
109109 * ** v0.3.3** : continuous integration and opam release.
110110 * ** v0.3.2** : new shape inference feature: tracking leftmost axes -- complete inference for splicing, ellipsis-in-the-middle allowed in einsum notation.
111111 * ** v0.3.1** : sanitizing code inclusion (rootness checks).
112112 * ** v0.3.0** : declarative shape inference; replaced the session interface with a "jitted code routines" API. Cuda defunct.
113- * ** v0.2 inching toward GPU** :
113+ * ** v0.2 Inching toward GPU** :
114114 * ** v0.2.1 naive-cuda** : a Cuda backend where blocks and threads are exposed via dedicated axis types.
115115 * ** v0.2.0 stack-as-device** : treating the C function stack as the "device memory".
116116* ** v0.1 GCCJIT backend** :
117117 * ** v0.1.2** : multicore computations using a thread-local "task id" index.
118118 * ** v0.1.1** : inlining scalar constants, improved inlining for virtual nodes.
119119 * ** v0.1.0** : a ` Gccjit ` backend, single and double precision floats, code compiled as a monolithic update step function.
120- * ** v0.0 untagged ** : basic design around shape inference, high-level and low-level code representation. Now-abandoned Meta-OCaml and OCaml backends.
120+ * ** v0.0 Untagged ** : basic design around shape inference, high-level and low-level code representation. Now-abandoned Meta-OCaml and OCaml backends.
121121
122122## Why not just use [ OWL] ( https://ocaml.xyz/ ) ?
123123
0 commit comments