Releases: BerkeleyLab/inference-engine

New JSON file format

14 Aug 04:15
3443f8d

The new file format includes

  • A file version indicator, currently named acceptable_engine_tag, denoting the git tag used to create the new format (see the sketch after this list),
  • Better nomenclature:
    • The minima and maxima fields are now intercept and slope, respectively, to better match their purpose: defining a linear map to and from the unit interval.
    • The encompassing inputs_range and outputs_range objects are now inputs_map and outputs_map. 🗺️
  • A fix for cloud_microphysics/setup.sh. ☁️
  • Other minor bug fixes and enhancements.
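
A hedged sketch of how one might check whether a given network file uses the new format, assuming the version-indicator key appears verbatim in the file; model.json is a hypothetical file name:

    # Hedged sketch: a new-format file should contain the version-indicator key.
    # model.json is a placeholder for any Inference-Engine network file.
    grep -q acceptable_engine_tag model.json && echo "new format" || echo "old format (pre-0.13.0)"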

What's Changed

  • doc(README): specify required gfortran versions by @rouson in #185
  • Enhance saturated mixing ratio example by @rouson in #189
  • Refactor tensor_map_m to improve nomenclature & move phase_space_bin_t to cloud-microphysics by @rouson in #192
  • Add git tag to JSON file to denote inference-engine version that reads and writes the format by @rouson in #194
  • Fixes #195 by @rouson in #196

Full Changelog: 0.12.0...0.13.0

Support LLVM Flang + add entropy-maximizing filter to speed convergence

21 Jul 06:00
7d564b8

This release adds a feature to speed convergence and support for a fourth compiler in addition to the GNU, NAG, & Intel compilers:

  1. All tests pass with the LLVM Flang (flang-new) compiler.
  2. The cloud-microphysics/app/train-cloud-microphysics.f90 program includes new options (see the invocation sketch after this list):
    • --bins filters the training data to maximize the Shannon entropy by selecting only one data point per bin in a five-dimensional phase space.
    • --report controls the frequency of writing JSON files to reduce file I/O costs.
  3. Eliminates several warning messages from the NAG compiler (nagfor).
  4. Switches a dependency from Sourcery to Julienne to eliminate the requirement for coarray feature support.
  5. Adds the GELU activation function.
  6. Speeds up the calculation of the data needed to construct histograms.
  7. Adds a new cloud-microphysics/train.sh script to manage the training process.
  8. Adds the ability to terminate a training run based on a cost-function tolerance rather than a fixed number of epochs.
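
A minimal sketch of how the new options might be combined with the trainer flags shown in the 0.10.0 notes below (--base, --epochs, --start); whether and how --bins and --report take values is an assumption, not verbatim usage:

    # Hedged sketch: flag names come from the list above; argument forms are assumed.
    ./build/run-fpm.sh run train-cloud-microphysics -- \
      --base training --epochs 10 --start 720 \
      --bins --report 10
    # --bins      selects one training point per bin of the 5D phase space (entropy filter)
    # --report 10 is an assumed interval controlling how often JSON files are written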

What's Changed

  • Remove second, unneeded and no longer supported build of gcc by @ktras in #150
  • build: update to sourcery 4.8.1 by @rouson in #151
  • doc(README): add instructions for auto offload by @rouson in #152
  • Work around ifx automatic-offloading bugs by @rouson in #145
  • Add bug workarounds for gfortran-14 associate-stmt bug by @ktras in #155
  • Switching from Sourcery to Julienne by @rouson in #154
  • Update fpm manifest with tag for v1.0 of dependency julienne by @ktras in #157
  • Support compilation with LLVM Flang by @ktras in #159
  • Update cloud-microphysics compiler and dependencies by @rouson in #160
  • Add GELU activation function by @rouson in #161
  • Feature: Faster histogram construction when the number of bins exceeds 80 by @rouson in #162
  • Read & perform inference on networks for which the hidden-layer width varies across layers by @rouson in #166
  • Fix/Feature(JSON): disambiguate tensor_range objects and allow flexible line-positioning of objects by @rouson in #165
  • Feature: Support JSON output for networks with varying-width hidden layers by @rouson in #167
  • Feature: filter training data for maximal information entropy via flat multidimensional output-tensor histograms by @rouson in #169
  • Features: maximize information entropy and variable reporting interval. by @rouson in #170
  • build: add single-file compile script by @rouson in #171
  • Add ubuntu to CI by @ktras in #156
  • Feature: add training script in cloud-microphysics/train.sh by @rouson in #172
  • feat(train.sh): graceful exits by @rouson in #173
  • refac(train): rm rendundant array allocations by @rouson in #174
  • feat(cloud-micro): write 1st/last cost, fewer JSON by @rouson in #175
  • feat(train.sh): add outer loop for refinement by @rouson in #176
  • feat(cloud-micro): terminate on cost-tolerance by @rouson in #177
  • Concurrent loop through each mini-batch during training by @rouson in #178
  • test(adam): reset iterations so all tests pass with flang-new by @rouson in #179
  • doc(README): add flags to optimize builds by @rouson in #180
  • fix(single-source): mv script outside fpm's purview by @rouson in #182
  • doc(README): optimize ifx builds by @rouson in #181
  • Eliminate compiler warnings by @rouson in #183
  • fix(single-file-source): respect file extension case by @rouson in #184

Full Changelog: 0.11.1...0.12.0

Selective test execution and compiler workarounds

26 Apr 00:39
007de65

New Feature

This release enables selecting a subset of tests to run based on a search for substrings contained in the test output.
All test output is of the form

<Subject>
   passes on <description 1>.
   FAILS on <description 2>.

where the subject describes what is being tested (e.g., A tensor_range_t object) and the description details how the subject is being tested (e.g., component-wise construction followed by conversion to and from JSON). The subject typically contains a type name such as tensor_range_t. The description typically does not contain a type name. Therefore, running the command

fpm test -- --contains tensor_range_t

will execute and report the outcome of all tests of the given subject, tensor_range_t, and only those tests. For test output similar to that shown above, this would display two test outcomes: one passing and one failing.

By contrast, running the command

fpm test -- --contains "component-wise construction"

would execute and report the outcome of the tests with descriptions containing component-wise construction for any subject.

This release also works around a few compiler bugs and reorders tests so that the fastest and most stable run first.

What's Changed

  • Work around ifx bug by @rouson in #142
  • Fix filename extension for file that has directives by @ktras in #143
  • feat(inference_engine_t): tensor_range_t getters (later removed) by @rouson in #147
  • Cray bug workarounds for compile time bugs by @ktras in #146
  • Feature: redesigned functionality for mapping input and output tensors to and from training ranges by @rouson in #148
  • Test reordering and selective test execution by @rouson in #149

Full Changelog: 0.11.0...0.11.1

Batch normalization, more concurrency, & NAG compiler support

04 Apr 21:36
f572e1d

This release adds

  • A tensor_range_t type that 🧑‍🌾 🌱
    • Encapsulates input and output tensor component minima and maxima,
    • Provides type-bound map_to_training_range and map_from_training_range procedures for mapping tensors to and from the unit interval [0,1], and 🤺
    • Provides a type-bound in_range procedure that users can employ to check whether inference input or output data involve extrapolation beyond the respective ranges employed in training.
  • BREAKING CHANGE: the network JSON file format has been updated to include input_range and output_range objects. As a result, this release may fail to read network files written by older versions of Inference-Engine, and older versions may fail to read files written by this release. 🚒 🗄️ 📂
  • Automatic use of the aforementioned mapping capability during inference. 🧭
  • Enhanced concurrency to improve performance: 🐎
    • Additional use of do concurrent in the training algorithm and 🚄 🚋
    • Enabling building with OpenMP in the setup.sh script. 🏗️ 👷‍♀️
  • Additional compiler support: this is the first release that builds with the NAG Fortran compiler (Build 7202 or later).

What's Changed

  • Simplify app: rm redundant procedures by @rouson in #102
  • Concurrent inference example by @rouson in #103
  • Exploit additional concurrency in the training algorithm by @rouson in #105
  • feat(example): add nested do-loop inferences by @rouson in #106
  • chore(examples): match program names to file names by @rouson in #109
  • feat(infer): allow non-type-bound invocations by @rouson in #110
  • doc(README): minimum gfortran version 13 by @rouson in #111
  • Add new fpm subproject icar-trainer by @ktras in #108
  • Enable OpenMP in setup script & work around related compiler bug by @rouson in #114
  • fix(run-fpm.sh): revert to copying header into build dir by @rouson in #115
  • Remove module keyword from abstract interface by @ktras in #116
  • Compute & output tensor histograms in columnar format & add gnuplot script by @rouson in #118
  • Bugfixes for nag by @ktras in #119
  • fix(examples): .f90->.F90 to preprocess files by @rouson in #121
  • Get beyond one type of Intel bugs by @ktras in #120
  • Nagfor workaround by @rouson in #122
  • chore(test): rm nagfor compiler workaround by @rouson in #129
  • Workaround intel bug by @ktras in #128
  • doc(README): add compilers in testing instructions by @rouson in #130
  • build(fpm.toml): increment dependency versions by @rouson in #131
  • More robust Adam optimizer test by @rouson in #134
  • Ifx workarounds + train longer in Adam test to pass with nagfor by @rouson in #135
  • Store tensor ranges by @rouson in #137
  • build(random_init): rm final nagfor workaround by @rouson in #136
  • Feature: Add input/output tensor component ranges to network files by @rouson in #138
  • Feature: map input to unit range & output tensors from unit range in inference_engine_t infer procedure by @rouson in #139
  • Normalize in training by @rouson in #140
  • Fix training restarts by @rouson in #141

Full Changelog: 0.10.0...0.11.0

Train cloud microphysics

26 Nov 04:54
9673024

This is the first release that contains an app/train_cloud_microphysics.f90 program and a training_configuration.json file with which the (Adam) training algorithm exhibits convergent behavior on an ICAR-generated training data set, as demonstrated by a monotonically decreasing cost function:

    ./build/run-fpm.sh run train-cloud-microphysics -- --base training --epochs 10 --start 720
    ...
            Epoch         Cost (avg)
               1  0.121759593
               2   1.61784310E-02
               3   5.31613547E-03
               4   2.68347375E-03
               5   1.63242721E-03
               6   1.11283606E-03
               7   8.27088661E-04
               8   6.59517595E-04
               9   5.56710584E-04
              10   4.91619750E-04
     Training time:    39.319034000000002      for          10 epochs
     System clock time:    353.68379099999999

What's Changed

  • fix(deploy-docs.yml) - use linuxbrew to install ford 7 by @rouson in #99
  • Train Thompson microphysics proxy by @rouson in #98
  • doc(README): clarify language by @rouson in #100

Full Changelog: 0.9.0...0.10.0

Training configuration JSON file I/O and new examples

06 Nov 09:33
1abb927

New in this release

  1. The app/train-cloud-microphysics.f90 program reads hyperparameters and network configuration from the new training_configuration.json input file and defines the corresponding variables in the program.
  2. The new example/print-training-configuration.f90 program displays a sample input file as shown below.
  3. The new example/learn-microphysics-procedures.f90 program learns to model two functions from ICAR's Thompson cloud microphysics model.
  4. Updated netcdf-interfaces dependency.
./build/run-fpm.sh run --example print-training-configuration 
Project is up to date
 {
     "hyperparameters": {
         "mini-batches" : 10,
         "learning rate" : 1.50000000,
         "optimizer" : "adam"
     }
 ,
     "network configuration": {
         "skip connections" : false,
         "nodes per layer" : [2,72,2],
         "activation function" : "sigmoid"
     }
 }

What's Changed

  • Cleanup examples by @rouson in #91
  • Train neural net proxy for two functions from ICAR's Thompson microphysics model by @rouson in #92
  • JSON-formatted input for training configuration by @rouson in #94
  • doc(README) add training configuration material by @rouson in #95
  • App reads training configuration JSON file by @rouson in #96
  • fix(example): work around associate issues by @rouson in #97

Full Changelog: 0.8.0...0.9.0

New training algorithms, examples, documentation, and bug fixes

04 Oct 16:59
967b0da

New training algorithms:

  • Stochastic gradient descent
  • The Adam optimizer

New examples:

Training neural nets to learn basic mathematical operations

The example/learn-*.f90 programs train neural networks to learn basic math functions. Given 8 input variables x(1), …, x(8), the training algorithm can now learn to produce the 6 outputs shown for each of the following operations:

  • Addition: [x(1)+x(2), x(2)+x(3), x(3)+x(4), x(4)+x(5), x(5)+x(6), x(6)+x(8)]
  • Multiplication: [x(1)*x(2), x(2)*x(3), x(3)*x(4), x(4)*x(5), x(5)*x(6), x(6)*x(8)]
  • Exponentiation: [x(1)**2, x(2)**3, x(3)**4, x(4)**4, x(5)**3, x(6)**2]
  • Power series: [1 + x(1) + (x(1)**2)/2 + (x(1)**3)/6, x(2), x(3), x(4), x(5), x(6)]

Inference-Engine's first application-relevant training example

The learn-saturated-mixing-ratio.f90 program trains a network with 2 inputs (normalized temperature and pressure), 1 output (saturated mixing ratio), and 1 hidden layer containing 72 nodes. The training inputs correspond to a uniform Cartesian grid laid over the 2D space of procedure-argument values bounded by the minimum and maximum values of the corresponding variables from an actual run of the ICAR regional climate model. The training outputs are the results of a refactored version of ICAR's sat_mr function; a test verifies that the refactored code gives the same answers to all significant digits across the entire grid of input values. The inputs are normalized so that the grid covers a unit square. The outputs are unnormalized.
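
A quick way to try this example, assuming the example target name matches the source-file name and reusing the run-fpm.sh wrapper shown elsewhere in these notes:

    # Assumed invocation; the target name is inferred from the example file name.
    ./build/run-fpm.sh run --example learn-saturated-mixing-ratio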

What's Changed

Refactoring

  • Breaking change: Adopt PyTorch nomenclature: replace input/output types with new tensor_t type by @rouson in #70

Features

  • Add app that reads training data from ICAR output by @rouson in #73
  • Train ICAR cloud microphysics by @rouson in #74
  • feat(train): make cost calculation/return optional by @rouson in #75
  • feat(app): rm zero-derivative points; report cost by @rouson in #76
  • feat(app): allow initial parameters to vary by @rouson in #77
  • Add Adam optimizer and stochastic gradient descent by @rouson in #78
  • New app features & refactoring: checkpoint/restart and user-specified training time range by @rouson in #79
  • feat(app): train in strided epochs by @rouson in #81
  • Reshuffle training data for each epoch by @rouson in #82
  • feat(app): 1st converging cloud microphysics model by @rouson in #83
  • feat(example): train ICAR saturated mixing ratio by @rouson in #90

Bug fixes

  • fix(setup.sh/run-fpm.sh): add -cpp flag by @rouson in #72
  • fix(app): only read opened json file if 'old' by @rouson in #80
  • chore(setup.sh): rm homebrew installation of cmake by @rouson in #84

Examples and Tests

  • Train linear combinations of inputs by @rouson in #88
  • Training examples: Learn math operations and a function from a cloud microphysics model by @rouson in #89
  • Train identity network from identity with 10% maximum random perturbations by @rouson in #86
  • test(train):test near radius of non-convergence by @rouson in #87

Full Changelog: 0.7.0...0.8.0

0.7.0 Train deep networks

30 Jul 21:29
efb01ea

This is the first release containing passing unit tests that train deep neural networks. The release uses a new implementation of the mini-batch back-propagation algorithm originally developed by @davytorres and refactored by @rouson. The new algorithm uses arrays structured differently from those in previous versions of Inference-Engine. In this release, every procedure has been refactored to eliminate all references to the previous array structures. This refactoring also yields a considerable speedup of the test suite.

What's Changed

  • Fix terminology by @rouson in #59
  • Fix construction from json by @rouson in #60
  • Add copyright statements by @rouson in #62
  • ci: use newer version of gcc by @everythingfunctional in #64
  • Train deep networks by @rouson in #63
  • Make trainable_engine_t independent by @rouson in #65
  • Refactor tests to use the new data structure by @rouson in #66
  • Refactor: remove legacy component arrays in inference_engine_t by @rouson in #67
  • Remove inference strategies and integrate netCDF file I/O into library & test suite by @rouson in #68

Full Changelog: 0.6.2...0.7.0

Shorter test-suite execution time

25 May 19:28
6e0ea8f

This release updates the test for the new experimental capability for training neural networks. The current unit tests verify convergence for training single-hidden-layer networks using gradient descent with updates averaged across mini-batches of input/output pairs. Future work will include verifying the training of deep neural networks and introducing stochastic gradient descent.

This release replaces the deleted 0.6.1 release. Relative to release 0.6.1, the current release provides

  1. Nearly an order of magnitude reduction in execution time for the mini-batch training example and for the similar unit test,
  2. A fix for the training example to invoke the correct object definition function for the involved trainable_engine_t object, and
  3. A fix for the inference_engine_t type-bound procedure to_json() to eliminate an erroneous trailing comma that led to invalid JSON output for networks with a single hidden layer.

What's Changed

  • Fix json output, actually shorten test-suite runtime by @rouson in #58

Full Changelog: 0.6.1...0.6.2

Experimental training capability

24 May 23:23
ed0afa5

This is the first release with an experimental capability for training neural networks. The current unit tests verify convergence for single-hidden-layer networks using gradient descent with updates averaged across mini-batches of input/output pairs. Future work will include verifying the training of deep neural networks and introducing stochastic gradient descent.

What's Changed

  • CI: install gfortran on macOS by @everythingfunctional in #48
  • Encapsulate all output and store unactivated weighted/biased neuron output for training by @rouson in #47
  • Add activation-function derivative functions by @rouson in #46
  • fix(setup.sh): brew install netcdf-fortran by @rouson in #45
  • fix(netcdf-interfaces): patch for fpm v >= 0.8 by @rouson in #49
  • Test: add test for single-layer perceptron by @rouson in #50
  • Add back-propagation by @rouson in #51
  • Export everything via one common module by @rouson in #52
  • Add nominally complete training algorithm by @rouson in #53
  • doc(README): mention training & additional future work by @rouson in #55
  • Group input/output pairs into mini-batches by @rouson in #54
  • doc(README): mention experimental training feature by @rouson in #56

Full Changelog: 0.5.0...0.6.0