TIR: Transformation-Interaction-Rational Symbolic Regression (v1.0)

TIR is a fast and simple Evolutionary Algorithm for Symbolic Regression developed in Haskell. Check out the documentation if you want to extend the code.

Transformation-Interaction-Rational (TIR) searches for models of the form:

$$f_{TIR}(\mathbf{x,w_p,w_q})=g\left(\frac{p(\mathbf{x,w_p})}{1 + q(\mathbf{x, w_q})}\right)$$

where g is an invertible function, p, q are IT expressions with m_p > 0 and m_q \geq 0 terms defined as:

$$f_{IT}(\mathbf{x,w}) = w_0 + \sum_{j = 1}^{m}{w_{j} \cdot (f_j \circ r_j)(\mathbf{x})}$$

representing a model with mterms where w are the coefficients of the affine combination, f_j is the j-th transformation function and r_j is the interaction function:

$$r_j(\mathbf{x}) = \prod_{i = 1}^{d}{x_i^{k_{ij}}}$$

where k_ij represents the integral exponents for each variable.

Dependencies

For Haskell-only:

BlAS
LAPACK
GSL

For Python wrapper:

Numpy
Pandas
Scikit-Learn

Installation

Clone the repository with git clone https://github.com/folivetti/tir.git.

Using Haskell Stack

Install the Haskell Stack tool following the instructions at https://docs.haskellstack.org/en/stable/README/.
Run install_stack.sh

Using Cabal

Run install_cabal.sh

Using Nix flake

Run install_nix.sh

Running

In order to run the algorithm, first create the training and test set files as a comma separated values without a header (see datasets folder for some examples) and then create a config file for the experiment (see configs folder for some examples).

The config file is split into three sections where you can set different hyperparameters:

[IO]
train = path and name of the training set
test  = path and name of the test set
log   = PartialLog "path and name of the output file"

[Mutation]
krange      = (-3, 3)
transfunctions = [Id, Sin, Cos, Tanh, SqrtAbs, Log, Exp]
ytransfunctions  = [Id, Exp, Sin]

[Algorithm]
npop      = 1000
ngens     = 500
algorithm = GPTIR
measures  = ["RMSE", "NMSE", "MAE", "R^2"]
task      = Regression
probmut   = 0.8
probcx    = 0.8
seed      = Nothing

[Constraints]
penalty = NoPenalty
shapes  = []
domains = []
evaluator = Nothing

Run the algorithm with the command:

stack run config <conf-file>

where is the path and name of the config file.

As an alternative you can use the python wrapper as illustrated in example.py.

Configuration options

IO

train - path of a comma separated file containing the training data
test - path of a comma separated file containing the test data (use the same as train if you don't want to use a test set)
log - Screen for screen-only results, PartialLog "directory" folder of where to store a partial log of the final results, FullLog "directory" folder where to store a full log of the whole evolutionary process.

Mutation

krange - tuple of integers of the minimum and maximum exponents of the interaction terms.
transfunctions - list of transformation functions. Available functions are Id, Abs, Sin, Cos, Tan, Sinh, Cosh, Tanh, ASin, ACos, ATan, ASinh, ACosh, ATanh, Sqrt, Square, Log, Exp.
ytransfunctions - list of invertible transformation function. Available functions are Id, Sin, Cos, Tan, Tanh, ASin, ACos, ATan, ATanh, Sqrt, Square, Log, Exp.

Algorithm

npop - population size
ngens - number of generations
algorithm - algorithm: GPTIR for genetic programming TIR or SCTIR for shape-constrained genetic programming TIR.
measures - list of performance measures to calculate: "RMSE", "NMSE", "MAE", "R^2". The first measure is the fitness function.
task - Regression / Classification (currently unsupported)
probmut - mutation probability
probcx - crossover probability
seed - random seed to be used: Nothing for default seed (i.e., current time), Just 42 for the random seed 42.

Constraints

penalty - the penalty term to be added to the fitness function: NoPenalty, Len 0.01 (0.01 times the number of nodes of the expression), Shape 0.01 (0.01 times the number of shape-constraint violations).
shapes - list of shape-constraints (refer to https://github.com/folivetti/shape-constraint).
domains - list of tuples with the minimum and maximum values of each variable.
evaluator - constraint evaluator: Nothing, Just InnerInterval, Just OuterInterval, Just (Sampling 100) (evaluate with 100 samples), Just Hybrid, Just Bisection (refer to https://github.com/folivetti/shape-constraint).

Cite

Fabrício Olivetti de França. 2022. Transformation-interaction-rational representation for symbolic regression. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO '22). Association for Computing Machinery, New York, NY, USA, 920–928. https://doi.org/10.1145/3512290.3528695

Bibtex:

@inproceedings{10.1145/3512290.3528695,
    author = {de Fran\c{c}a, Fabr\'{\i}cio Olivetti},
    title = {Transformation-Interaction-Rational Representation for Symbolic Regression},
    year = {2022},
    isbn = {9781450392372},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    url = {https://doi.org/10.1145/3512290.3528695},
    doi = {10.1145/3512290.3528695},
    booktitle = {Proceedings of the Genetic and Evolutionary Computation Conference},
    pages = {920–928},
    numpages = {9},
    keywords = {genetic programming, regression, symbolic regression},
    location = {Boston, Massachusetts},
    series = {GECCO '22}
    }

Experiments Results

Notice that the results in this repository are not the same as those in the referenced paper due to constant improvements to the source code (that sometimes fails). The original results for tir and every other algorithm in the paper are available at https://github.com/folivetti/tir/releases/tag/0.1

Contact

Maintained by Fabrício Olivetti de França (folivetti at ufabc.edu.br)

Acknowledgments

This project is supported by Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP), grant number 2018/14173-8.

License

GNU GPLv3, see LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 81 Commits
app		app
bin		bin
diversity		diversity
docs		docs
log		log
papers		papers
python		python
srbench		srbench
src/MachineLearning		src/MachineLearning
test		test
.gitignore		.gitignore
ChangeLog.md		ChangeLog.md
LICENSE		LICENSE
README.md		README.md
Setup.hs		Setup.hs
cabal.project		cabal.project
dbg		dbg
example.py		example.py
flake.bkp		flake.bkp
flake.lock		flake.lock
flake.nix		flake.nix
genDocs.sh		genDocs.sh
install_cabal.sh		install_cabal.sh
install_ghcup.sh		install_ghcup.sh
install_nix.sh		install_nix.sh
install_stack.sh		install_stack.sh
nixbuild.sh		nixbuild.sh
optuna_example.py		optuna_example.py
package.yaml		package.yaml
plotPareto.py		plotPareto.py
stack.yaml		stack.yaml
stack.yaml.lock		stack.yaml.lock
tir.cabal		tir.cabal
tmp.cfg		tmp.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TIR: Transformation-Interaction-Rational Symbolic Regression (v1.0)

Dependencies

Installation

Using Haskell Stack

Using Cabal

Using Nix flake

Running

Configuration options

IO

Mutation

Algorithm

Constraints

Cite

Experiments Results

Contact

Acknowledgments

License

About

Releases 4

Packages

Languages

License

folivetti/tir

Folders and files

Latest commit

History

Repository files navigation

TIR: Transformation-Interaction-Rational Symbolic Regression (v1.0)

Dependencies

Installation

Using Haskell Stack

Using Cabal

Using Nix flake

Running

Configuration options

IO

Mutation

Algorithm

Constraints

Cite

Experiments Results

Contact

Acknowledgments

License

About

Resources

License

Stars

Watchers

Forks

Releases 4

Packages 0

Languages

Packages