feat: Getting up to date with main #34

jpitoskas · 2024-06-14T10:34:02Z

Need to use the One hot encoder

This commit adds the following implementations to the models_torch subpackage: - Added __init__.py for the subpackage - Implemented GraphAttentionNetwork in graph_attention_network.py - Implemented GraphConvolutionalNetwork in graph_convolutional_network.py - Implemented GraphSAGENetwork in graph_sage_network.py - Implemented GraphTransformerNetwork in graph_transformer_network.py These implementations provide configurable nn architectural support for training graph-based models using PyTorch.

Create a new directory named jaqpotpy_torch/ to organize all torch-related code. We'll decide later whether to keep this code here or move it to an entirely new standalone torch-specific package.

This commit initializes the featurizers_torch subpackage, adding the following implementations: - Added __init__.py for the subpackage - Implemented SmilesGraphFeaturizer in smiles_graph_featurizer.py. The SmilesGraphFeaturizer class is designed to create custom graph featurizations from SMILES strings. It offers highly configurable options, allowing users to choose from a wide range of both atom and bond characteristics to be included.

This commit initializes the datasets_torch subpackage, adding the following implementations: - Added __init__.py for the subpackage - Implemented SmilesGraphDataset in smiles_graph_dataset.py. The SmilesGraphDataset class is designed to create a custom torch Dataset for graph-featurized SMILES. Its __getitem__ method is overridden to return a torch_geometric Data object enacpsulating the following information: - Node attributes (x) - Edge indices (edge_index) - Edge attributes (edge_attr) - Target labels (y) - The original SMILES representation (smiles) SmilesGraphDataset enables straightforward integration into torch-based ML pipelines, facilitating the development of graph-based predictive models.

This commit removes the _torch suffix directory names within the jaqpotpy_torch module: - Renamed datasets_torch directory to datasets - Renamed featurizers_torch directory to featurizers - Renamed models_torch directory to models

This commit initializes the trainers subpackage, adding the following implementations: - Added __init__.py for the subpackage - Implemented an initial version of TorchModelTrainer abstract class in torch_model_trainer.py.

This commit initializes the trainers subpackage, adding the following implementations: - Added BinaryGraphModelTrainer, RegressionGraphModelTrainer in __init__.py - Extended TorchModelTrainer class with additional attributes.

This commit adds the following implementations to the trainers subpackage: - BinaryGraphModelTrainer subclass - RegressionGraphModelTrainer subclass

This commit adds the SmilesGraphDatasetWithExternal class implementation to the datasets subpackage. This class inherits from SmilesGraphDataset, and adds the functionality of providing an external feature vector along with the smiles representation.

This commit adds the implementation of the Featurizer abstract class. Also the abstract method featurize() is defined.

This commit adds the FullyConnectedNetwork class implementation to the models.

This commit adds the following implementations to the models subpackage: - GraphAttentionNetworkWithExternal - GraphConvolutionalNetworkWithExternal - GraphSAGENetworkWithExternal - GraphTransformerNetworkWithExternal In these models the corresponding graph neural network is employed to produce global level representations from smiles. Then these are concatenated with the external feature vectors and the concatenated vector is passed through a fully connected network to produce the final output.

- Fixed a circular import error of the FullyConnectedNetwork class - Added super().__init__() to all the models supporting external features

This commit adds the implementation of the deploy_model for both RegressionGraphModelTrainer and BinaryGraphModelTrainer. deploy_model() is an abstract method of the TorchModelTrainer base class, and must be implemented in every class that inherits from TorchModelTrainer, to support model deployment on Jaqpot.

In this commit we: - Implement the deployment logic for models and trainers that use external features - Set deploy_model function to be on the TorchModelTrainer class - Define the abstract method prepare_for_deployment() with a dynamic set of arguments per trainer subclass which transforms the data into the appropriate JSON - Add 'SMILES' in a protected namespace so that external features can't be named like this - Fix bugs regarding model input arguments

…graph-training

…com/ntua-unit-of-control-and-informatics/jaqpotpy into feat/JAQPOT-62/torch-graph-training

This commit provides: - Ready for deployment torch models are implemented - Everything up to date with the current structure of the API

This commit implements: - TabularDataset class inheriting from torch.utils.data.Dataset - BinaryFCModelTrainer & RegressionFCModelTrainer - The required changed in the BinaryModelTrainer and RegressionModelTrainer abstract classes to support data from torch.utils.data.DataLoader as well

In this commit we: - Fix default value of heads argument - Remove unnecessary 'Optional' types

In this commit we: - Add zero_division=0 to f1 - Add labels to confusion matrix function

This commit: - Adds example code blocks in SmilesGraphFeaturizer - Sets '.' instead of '_' for the separator when showing atom/bond characteristics vector labels - Add __call__ to to call abstract featurize() method in abstract class Featurizer

jpitoskas and others added 30 commits May 15, 2024 15:30

feat: Move torch-related code in jaqpotpy_torch/

69d0c8f

Create a new directory named jaqpotpy_torch/ to organize all torch-related code. We'll decide later whether to keep this code here or move it to an entirely new standalone torch-specific package.

refactor: Rename subdirectories in jaqpotpy_torch

35ed377

This commit removes the _torch suffix directory names within the jaqpotpy_torch module: - Renamed datasets_torch directory to datasets - Renamed featurizers_torch directory to featurizers - Renamed models_torch directory to models

feat: Add trainers package for torch models

05a8791

This commit initializes the trainers subpackage, adding the following implementations: - Added __init__.py for the subpackage - Implemented an initial version of TorchModelTrainer abstract class in torch_model_trainer.py.

feat: Extended trainers package for torch models

6fd3be3

This commit initializes the trainers subpackage, adding the following implementations: - Added BinaryGraphModelTrainer, RegressionGraphModelTrainer in __init__.py - Extended TorchModelTrainer class with additional attributes.

feat: Implement Binary & Regression Graph Trainers

6aa65ba

This commit adds the following implementations to the trainers subpackage: - BinaryGraphModelTrainer subclass - RegressionGraphModelTrainer subclass

refactor: Add abstract class Featurizer

9bbd19e

This commit adds the implementation of the Featurizer abstract class. Also the abstract method featurize() is defined.

feat: Implement Fully Connected Network model

b26555c

This commit adds the FullyConnectedNetwork class implementation to the models.

fix: Circular import error and add super()

59895d8

- Fixed a circular import error of the FullyConnectedNetwork class - Added super().__init__() to all the models supporting external features

fix: Change "mu" to "mean" for consistency

611430c

fix: Fix log_filepath var name & rm unused libs

686106e

Merge remote-tracking branch 'origin/main' into feat/JAQPOT-62/torch-…

6b683aa

…graph-training

Merge branch 'main' into feat/JAQPOT-62/torch-graph-training

c3227d2

Merge branch 'feat/JAQPOT-62/torch-graph-training' of https://github.…

9d5a50b

…com/ntua-unit-of-control-and-informatics/jaqpotpy into feat/JAQPOT-62/torch-graph-training

refactor: Change dir structure of trainers

dedceae

feat: Fully functional torch model upload

e863a12

This commit provides: - Ready for deployment torch models are implemented - Everything up to date with the current structure of the API

feat: Method to get all installed packages in env

26fb527

feat: Add bond featurs as edge_attr

927d84d

feat: Make categorical values as str

c61006c

feat: Implement Multiclass Trainer

7eee23f

fix: Model type of multiclass fc model trainer

5dbe355

fix: multiclass_fc_model_trainer.py rename typo

0e68b16

fix: Add zero_division in both precision & recall

f026247

jpitoskas added 15 commits June 20, 2024 01:26

fix: Fix args types of all networks/models

4c7725f

In this commit we: - Fix default value of heads argument - Remove unnecessary 'Optional' types

fix: Confusion Matrix return as matrix not vector

31cfe29

fix: Two minor fixes in metrics

8c5333f

In this commit we: - Add zero_division=0 to f1 - Add labels to confusion matrix function

feat: Remove labels from conf_mat of binary

5600dea

refactor: Add headers to all branch files

9177278

refactor: Add documentation to models

374e11e

refactor: Fix in docs of models

e23581a

refactor: Add documentation to trainers

250575a

feat: Add scheduler to trainer

d92c875

refactor: Add docstring to smiles graph featurizer

5cb1b00

fix: Forgot scheduler to regression_model_trainer

1927cfb

fix: Edge dim to transformer graph network

2482033

refactor: Add docstrings to Datasets

f5762ce

refactor: Refactor docs for featurizer and dataset

4d04f16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Getting up to date with main #34

feat: Getting up to date with main #34

jpitoskas commented Jun 14, 2024

feat: Getting up to date with main #34

Are you sure you want to change the base?

feat: Getting up to date with main #34

Conversation

jpitoskas commented Jun 14, 2024