Support/aws bedrock #120
Conversation
…nd added claude2 support, wip
…tion parameters, possibility to change teacher model
LLM_GENERATION_PARAMETERS = ["temperature", "top_p", "max_new_tokens"]

class LLama_Bedrock_API(Bedrock_API):
Strange capitalization.
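The reviewer is pointing at PEP 8, which names classes in CapWords with no underscores. A minimal sketch of the rename, with `Bedrock_API` stubbed purely for illustration (the real base class lives in the PR):

```python
class BedrockAPI:
    """Stub standing in for the PR's Bedrock_API base class."""


class LlamaBedrockAPI(BedrockAPI):
    """PEP 8-style replacement for the LLama_Bedrock_API name."""
```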
from tanuki.language_models.llm_configs.openai_config import OpenAIConfig
from tanuki.language_models.llm_configs.claude_config import ClaudeConfig
from tanuki.language_models.llm_configs.llama_config import LlamaBedrockConfig

DEFAULT_MODELS = {
So I suggest this goes to init.py
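The suggestion is to host the `DEFAULT_MODELS` registry in the `llm_configs` package's `__init__.py`, next to the config imports, so callers need only one package-level import. A sketch under that assumption, with `OpenAIConfig` stubbed as a dataclass since the real class is in the PR:

```python
from dataclasses import dataclass


@dataclass
class OpenAIConfig:
    """Stub for tanuki's real OpenAIConfig; fields mirror the diff."""
    model_name: str
    context_length: int


# Hypothetical tanuki/language_models/llm_configs/__init__.py contents:
DEFAULT_MODELS = {
    "gpt-4": OpenAIConfig(model_name="gpt-4", context_length=8192),
}
```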
    return input_config
if isinstance(input_config, str):
    # This is purely for backwards compatibility as we used to save the model as a string
    if type == "distillation":
"distillation" -> into a DISTILLATION constant.
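A hypothetical constants module replacing the bare `"distillation"` literal, so the comparison cannot drift through typos; the constant names and the `describe` helper are assumptions for illustration, not tanuki's actual API:

```python
# Model-role constants standing in for scattered string literals.
DISTILLATION = "distillation"
TEACHER = "teacher"


def describe(model_type: str) -> str:
    # Compare against the named constant instead of a raw string.
    if model_type == DISTILLATION:
        return "student model loaded from a string"
    return "teacher model"
```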
src/tanuki/function_modeler.py
Outdated

def _get_dataset_info(self, dataset_type, func_hash, type="length"):
    """
    Get the dataset size for a function hash
    """
    return self.data_worker.load_dataset(dataset_type, func_hash, return_type=type)

def _configure_teacher_models(self, teacher_models: list, func_hash: str):
Please include more expressive type definition.
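A bare `list` annotation says nothing about the element type. A minimal sketch of the more precise signature the review asks for, assuming teacher models arrive either as string aliases or config objects (`BaseModelConfig` is stubbed here; the storage logic is illustrative):

```python
from typing import List, Union


class BaseModelConfig:
    """Stub for tanuki's config base class."""


class FunctionModeler:
    """Minimal sketch; only the annotated signature matters here."""

    def _configure_teacher_models(
        self,
        teacher_models: List[Union[str, BaseModelConfig]],
        func_hash: str,
    ) -> None:
        # Store the resolved teacher models per function hash.
        self.teacher_models = {func_hash: teacher_models}
```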
src/tanuki/models/api_manager.py
Outdated
"""
Adds an API provider to the API manager.
"""
if provider == "openai":
Replace with constant.
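The same constant-over-literal point applies to provider names. A sketch with hypothetical constant names and a placeholder factory (the return strings stand in for the real API-client construction):

```python
# Hypothetical provider-name constants; not tanuki's actual identifiers.
OPENAI_PROVIDER = "openai"
BEDROCK_PROVIDER = "bedrock"


def make_provider(provider: str) -> str:
    if provider == OPENAI_PROVIDER:
        return "OpenAI_API instance"  # placeholder for the real client
    if provider == BEDROCK_PROVIDER:
        return "Bedrock_API instance"
    raise ValueError(f"Unknown provider: {provider}")
```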
"""
distilled_model: BaseModelConfig = DEFAULT_MODELS["gpt-3.5-finetune"]
current_model_stats: Dict = {
    "trained_on_datapoints": 0,
Constants...
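One reading of the review: lift the stats-dict keys into named constants. The key name is copied from the diff; everything else is illustrative:

```python
# Named constant for the stats-dict key used in the diff above.
TRAINED_ON_DATAPOINTS = "trained_on_datapoints"

current_model_stats = {
    TRAINED_ON_DATAPOINTS: 0,
}
```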
    "anthropic.claude-v2:1": ClaudeConfig(model_name = "anthropic.claude-v2:1", context_length = 200000),
    "llama_70b_chat_aws": LlamaBedrockConfig(model_name = "meta.llama2-70b-chat-v1", context_length = 4096),
    "llama_13b_chat_aws": LlamaBedrockConfig(model_name = "meta.llama2-13b-chat-v1", context_length = 4096),
    "ada-002": OpenAIConfig(model_name="text-embedding-ada-002", context_length=-1)
Let's add at least one Bedrock embedding model.
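One candidate is Bedrock's `amazon.titan-embed-text-v1` text-embedding model. A sketch of what the registry entry might look like; the `TitanBedrockConfig` class, the alias key, and the context length are assumptions mirroring the shape of the PR's other config entries:

```python
from dataclasses import dataclass


@dataclass
class TitanBedrockConfig:
    """Hypothetical config class mirroring LlamaBedrockConfig's shape."""
    model_name: str
    context_length: int


# Illustrative registry entry for a Bedrock embedding model.
BEDROCK_EMBEDDING_MODELS = {
    "aws_titan_embed_v1": TitanBedrockConfig(
        model_name="amazon.titan-embed-text-v1", context_length=8000
    ),
}
```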
Implements support for AWS Bedrock, making Llama models served through Bedrock available as teacher models.
Main changes