SN1-13: Organic Scoring implementation #268

dbobrenko · 2024-06-14T11:56:34Z

Organic Scoring Implementation

Changes

This implementation is based on the Generic Organic Scoring framework introduced here.
Organic scoring runs in a separate asyncio task alongside the existing benchmarking tasks.
Organic queries are received via an open validator axon and stored in the organic queue.
For each organic or synthetic query, a reference answer is generated by the LLM.
Rewards and penalties for both organic and synthetic queries are calculated from a relevance metric, defined as the cosine similarity between the sentence embeddings of the reference and of each completion.
Currently, LMSys-chat-1m is used for synthetic queries. (TODO: Switch to LLM-generated conversations and LLM-modified organic queries, or query synthetic data via an API.)
Logging covers the elapsed time between steps inside the organic loop, the organic queue length, and the default logs used by benchmarking tasks. Prompts and completions are excluded from W&B logging.
The validator queries 5 random miners from the network, which stream back completions for organic queries (configured via neuron.organic_sample_size).
The reward step for the organic or synthetic queue is triggered every 15 seconds; the interval is scaled down to 2 seconds if the organic queue is growing (configured via neuron.organic_trigger, neuron.organic_trigger_frequency, and neuron.organic_trigger_frequency_min).
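As a rough illustration of two items in the list above, here is a minimal sketch of the relevance metric (cosine similarity between embeddings) and of an adaptive trigger interval. It is not the code from this PR: the function names, the `scale_at` parameter, and the linear scaling rule are assumptions made for the example, and the embeddings are plain lists of floats rather than the output of a real sentence-embedding model.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Assumption: a zero vector is treated as having no relevance.
        return 0.0
    return dot / (norm_a * norm_b)


def relevance_scores(
    reference: Sequence[float], completions: Sequence[Sequence[float]]
) -> List[float]:
    """Score each completion embedding against the reference embedding."""
    return [cosine_similarity(reference, c) for c in completions]


def trigger_interval(
    queue_len: int,
    base: float = 15.0,    # default interval in seconds, as described above
    minimum: float = 2.0,  # lower bound in seconds, as described above
    scale_at: int = 10,    # hypothetical queue length at which the minimum is reached
) -> float:
    """Shrink the interval linearly from `base` to `minimum` as the queue fills.

    The linear rule is an assumption; the PR only states that the interval
    is scaled down when the organic queue is growing.
    """
    fraction = min(max(queue_len, 0) / scale_at, 1.0)
    return base - (base - minimum) * fraction
```

For example, `relevance_scores(ref, [c1, c2])` returns one score per completion, and `trigger_interval(len(queue))` could be used as the sleep time between reward steps. The actual scaling rule in the validator may differ.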

Process Workflow

Trigger Check: When the rewarding process is triggered, the system checks whether the organic queue is empty. If it is, synthetic datasets (defined in organic_scoring/synth_dataset_base.py) are used to bootstrap the organic scoring mechanism. Otherwise, samples are taken from the organic queue.
Data Processing: The sampled data is passed concurrently to the _query_miners and _generate_reference methods.
Reward Generation: After responses from the miners and any reference data are received, they are processed by the _generate_rewards method.
Weight Setting: The generated rewards are then applied through the _set_weights method.
Logging: Finally, the results can be logged using the _log_results method, together with all relevant data passed as arguments and the default time elapsed on each step of the rewarding process.
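The workflow above can be sketched as a single asynchronous step. This is a simplified illustration, not the code from this PR: the method names `_query_miners`, `_generate_reference`, `_generate_rewards`, `_set_weights`, and `_log_results` come from the description, but their signatures, the `OrganicScoringSketch` class, the stubbed miner responses, and the placeholder reward rule are all assumptions.

```python
import asyncio
import random
from collections import deque
from typing import Dict, List, Optional


class OrganicScoringSketch:
    """Hypothetical stand-in for the organic scoring loop (one reward step)."""

    def __init__(self, synth_samples: List[str]) -> None:
        self.organic_queue: deque = deque()
        self._synth_samples = list(synth_samples)
        self.weights: Dict[int, float] = {}
        self.log: List[dict] = []

    def _sample(self) -> Optional[dict]:
        # Trigger check: prefer organic queries, fall back to synthetic data.
        if self.organic_queue:
            return {"query": self.organic_queue.popleft(), "organic": True}
        if self._synth_samples:
            return {"query": random.choice(self._synth_samples), "organic": False}
        return None

    async def _query_miners(self, sample: dict) -> Dict[int, str]:
        await asyncio.sleep(0)  # stands in for network I/O to the miners
        return {uid: f"answer to {sample['query']}" for uid in (1, 2, 3)}

    async def _generate_reference(self, sample: dict) -> str:
        await asyncio.sleep(0)  # stands in for the LLM call
        return f"reference for {sample['query']}"

    async def _generate_rewards(
        self, sample: dict, responses: Dict[int, str], reference: str
    ) -> Dict[int, float]:
        # Placeholder rule: any non-empty completion gets 1.0.
        # The PR uses an embedding-based relevance metric instead.
        return {uid: 1.0 if text else 0.0 for uid, text in responses.items()}

    def _set_weights(self, rewards: Dict[int, float]) -> None:
        self.weights.update(rewards)

    def _log_results(self, sample: dict, rewards: Dict[int, float]) -> None:
        self.log.append({"organic": sample["organic"], "rewards": rewards})

    async def step(self) -> Optional[Dict[int, float]]:
        sample = self._sample()
        if sample is None:
            return None
        # Data processing: query miners and build the reference concurrently.
        responses, reference = await asyncio.gather(
            self._query_miners(sample), self._generate_reference(sample)
        )
        rewards = await self._generate_rewards(sample, responses, reference)
        self._set_weights(rewards)
        self._log_results(sample, rewards)
        return rewards
```

Calling `asyncio.run(scorer.step())` processes one sample. With a non-empty `organic_queue` the organic query is consumed first, and the synthetic samples are used only once the queue is empty.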

Hollyqui

LGTM, do check comments

prompting/agent.py

prompting/llms/vllm_llm.py

prompting/base/validator.py

prompting/rewards/pipeline.py

dbobrenko added 2 commits June 14, 2024 11:51

Add draft organic dataset, task, validator axon

246a482

Merge branch 'main' into feature/organic-task

b158b90

dbobrenko self-assigned this Jun 14, 2024

dbobrenko added 4 commits June 19, 2024 07:01

[WIP] Add architecture, rewards, task, dataset etc

8786155

Update draft notebook

f36ed99

Add minor organic changes

02d2de2

Merge with main branch

6db5919

dbobrenko changed the base branch from feature/organic to staging June 19, 2024 14:51

dbobrenko added 9 commits June 19, 2024 20:54

Add WIP miners response

d9927d3

Finish end-to-end organic communication

785afb6

Remove commented code

b4f29fe

WIP Organic module refactor

d565732

WIP-2 Organic thread refactor

7782d5d

WIP concurrent streams

ec51132

[WIP] Fix issue with dendride streaming

1b6c16b

WIP

7d26c26

SN1-109: Refactor organics to base organics framework

5c5078e

dbobrenko changed the title ~~[WIP] Validator axon, organic task, dataset~~ [WIP] Organic Scoring implementation Jul 18, 2024

dbobrenko changed the title ~~[WIP] Organic Scoring implementation~~ Organic Scoring implementation Jul 18, 2024

dbobrenko changed the base branch from staging to pre-staging July 18, 2024 11:17

dbobrenko changed the base branch from pre-staging to staging July 18, 2024 11:17

dbobrenko changed the base branch from staging to pre-staging July 18, 2024 11:37

dbobrenko changed the base branch from pre-staging to staging July 18, 2024 11:38

dbobrenko added 4 commits July 18, 2024 11:41

SN1-131: Clean up the code

ba5641f

Add organic sampling method to config

262e5c5

Merge with staging

66c5655

Add axon disabled warning

fcb653d

dbobrenko requested review from Hollyqui, bkb2135 and steffencruz July 18, 2024 12:14

dbobrenko added 18 commits July 23, 2024 00:29

Remove unused import

f8f7deb

Move axon serve after organic init

d8203f0

Move axon serve after organic init

702ef8f

Move run to async function, apply lock to LLM

fdf4749

Revert debugging timeout to 15 sec

8116e36

Revert debugging blacklist key

caeaa81

Add try except blocks for organic scoring

0a9f657

Merge with debug csv branch

3b76700

Remove unused import

152f371

Move organic to main loop

358dbee

Remove debugging blacklist hotkey

41746dd

Rvert asyncio loop

204f3e4

Clean up the code

3d1342a

Reduce rewards for synth

6988799

Merge branch 'feature/organic-csv-debug' into feature/organic-task

e14d56f

Small fixes

64d2ae3

Remove weights scale

8bc004e

Add LLM locks

aa456f6

Hollyqui approved these changes Jul 24, 2024

prompting/agent.py (resolved)

prompting/llms/vllm_llm.py (resolved)

bkb2135 reviewed Jul 24, 2024

prompting/base/validator.py (outdated, resolved)

dbobrenko added 5 commits July 24, 2024 18:57

Make synth dataset optional

0799aa1

Address comments

ef8f820

Fix unavailable synth dataset

fdf8a38

Update README for validators

77caafa

Add rouge, reduce penalty

22d3ae1

bkb2135 reviewed Jul 25, 2024

prompting/rewards/pipeline.py (outdated, resolved)

bkb2135 reviewed Jul 25, 2024

prompting/rewards/pipeline.py (outdated, resolved)

Address Kalei's comments

0629b29

dbobrenko merged commit 91af582 into staging Jul 25, 2024

dbobrenko deleted the feature/organic-task branch August 2, 2024 09:22

4 participants
