Message tree state machine #555

andreaskoepf · 2023-01-08T21:17:46Z

Message trees are created with an initial prompt. After checking the initial prompt quality via peer-review a tree enters the growing phase in which concurrently prompter/assistant replies and labeling tasks are handed out. When the desired number of messages surpassing the minimum acceptable quality has been collected the tree goes into ranking phase. In ranking phase replies of messages with more than one reply are presented to humans to be ordered from best to worst. When enough ranking results have been collected for all messages the tree enters the ready for scoring phase in which the scoring algorithm combines all user feedback and computes the ranking-scores of all children in the message tree. After the scoring algorithm ran the message tree is completed and enters the ready for export state.

Beside the ready for export additional terminal error-states exist: scoring_failed, aborted_low_grade, halted_by_moderator .. in terminal states no further tasks are handed out to users.

…_LABELING setting)

…tree_state_machine2

yk

looks awesome!

one question I had was if by nature of how we grow trees and when we stop, can it be that there are many children that have no siblings? in that case, those would never be ranked at all. Not something we have to fix now, just to keep in mind.

Also, very dangerous stuff around raw sql statements, makes things more interesting ;) let's just keep this in mind and be super weary of any property changes to the models. Eventually, we might want to re-write this to sqlmodel/sqlalchemy

yk · 2023-01-11T08:31:46Z

backend/oasst_backend/tree_manager.py

+    num_required_rankings: int = 3
+    """Number of rankings in which the message participated."""
+
+    mandatory_labels_initial_prompt: Optional[list[protocol_schema.TextLabel]] = [


I feel we shouldn't ask "helpful" as mandatory, at least not for initial prompt and prompter reply, because it doesn't really apply to their situation. Maybe we also need to re-think the helpful label, maybe calling it "appropriate" or so, meaning that a prompt is really a prompt, an assistant response is of (helpful) assistance, etc.

thinking a bit more, I think "appropriate" as I formulate it here would just be the opposite of "spam". the question is, if we know something is "not spam", in what way is that not enough, to the point where we'd need to collect another quality-gate label?

ok, I removet helpful from all and also simplified the

def _calculate_acceptance(self, labels: list[TextLabels]): # calculate acceptance based on spam label return np.mean([1 - l.labels[protocol_schema.TextLabel.spam] for l in labels])

yk · 2023-01-11T08:34:27Z

backend/oasst_backend/tree_manager.py

+            task_type = TaskType(np.random.choice(a=len(task_weights), p=task_weights))
+
+        logger.debug(f"Selected task type: {task_type}")
+        return TaskType(task_type)


isn't task_type already a TaskType?

yes, I think added the cast to the np.random.choice and forgot to remove it there.

yk · 2023-01-11T08:35:56Z

backend/oasst_backend/tree_manager.py

+
+        num_missing_replies = sum(x.remaining_messages for x in active_tree_sizes)
+
+        task_tpye = self._task_selection(


tpye is my favorite typo ;-)

andreaskoepf · 2023-01-11T09:11:02Z

one question I had was if by nature of how we grow trees and when we stop, can it be that there are many children that have no siblings? in that case, those would never be ranked at all. Not something we have to fix now, just to keep in mind.

The trees are constructed randomly. Especially the deeper nodes are at risk of being alone. It currently depends much on the following configuration settings:

    max_tree_depth: int = 6
    """Maximum depth of message tree."""

    max_children_count: int = 5
    """Maximum number of reply messages per tree node."""

    goal_tree_size: int = 15
    """Total number of messages to gather per tree"""

Parents have a higher chance of being selected: Mainly because they are longer in the tree and participate in more selection processes. New children first have to go through the review-phase before they themselves can become parents.

Also, very dangerous stuff around raw sql statements, makes things more interesting ;) let's just keep this in mind and be super weary of any property changes to the models. Eventually, we might want to re-write this to sqlmodel/sqlalchemy

I see the problem with schema-changes but problems likely surface quickly since most of the queries are run for the first few interactions with the system. Maybe moving the queries into stored-procedures would also be a good option (not sure how the larger queries would look as sql-alchemy code).

fozziethebeat · 2023-01-11T09:43:45Z

Approve regarding the broken web e2e tests. We'll fix that in a separate PR.

andreaskoepf added 2 commits January 8, 2023 22:16

first bits of tree_manager

50f79a3

add query_incomplete_rankings()

fbd6cc4

fozziethebeat added the backend label Jan 9, 2023

andreaskoepf and others added 23 commits January 9, 2023 14:54

Add SQL queries for TreeManager task selection

f283a43

first working version of TreeManager.next_task()

6144451

remove old generate_task(), add mandatory_labels to text_labels task

3e81722

Add ConversationMessage list to Ranking tasks

ab5e7de

add more sophisticated sql queries to find extendible trees

b1971fe

add TreeManager.query_extendible_parents()

0bfc166

fix task validation, seed data insertion (reviewed)

590573c

provide user for task selection in text-frontend

c256548

enter 'growing' state

69a0983

enter 'aborted_low_grade' state

4013990

enter 'ranking' state

6d9efa7

check tree 'growing' state upon relpy insertion

c18f87b

exclude user from labeling their own messages (added DEBUG_ALLOW_SELF…

8d32ac4

…_LABELING setting)

add DEBUG_ALLOW_SELF_LABELING to docker-compose.yaml

2c6e7d2

fix ranking submission

7954205

add query_tree_ranking_results()

b4c241d

add ranked_message_ids to RankingReactionPayload

0fc1d30

fix reply_messages instead of prompt_messages

8df05c0

incorment 'ranking_count' of ranked replies

bfb950d

Merge remote-tracking branch 'upstream/main' into 531_inital_message_…

a7b25a5

…tree_state_machine2

added logic to check_condition_for_scoring_state

1da41ac

changes to msg_tree_state_machine

68fc4e5

pre-commit changes

6577aa6

danielpatrickhug mentioned this pull request Jan 11, 2023

patch to Msg tree state machine #611

Closed

enter 'ready_for_scoring' state

4d01f7e

andreaskoepf marked this pull request as ready for review January 11, 2023 07:08

andreaskoepf requested a review from yk as a code owner January 11, 2023 07:08

andreaskoepf added 2 commits January 11, 2023 08:38

re-add HF embedding call (lost during merge)

702f98d

use prepare_conversation() helper for seed-data creation

ee066d6

yk approved these changes Jan 11, 2023

View reviewed changes

Partially add user specified task selection

4d747f9

andreaskoepf merged commit 14fa08e into main Jan 11, 2023

andreaskoepf deleted the 531_inital_message_tree_state_machine2 branch January 11, 2023 09:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Message tree state machine #555

Message tree state machine #555

andreaskoepf commented Jan 8, 2023

yk left a comment

yk Jan 11, 2023

yk Jan 11, 2023

andreaskoepf Jan 11, 2023

yk Jan 11, 2023

andreaskoepf Jan 11, 2023

yk Jan 11, 2023

andreaskoepf Jan 11, 2023

andreaskoepf commented Jan 11, 2023

fozziethebeat commented Jan 11, 2023


		num_missing_replies = sum(x.remaining_messages for x in active_tree_sizes)

		task_tpye = self._task_selection(

Message tree state machine #555

Message tree state machine #555

Conversation

andreaskoepf commented Jan 8, 2023

yk left a comment

Choose a reason for hiding this comment

yk Jan 11, 2023

Choose a reason for hiding this comment

yk Jan 11, 2023

Choose a reason for hiding this comment

andreaskoepf Jan 11, 2023

Choose a reason for hiding this comment

yk Jan 11, 2023

Choose a reason for hiding this comment

andreaskoepf Jan 11, 2023

Choose a reason for hiding this comment

yk Jan 11, 2023

Choose a reason for hiding this comment

andreaskoepf Jan 11, 2023

Choose a reason for hiding this comment

andreaskoepf commented Jan 11, 2023

fozziethebeat commented Jan 11, 2023