WiP: Communication protocol by pmitros · Pull Request #77 · ArgLab/writing_observer

pmitros · 2023-05-16T13:41:25Z

This is a format for how we define query, execution trees, and request data. It's a work-in-progress. Do not merge.

…s page

…lect

…king on keys

pmitros · 2023-06-05T14:17:46Z

learning_observer/learning_observer/communication_protocol/executor.py

+            'inputs': self.inputs,
+            'context': self.context,
+            'timestamp': datetime.datetime.utcnow().isoformat(),
+            'traceback': ''.join(traceback.format_tb(self.__traceback__))


We should add a comment on data format:

Eventually, this should be a serialized traceback of some kind, rather than a string, with formatting happening in the debug interface.

pmitros · 2023-06-27T12:58:06Z

learning_observer/learning_observer/communication_protocol/executor.py

+    response = []
+    for k in keys:
+        if isinstance(k, dict) and 'key' in k:
+            item = {


query_response_element or something?

I might also include some kind of comment that the value will be added below. I tend to do this not in English but e.g. in commented pseudocode. E.g.

'value': [To be added after query]

Or:
'value': None # Populated after query

But it's nice to give a sense of the full data structure.

Could also be in a docstring :)

pmitros · 2023-06-27T13:01:06Z

learning_observer/learning_observer/communication_protocol/executor.py

+            )
+        kvs_out = await KVS[k['key']]
+        if kvs_out is None:
+            kvs_out = k['default']


A short comment explaining why this is here would be helpful. E.g.
# We haven't run the reducer for this key yet, so we return the default value from the module

pmitros · 2023-06-27T13:02:01Z

learning_observer/learning_observer/communication_protocol/executor.py

+    :return: The generated keys
+    :rtype: list
+    """
+    pass


Shouldn't this raise an UnimplementedException instead of a pass?

pmitros · 2023-06-27T13:02:43Z

learning_observer/learning_observer/communication_protocol/executor.py

+
+
+@handler(learning_observer.communication_protocol.query.DISPATCH_MODES.KEYS)
+def hack_handle_keys(function, STUDENTS=None, STUDENTS_path=None, RESOURCES=None, RESOURCES_path=None):


Needs a docstring explaining the hack, why we have it, and how we plan to eventually fix it.

pmitros · 2023-06-27T13:04:36Z

learning_observer/learning_observer/communication_protocol/executor.py

+
+def _has_error(node):
+    '''
+    Non-recursive function to find and return 'error' value and its path from any dictionary within the node.


I don't know:

Why you need this and when it's called

What an 'error' value is

What a node is

This sort of mile-high context is important.

pmitros · 2023-06-27T13:05:48Z

learning_observer/learning_observer/communication_protocol/executor.py

+    while queue:
+        current, path = queue.pop(0)
+        if 'error' in current:
+            return current, path


Note this will only return one error, even if there are multiple. Of course, I have no idea right now what kinds of errors this is looking for until I understand the context in the codebase.

pmitros · 2023-06-27T13:09:04Z

learning_observer/learning_observer/communication_protocol/executor.py

+                for idx, i in enumerate(current[c]):
+                    if isinstance(i, dict):
+                        queue.append((i, path + [c, idx]))
+    return None, []


Depth-first search will be exponentially slow (in big O). This will only matter if you have a complex, interconnected DAG, but keeping some log of visited nodes would prevent this.

This could be fixed or (since we have simple DAGs now) left as a TODO, but it should be clearly documented (with appropriate keywords if someone is looking for performance issues).

pmitros · 2023-06-27T13:14:36Z

learning_observer/learning_observer/communication_protocol/executor.py

+
+def _sanitaize_output(variable):
+    '''
+    Sanitizes output by removing specified keys from each level of a dictionary or a list of dictionaries.


sanitize <-- spelling

Shouldn't KEYS_TO_REMOVE be a parameter (perhaps with a default?). Why is this considered a sanitization? Would we be leaking sensitive data here, or are we just cleaning unnecessary data?

In the docstring, it's not so much necessary to explain what this does as why we're doing it.

If all we're using this for, the name might be strip_context or similar.

pmitros · 2023-06-27T13:15:53Z

learning_observer/learning_observer/communication_protocol/executor.py

+
+async def execute_dag(endpoint, parameters, functions, target_exports=None):
+    """
+    Execute a flattened directed acyclic graph (DAG).


Explain "flattened" here. Or move the flatten function inside so it doesn't need to be flattened. This should clearly document what can and cannot go in.

pmitros · 2023-06-27T13:17:07Z

learning_observer/learning_observer/communication_protocol/executor.py

+    :param functions: The functions available for execution
+    :type functions: dict
+    :return: The result of the execution
+    :rtype: dict


I have no idea from the comment what any of these are, or their format. Either:

Add a doctest (perhaps too complex here); or

Clearly point to the example we built before, so people know where to figure this out.

I want to know the format of all the parameters to all of the functions in enough detail to be able to make use of them. As a new developer, I don't want to need to add print() statements or search the entire codebase to understand what goes in and what comes out.

pmitros · 2023-06-27T13:22:17Z

learning_observer/learning_observer/communication_protocol/executor.py

+    :rtype: dict
+    """
+    if target_exports is None:
+        target_exports = []


I have no idea what this is from the code. I also don't know why we'd call this with no exports. Should this parameter really have a default of None? That seems like a crazy default.

pmitros · 2023-06-27T13:22:31Z

learning_observer/learning_observer/communication_protocol/executor.py

+    if KVS is None:
+        KVS = learning_observer.kvs.KVS()
+
+    async def dispatch_node(node):


pmitros · 2023-06-27T13:23:13Z

learning_observer/learning_observer/communication_protocol/executor.py

+
+    async def walk_dict(node_dict):
+        '''
+        This will walk a dictionary, and call `visit` on all variables, and make the requisite substitions


Expand docstring (What is a substitution? What does it mean to "walk a dictionary"? What does "visit" do?)

pmitros · 2023-06-27T13:23:48Z

learning_observer/learning_observer/communication_protocol/executor.py

+            elif isinstance(child_value, dict):
+                await walk_dict(child_value)
+
+    async def visit(node_name):


pmitros · 2023-06-27T13:24:17Z

learning_observer/learning_observer/communication_protocol/executor.py

+            nodes[node_name] = await dispatch_node(nodes[node_name])
+
+        # import json
+        # print('*****', node_name, json.dumps(nodes[node_name], indent=2, default=str))  # useful but produces a lot


Don't include commented-out code

pmitros · 2023-06-27T13:25:59Z

learning_observer/learning_observer/communication_protocol/executor.py

+        visited.add(node_name)
+        return nodes[node_name]
+
+    # return everything in dev mode


'everything' ==> something explaining that this refers to execution paths / providence. No one will know what a context is.

pmitros · 2023-06-27T13:26:41Z

learning_observer/learning_observer/communication_protocol/executor.py

+    if learning_observer.settings.RUN_MODE == learning_observer.settings.RUN_MODES.DEV:
+        return {e: await visit(e) for e in target_nodes}
+
+    # otherwise remove context from outputs


# Remove execution history if in deployed settings, with data flowing back to teacher dashboards

bradley-erickson added 17 commits May 12, 2023 17:23

initial commit for communication protocol

b70075f

changed dispatch_modes to class instead of enum

caff5da

cleaning up prior enum stuff

e8e0976

adjusting keys

72d4f35

updated function names and cleaned up code a bit

6d6b915

added in async code where necessary

eebcdc1

renamed request to query

7442099

removed create_ prefix from query

72df1ff

added named_query section to modules and displays them in admin statu…

74d88b6

…s page

added named queries as endpoints to each module

262045e

created query view platform

8b02021

working example with ws

d560861

added debugger page and implemented part of the KVS connection for se…

0730fce

…lect

added to lo.querie namespace and added function gatherer decorator

a580a08

changed named and added decorated for populating functions, still wor…

e0cd247

…king on keys

able to see text from google doc now

cf03379

start of error handling

4c346e3

pmitros commented Jun 5, 2023

View reviewed changes

pmitros and others added 12 commits June 5, 2023 10:48

Error messages and comments

1d24ffa

Factored out test code

6157ee2

Start of test infrastructure

c24e630

linted code

f209509

cleaned up test code KVS mess

1392697

broke execution file down further

b29c6d2

add more documentation regarding integration methods

659ab89

addressed duplicate function names

d7898eb

cleaned up test case code more

feecfc7

implemented first pass at exception handling with parameter

338b8e0

changed callable_function to publish_function

3acaafd

updated _remove_context to remove a list of keys instead

64ba429

pmitros commented Jun 27, 2023

View reviewed changes

bradley-erickson added 10 commits June 27, 2023 16:52

initial overhaul of docstrings

b5bd987

updated iscoroutinefunction to isawaitable where appropriate

5029712

added more documentation

279f0d5

renamed context to providence

5df5719

added more documentation to exception and query

c4789ff

more documentation

a2f0915

more doctests and added exception wrapper back in, got missed somewhere

4c4d12c

fix lint errors

e16c17e

Merge branch 'master' into communication-protocol

b78f614

more documentation of other files plus some renaming

8c63a07

bradley-erickson merged commit 3f85805 into master Jul 6, 2023

bradley-erickson linked an issue Jul 10, 2023 that may be closed by this pull request

Determine communication stream #52

Closed

bradley-erickson deleted the communication-protocol branch March 10, 2026 12:37



		@handler(learning_observer.communication_protocol.query.DISPATCH_MODES.KEYS)
		def hack_handle_keys(function, STUDENTS=None, STUDENTS_path=None, RESOURCES=None, RESOURCES_path=None):

Conversation

pmitros commented May 16, 2023

Uh oh!

pmitros Jun 5, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

'value': [To be added after query]

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pmitros Jun 5, 2023 •

edited

Loading