
Create initial release #1

Closed
98 tasks done
adamcharnock opened this issue Aug 3, 2017 · 14 comments
adamcharnock commented Aug 3, 2017

Goals

  • Ease of development & debugging
  • Excellent tooling & documentation
  • Targeting smaller teams
  • High speed & low latency (but not at the expense of other goals)

MVP

  • Message transport layer
    • Message protocol (standard headers) – handled by individual transports
  • Result transport layer
  • Api definition layer
    • Api class
    • Event class
    • Metadata
    • Registry
    • API Autodiscovery
  • Event transport layer
  • Error responses
  • Ability to listen to multiple events with a single handler (already available in transports, just needs API)
  • Refactor marshalling code. We need a distinct place for the marshalling code to live, currently it is being pulled between the BusClient, Message, and XxxTransport classes.

Required prior to use in production

  • Command line interface
  • Interactive REPL (lightbus shell)
  • Code cleanup (prior to schema / proper serialising & encoding)
  • Basic schema functionality
    • Creation of jsonschema from method signatures
    • SchemaTransport
    • Exporting schema json
    • Loading schema json
    • Validation of outgoing RPC parameters
    • Validation of incoming RPC parameters
    • Validation of outgoing RPC responses
    • Validation of incoming RPC responses
    • Validation of outgoing event parameters
    • Validation of incoming event parameters
  • Configuration system (yaml format? create jsonschema?)
    • Config format
    • Config loading
    • Config validation
    • Config loading on lightbus creation
    • Make config available to BusClient
    • Use bus-level config parameters
    • Use api-level config parameters
    • Api-level custom log levels – Non-trivial implementation. Will leave for now.
    • Plugin-level config
      • Plugin-level config structure
      • Create plugins using config options
      • Respect enabled flag
    • API-specific transports (non-critical)
    • Work through in-code config: comments
  • Pass API name and event name to event handlers (& update examples)
  • Datetime type mapping
  • Short-form command line arguments
  • Deploy new docs (pending issue filed with GitHub)
  • Work through in-code TODOs
    • Configuration options
    • Move schema config to root level
  • Redis
    • Update RPC transport to use PUBSUB?
    • Update Event transport to use streams consumer groups
  • Add stream_use config option, which will allow for serving all events on the API from a single stream
  • Config loading from URL/Redis
  • Pluggable configuration source (Will do this if needed, we now support http[s]:// urls anyway)
  • @bus.on_start() and @bus.on_stop(). Consider refactoring plugins into this format too (the plugin system would have to operate around modules rather than classes, which may be a faff).
  • @bus.configure() to allow for per-service bus configuration. Is this a good idea? (Leave for now, will revisit if there is a need)
  • Look into (and implement?) structured logging (see issue) - Now spun off into Implement (optional) structured logging #10
  • Implement synchronous context methods on lightbus_set_database()
  • Improve errors when performing RPC call with non-keyword arguments (check events too)
  • Improve schema validation errors
  • If event streaming – processing of events should stop if an error is encountered. This won't be practical if running multiple workers.
  • Cleanup python imports
  • Rename BusNode to something else. The 'node' term refers to the data structure it forms, but it can appear that a 'node' refers to the service.
  • Wait for redis to come up rather than dying on startup. Lightbus should also reconnect to redis. – Actually, is this a good idea? Should the process be running if it isn't operational? If we support reconnection (which we should) then we must accept that the process will sometimes be running while not operational. Perhaps only try for 60 seconds though. – Implemented reconnections for rpc & event consumption. Decided that the pluggable nature of Lightbus' transports made it too hard to wait for redis to come up in a reliable way. Update: Lightbus now lazily connects to Redis, which may go some way towards mitigating the issues around this point.
  • Transactional Transport: The duplicates table needs to track uniqueness on both message ID and consumer name. Migration steps: (Removed, see Transactional Transport #4)
    • ALTER TABLE lightbus_processed_events DROP CONSTRAINT lightbus_processed_events_pkey;
    • ALTER TABLE lightbus_processed_events ADD COLUMN listener_name VARCHAR(200);
    • UPDATE lightbus_processed_events SET listener_name = 'default';
    • ALTER TABLE lightbus_processed_events ADD PRIMARY KEY (message_id, listener_name);
  • Django: I suspect the transactional middleware creates a new postgres connection for every request. (Removed, see Transactional Transport #4)
  • Test RPCs/events with binary data
  • consumer_group_prefix -> service_name
  • consume(consumer_group=...) -> consume(listener_name=...)
  • xxx.listen(fn, options["consumer_group"]) -> xxx.listen(fn, listener_name="...")
  • Warn if duplicate consumer name used (Less important now that listener_name is mandatory when setting up a new listener)
  • Explicit, non-global, API registration
    • Docs
  • Config inheritance (in branch feature/config-inheritance. I'm not convinced this is a good idea)
    • Docs - Will leave this for now
  • Programmatic scheduler
    • Docs
  • Non-global plugin registration
  • RFCs specifying protocol for all Redis transports
    • Event
    • RPC
    • Result
    • Schema
  • Fix CannotBlockHere exceptions / run user code in executors
  • Lazy-load schema
  • Migrate message IDs to UUIDs
  • Call to xpending should happen in a loop until there are no more messages
  • Standardise on command line argument position (before or after subcommand)
  • Cleanup old consumer groups and consumers
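The "Creation of jsonschema from method signatures" item above could look something like the following sketch. The helper name and the Python-to-JSON type mapping are mine, not the Lightbus implementation:

```python
import inspect

# Hypothetical sketch, not Lightbus' API: derive a JSON schema object from
# a method's signature using the built-in inspect module.
PYTHON_TO_JSON = {str: "string", int: "number", float: "number", bool: "boolean"}

def signature_to_jsonschema(fn) -> dict:
    """Build a JSON schema object describing fn's parameters."""
    properties = {}
    required = []
    for name, param in inspect.signature(fn).parameters.items():
        if name == "self":
            continue
        json_type = PYTHON_TO_JSON.get(param.annotation)
        properties[name] = {"type": json_type} if json_type else {}
        if param.default is inspect.Parameter.empty:
            required.append(name)  # no default value, so the caller must supply it
    return {"type": "object", "properties": properties, "required": required}

# Example RPC-style method
def check_password(self, username: str, password: str, strict: bool = False):
    ...

schema = signature_to_jsonschema(check_password)
```

Parameters with defaults (such as `strict` above) become optional in the schema; everything else lands in `required`.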

Ideas for future work / notes:

  • Ability to bypass bus overheads for local RPC calls (suggestion).
  • Rename Meta to Options as this is more descriptive and doesn't clash with the 'metaclass' concept. Downside is that this diverges from Django.
  • Store the last result of each RPC call and event firing (for both success and failure). Needs some kind of stats backend?
  • Examine existing ways of defining API schemas: JSON schema (more generic), SWAGGER/OpenAPI (Rest-specific)
  • Adding testing utilities: Assert RPC was called, assert event was fired etc
    • Support each API having stubs (reference)
      • There was an interesting article recently on different kinds of testing, in particular for services. Mocks v. stubs v. ?, but I cannot find it again.
    • It should be possible for your software to function (for development purposes) without anything else operating on the bus. (I.e. mocking/stubbing)
  • Schema: https://pypi.python.org/pypi/schema ?
  • Test against the big list of naughty strings - it has some nice odd characters in there.
  • Use AlpacaJS to generate forms in the admin UI based on JSON schema
  • Create online configuration validator
  • Use code comments to populate json schema description field. (doc strings, #: style. Parse parameter descriptions out of docstring?)
  • Sanity check annotations for event listeners and RPCs. Issue a warning if annotated types are not JSON-compatible.
  • Management UI – consider refactoring away from React to something more lightweight. React is preventing me from wanting to pursue this further.
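The annotation sanity check idea above could be sketched as follows. `warn_on_non_json_annotations` is a hypothetical helper, not part of Lightbus:

```python
import inspect
import warnings

# Hypothetical helper: warn when an RPC/listener parameter annotation is
# unlikely to serialise cleanly to JSON.
JSON_COMPATIBLE = (str, int, float, bool, list, dict, type(None))

def warn_on_non_json_annotations(fn):
    for name, param in inspect.signature(fn).parameters.items():
        annotation = param.annotation
        if annotation is inspect.Parameter.empty:
            continue  # unannotated parameters cannot be checked
        if not (isinstance(annotation, type) and issubclass(annotation, JSON_COMPATIBLE)):
            warnings.warn(
                f"Parameter {name!r} of {fn.__name__}() is annotated with "
                f"{annotation!r}, which may not be JSON-compatible"
            )

class User:
    pass

def on_user_registered(user: User, email: str):
    ...

warn_on_non_json_annotations(on_user_registered)  # warns about 'user' only
```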

Reading

Pre launch tasks

  • Create celery comparison & migration guide
  • Create Django example + add docs for using with Django
  • @uses_django_db() - Either include in docs, or integrate it into Lightbus core (Docs for now, see Create Django Middleware #6 for future improvements)
  • Create contributing guide
  • Create code of conduct
  • Create discourse room
  • Add python linting (done already?)
  • Initial benchmarks and a caveat detailing that the project has yet to undergo any performance optimisation.
  • Finish docs
  • Document methods lacking docstrings
  • Release to PyPI
@adamcharnock

Update: We now have Redis RPC & Result brokers working.

Server

[screenshot: server output, 2017-10-08 17:45]

Client

[screenshot: client output, 2017-10-08 17:45]

@adamcharnock

Autodiscovery

I've decided to forgo autodiscovery for now. I think this could work well inside a predictable project structure – such as a Django project – but a general purpose solution would need to exhaustively import a lot of modules in order to discover bus.py modules.

I think this is probably doable, but for now it seems best to omit this particularly magical feature. I suspect more knowledge of Python's import system will be required to do this well.

Initial naive implementation was:

# Initial naive (and abandoned) implementation of bus.py autodiscovery
import importlib.util
from glob import glob
from pathlib import Path

def autodiscover(directory='.'):
    """Try to discover & import bus.py files"""
    lightbus_directory = Path(__file__).parent.resolve()
    # Find all bus.py files
    matches = [
        Path(match).resolve()
        for match in glob(str(Path(directory) / '**' / 'bus.py'), recursive=True)
    ]
    # Filter out any that are within the lightbus package
    matches = [m for m in matches if not str(m).startswith(str(lightbus_directory) + '/')]
    # Import each match
    for match in matches:
        # WARNING: This use of spec_from_file_location() is incorrect. The first
        # parameter needs to be the entire module name, but here we just use 'bus'.
        spec = importlib.util.spec_from_file_location('bus', str(match))
        bus_module = importlib.util.module_from_spec(spec)
        spec.loader.exec_module(bus_module)

@adamcharnock

After some thought, I think some basic autodiscovery will still be useful. Essentially using the first bus.py file yielded by a breadth-first search. Then determine the module name from sys.path, and perform the import.

However, only the top-level bus.py file will be imported; anything else will have to be imported from there. Will attempt implementation.
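The breadth-first search and sys.path-based module-name derivation described above can be sketched as follows (function names are illustrative, not the eventual implementation):

```python
import sys
from collections import deque
from pathlib import Path

# Illustrative sketch of breadth-first bus.py autodiscovery.
def find_bus_py(root="."):
    """Return the shallowest bus.py beneath root (breadth-first), or None."""
    queue = deque([Path(root).resolve()])
    while queue:
        directory = queue.popleft()
        candidate = directory / "bus.py"
        if candidate.exists():
            return candidate
        queue.extend(p for p in sorted(directory.iterdir()) if p.is_dir())
    return None

def module_name_for(path):
    """Derive the dotted module name by matching path against sys.path."""
    for entry in sys.path:
        try:
            relative = path.relative_to(Path(entry or ".").resolve())
        except ValueError:
            continue  # path does not live under this sys.path entry
        return ".".join(relative.with_suffix("").parts)
    raise ImportError(f"{path} is not importable from any sys.path entry")
```

The derived name can then be passed to importlib.import_module(), avoiding the incorrect spec_from_file_location('bus', ...) call in the naive version.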

@adamcharnock

Work has been continuing over the last few months. Items above are getting checked off pretty quickly now, so I'm hoping this proof of concept / MVP should be ready for the light of day in the next few months.

During this time we should also see the release of Redis 5.0, which will include streams (ref).

@adamcharnock

adamcharnock commented Mar 5, 2018

Latest thoughts on how to specify parameters for events. Unlike RPCs, event definitions are provided by class properties rather than methods. With RPCs we can extract the RPC signature from the method definition using Python's built-in inspect module.

With events we do not use method definitions, as doing so would imply that there should also be an implementation. The implementation should live in the API's clients, not the API itself. We therefore use class properties, but as a result we lose the ability to extract a parameter signature.

I've explored various ideas below regarding how we can specify parameters for events:

# How can we specify parameters for events?

class TestApi(Api):
    # The current implementation
    user_registered1 = Event(parameters=['username', 'email', 'is_admin'])

    # Simple, no default values, no **kwargs
    user_registered2 = Event(parameters={'username': str, 'email': str, 'is_admin': bool})

    # Verbose, but has default values and **kwargs
    user_registered3 = Event(parameters=[
        inspect.Parameter('username', kind=inspect.Parameter.KEYWORD_ONLY, annotation=str),
        inspect.Parameter('email', kind=inspect.Parameter.KEYWORD_ONLY, annotation=str),
        inspect.Parameter('is_admin', kind=inspect.Parameter.KEYWORD_ONLY, annotation=bool, default=False),
        # Note: inspect forbids default values on VAR_KEYWORD parameters
        inspect.Parameter('extra_fields', kind=inspect.Parameter.VAR_KEYWORD, annotation=str),
    ])

    # As above, but customised for more concise definitions
    user_registered4 = Event(parameters=[
        Parameter('username', str),
        Parameter('email', str),
        Parameter('is_admin', bool, default=False),
        WildcardParameter('extra_fields', str),
    ])

    # Does not support **kwargs. Parameters can no longer be inline
    user_registered5 = Event(parameters=RegistrationEventParameters)

Where RegistrationEventParameters is:

class RegistrationEventParameters(NamedTuple):
    username: str
    email: str
    is_admin: bool = False

Ultimately providing both options 1 & 4 seems preferable, in lieu of any better option.
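For illustration, option 4's concise classes could be built as thin wrappers over option 3's inspect.Parameter, which also yields an inspect.Signature for validating fired events. This is a sketch; the real Parameter/WildcardParameter classes may differ:

```python
import inspect

# Sketch: concise parameter classes built on inspect.Parameter.
class Parameter(inspect.Parameter):
    def __init__(self, name, annotation=inspect.Parameter.empty,
                 default=inspect.Parameter.empty):
        super().__init__(name, inspect.Parameter.KEYWORD_ONLY,
                         annotation=annotation, default=default)

class WildcardParameter(inspect.Parameter):
    # Note: inspect forbids defaults on VAR_KEYWORD parameters
    def __init__(self, name, annotation=inspect.Parameter.empty):
        super().__init__(name, inspect.Parameter.VAR_KEYWORD, annotation=annotation)

# An inspect.Signature then gives kwargs validation for free when firing:
signature = inspect.Signature([
    Parameter("username", str),
    Parameter("email", str),
    Parameter("is_admin", bool, default=False),
    WildcardParameter("extra_fields", str),
])
bound = signature.bind(username="joe", email="joe@example.com", referrer="ad")
bound.apply_defaults()
```

Binding raises TypeError for missing required parameters, while unexpected keyword arguments are collected into the wildcard parameter.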

@adamcharnock

adamcharnock commented Mar 10, 2018

What should the config system look like?

  • Portable and transferable config
  • Therefore no python files - something like yaml/json would be more suitable
  • Can load config from various sources
    • File
    • URL
    • Redis key (Maybe.)
    • Are these just built in, or do we need a ConfigTransport?
  • Ability to auto-reload process when config changes? Or perhaps that should be provided by a config_changed event on the state API?
  • Ability to edit config in web UI

What different kinds of config are there:

  • Global bus config
    • Plugins
    • Plugin hook timeouts
  • Per API config
    • Transports (+ configs)
  • Transport config
    • Serialisers
    • Connection parameters

Future development:

  • Per API config
    • e.g. different APIs can use different transports, timeouts etc

Scratch pad:

plugins:
  internal_state:
    ping_interval: 60
  
  internal_metrics:
    enable: false

bus:
  schema:
    transport:
      redis: {}

apis:
  default:
    event_transport:
      class: lightbus.RedisTransport
      redis_url: redis://username:password@redis

  auth:
    validate: false
    event_transport:
      class: lightbus.RedisTransport
      redis_url: redis://username:password@another_redis_host
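For illustration, the per-API transport section of the scratch pad could be loaded and validated along these lines. JSON is used here purely to keep the sketch stdlib-only (the YAML form maps to the same structure), and the loader/dataclass names are hypothetical:

```python
import json
from dataclasses import dataclass

# Hypothetical loader for the per-API transport config sketched above.
RAW = """
{
  "apis": {
    "default": {
      "event_transport": {
        "class": "lightbus.RedisTransport",
        "redis_url": "redis://username:password@redis"
      }
    }
  }
}
"""

@dataclass
class EventTransportConfig:
    class_path: str   # dotted path to the transport class
    redis_url: str

def load_api_transports(raw):
    data = json.loads(raw)
    transports = {}
    for api_name, api_config in data.get("apis", {}).items():
        et = api_config["event_transport"]
        transports[api_name] = EventTransportConfig(
            class_path=et["class"],
            redis_url=et["redis_url"],
        )
    return transports

transports = load_api_transports(RAW)
```

A missing key fails loudly with a KeyError, which is roughly the shape the "Config validation" task above would formalise.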

@adamcharnock

Currently working on narrative documentation:

[screenshot: narrative documentation preview, 2018-04-02]

@adamcharnock

adamcharnock commented Jun 25, 2018

A quick review of the various registries we are using:

  • Transport registry (internal to bus client)
  • Plugin registry (global)
  • API registry (global)
  • Listener registry (internal to bus client, mostly for internal use)
  • Schemas (internal to bus client, mostly for internal use)

Registries under consideration:

  • @bus.on_start() and @bus.on_stop() hooks (concerningly similar to the plugin hooks)
  • Config loader registry

@adamcharnock

adamcharnock commented Jul 31, 2018

Recording some notes on lightbus prometheus exporter metrics:

lightbus_listener_lag_events
lightbus_listener_processed_events
lightbus_listener_processed_seconds
lightbus_listener_errors
lightbus_events_total
lightbus_called_rpcs
lightbus_processed_rpcs

labels:

    api_name
    event_name
    service_name
    transport_name

how:

    get_transport_metrics() method on transport
    get_listener_metrics() method on transport
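A stdlib sketch of how such labelled counters might be accumulated before being exposed to Prometheus. LabelledCounter is a stand-in for prometheus_client's Counter, not real Lightbus code:

```python
from collections import defaultdict

# Illustrative stand-in for a Prometheus counter with labels: accumulate
# values per label combination, in a scrape-friendly shape.
class LabelledCounter:
    def __init__(self, name, labelnames):
        self.name = name
        self.labelnames = tuple(labelnames)
        self._values = defaultdict(float)

    def inc(self, amount=1.0, **labels):
        key = tuple(labels.get(label, "") for label in self.labelnames)
        self._values[key] += amount

    def collect(self):
        """Yield (labels, value) pairs, one per label combination."""
        for key, value in self._values.items():
            yield dict(zip(self.labelnames, key)), value

LABELS = ("api_name", "event_name", "service_name", "transport_name")
processed_events = LabelledCounter("lightbus_listener_processed_events", LABELS)
processed_events.inc(api_name="auth", event_name="user_registered",
                     service_name="emailer", transport_name="redis")
```

The hypothetical get_transport_metrics()/get_listener_metrics() methods mentioned above would presumably return data in this kind of shape for the exporter to serve.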

@adamcharnock

Note to self: I discuss versions and migrations on pages 199-200 of notebook 1.

@adamcharnock

Resolving CannotBlockHere errors

There has been an ongoing issue with the Lightbus API which looks something like this:

  • Developer has a synchronous on_start handler
  • Lightbus starts up, starts its event loop, and calls the synchronous on_start handler
  • The on_start handler calls something like bus.my.event.listen(...)
  • This is a thin wrapper which calls block(bus.my.event.listen_async(...))
  • The block() utility tries to spin up an event loop in order to run the listen_async coroutine
  • block() explodes with a CannotBlockHere exception because it is already running within an event loop, and you cannot nest event loops.

The solution to date has been that one must write async handlers (e.g. async def on_start()) if one wishes to do anything which interacts with the bus (which is needed to do pretty much anything useful). This is not desirable, as it forces async code on developers using Lightbus rather than keeping it optional.
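The failing path can be condensed into a minimal sketch (names here are illustrative, not the Lightbus source):

```python
import asyncio

# Condensed, illustrative sketch of the block() utility and its failure mode.
class CannotBlockHere(Exception):
    pass

def block(coroutine):
    """Run a coroutine to completion from synchronous code."""
    try:
        loop = asyncio.get_running_loop()
    except RuntimeError:
        loop = None
    if loop is not None:
        coroutine.close()  # suppress the 'coroutine was never awaited' warning
        raise CannotBlockHere("Already inside an event loop; loops cannot be nested")
    return asyncio.run(coroutine)

async def listen():
    return "subscribed"

result = block(listen())  # fine at the top level: no loop is running yet
```

Call block() from inside a coroutine already running on a loop (as a synchronous on_start handler effectively does) and it raises CannotBlockHere.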

The solution

The general solution is to run these hooks in an asyncio thread executor. Event loops are per-thread, so the hook is free to create its own loop, as it now has its own thread.

There is a problem with this solution, however: Lightbus has not been designed with threading in mind, and so is not thread-safe. Lightbus has been designed using asyncio, which puts Lightbus in control of context switches (i.e. at any await statement). Threads, on the other hand, can context switch at any point.

Could Lightbus become thread safe? Most likely yes. However, I prefer to avoid this at this stage because 1) threading bugs are hard to track down, 2) I'm not particularly knowledgeable in this area, and 3) I would like Lightbus to stabilise more before embarking on this.

There is an alternative, however. We are not using threads because we need threads; we are using them to provide a clean environment for user code (i.e. an environment in which an event loop can be started). We therefore came to the following solution:

  1. Only one asyncio task can be awake at once (across all threads). A global lock must be acquired in order to wake up a task. This prevents arbitrary context switching within tasks.
  2. The BusClient handles all bus interactions within the main thread when running as a worker. This ensures that the BusClient is only exposed to a single event loop.
  3. What about when not running as a worker? Let's do a test with gunicorn.
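The thread-executor idea above, in miniature: the synchronous hook runs in its own thread, so it can safely start its own event loop. All names here are illustrative, not the merged implementation:

```python
import asyncio

# Illustrative sketch: run a synchronous hook in a thread executor so it
# can start its own event loop without nesting.
async def register_listener():
    return "listener registered"

def sync_on_start():
    # Safe to block here: this runs in an executor thread with no running loop.
    return asyncio.run(register_listener())

async def worker_main():
    loop = asyncio.get_running_loop()
    # None selects the default ThreadPoolExecutor
    return await loop.run_in_executor(None, sync_on_start)

result = asyncio.run(worker_main())  # "listener registered"
```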

@adamcharnock

The above solution has now been implemented and merged. It appears to be working well.

adamcharnock changed the title from 'Create working proof-of-concept' to 'Create initial release' on Oct 29, 2019
@adamcharnock

All development tasks are now complete. All that's left are the 'Pre launch tasks', which relate mostly to some additional documentation sections.

@adamcharnock

Initial release is out! This epic issue can finally be closed, with all tasks complete 🎉
