Skip to content

Spec out the async-ipc protocol #36

@goodboy

Description

@goodboy

tractor utilizes a simple multiplexed protocol for conducting inter-process-task-communication (IPTC)?

Each per-process trio task can invoke tasks in other processes and received responses depending on the type of the remote callable. All packets are encoded as msgpack serialized dictionaries which I'll refer to as messages.

How it works

When an actor wants to invoke a remote routine it sends a cmd packet:
{'cmd': (ns, func, kwargs, uid, cid)} of type Dict[str, Tuple[str, str, Dict[str, Any], Tuple[str, str], str]]
Where:

  • ns is the remote module name
  • func is the remote function name
  • kwargs is a dict of keyword arguments to call the function with
  • uid is the unique id of the calling actor
  • cid is the unique id of the call by a specific task

The first response is a function type notifier msg:
{'functype': functype, 'cid': cid} of type Dict[str, str].
Where functype can take one of:

  • 'asyncfunc' for an asynchronous function
  • 'asyncgen for a single direction stream either implemented using an asyn generator function or a @stream decorated async func
  • 'context' for a inter actor, task linked, context. For now see Bidir streaming #209.

Depending on the value of functype then the following message(s) are sent back to the caller:

  • 'asyncfunc':
    • a single packet with the remote routine's result {'return', result, 'cid', cid} of type Dict[str, Any]
  • 'asyncgen':
    • a stream of packets with the remote async generator's sequence of results {'yield', value, 'cid', cid} of type Dict[str, Any].
  • 'context':
    • a single 'started' message containing a first value returned from the Context.started() call in the remote task followed by a possible stream of {'yield', value, 'cid', cid} messages if a bidir stream is opened on each side. Again see Bidir streaming #209.

A remote task which is streaming over a channel can indicate completion using a ''stop'` message:

If a remote task errors it should capture it's error output (still working on what output) and send it in a message back to its caller:

  • {'error': {tb_str': traceback.format_exc(), 'type_str': type(exc).__name__,}}

A remote actor must have a system in place to cancel tasks spawned from the caller. The system to do this should be invoke-able using the existing protocol defined above and thus no extra "cancel" message should be required (I think).

  • an example is when a local caller cancels its currently consuming stream by calling a Actor._cancel_task() routine in the remote actor. This routine should have knowledge of the rpc system and be capable of looking up the caller's task-id to conduct cancellation.
  • any remote task should should in theory be cancel-able in this way but there is not yet a "cross-actor cancel scope" system in place for generic tasks (this is maybe a todo)

What should be done

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions