Closed
Description
Create a design document to outline an approach for implementing the WorkerHeartbeat functionality. This should include the protobuf design and host implementation details.
WorkerHeartbeat will help us improve reliability: the host can use this request to determine the state of the worker, and in turn use that information to decide when to start or stop a worker process. The worker should send a heartbeat message roughly every 10 seconds so that the host can assess worker health when pings do not arrive as expected. As of now, the host relies on worker_init_response; the worker heartbeat can be extended to include a richer set of information.
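To make the health check concrete, here is a minimal Python sketch of how a host might track heartbeats and flag a worker whose pings have stopped. It is an illustration only, not the host's actual implementation: the `HeartbeatMonitor` class, its method names, and the `MISSED_BEATS_ALLOWED` threshold are all hypothetical, and only the ~10 second interval comes from the description above.

```python
import time
from typing import Callable, Dict, List

HEARTBEAT_INTERVAL_SEC = 10.0  # from the issue: roughly every 10 seconds
MISSED_BEATS_ALLOWED = 3       # assumption: tolerate a few late or lost pings


class HeartbeatMonitor:
    """Hypothetical host-side tracker of the last heartbeat seen per worker."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        # The clock is injectable so the timeout logic can be tested
        # without actually sleeping.
        self._clock = clock
        self._last_seen: Dict[str, float] = {}

    def record_heartbeat(self, worker_id: str) -> None:
        """Call whenever a heartbeat message arrives from a worker."""
        self._last_seen[worker_id] = self._clock()

    def is_healthy(self, worker_id: str) -> bool:
        """A worker is healthy if its last heartbeat is recent enough."""
        last = self._last_seen.get(worker_id)
        if last is None:
            return False  # never heard from this worker
        allowed_gap = HEARTBEAT_INTERVAL_SEC * MISSED_BEATS_ALLOWED
        return self._clock() - last <= allowed_gap

    def unhealthy_workers(self) -> List[str]:
        """Known workers whose heartbeats have stopped arriving in time."""
        return sorted(w for w in self._last_seen if not self.is_healthy(w))
```

A host loop could call `unhealthy_workers()` periodically and restart any process it returns. Allowing several missed beats before acting is one possible design choice to avoid restarting a worker over a single delayed ping; the right threshold would be for the design document to decide.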
Our protobuf currently has an empty definition for the WorkerHeartbeat here: