Skip to content

PubSub #1202

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
mrocklin opened this issue Jun 23, 2017 · 3 comments
Closed

PubSub #1202

mrocklin opened this issue Jun 23, 2017 · 3 comments

Comments

@mrocklin
Copy link
Member

With Queue and Variable we now have two mechanisms to share small data or futures (pointing to actual data) between clients. Both of these systems route data through the central scheduler.

We might consider adding Pub and Sub to this family where all entries sent into a Pub arrive at all currently subscribed Subs on the same topic. This is somewhat different from the current Queue case, in which all elements are consumed by only one subscriber.

Additionally, we may want to send data directly between workers, rather than mediating through the scheduler. This would allow us to send large numeric datasets on PubSub systems with a single network hop and without saturating the scheduler bandwidth. This would be particularly useful for some advanced machine learning workloads.

As a first implementation the scheduler would track all clients currently subscribing to publishing on a particular topic. It would inform all clients whenever a new client subscribed or published. Clients would send data directly between each other using their attached workers' servers. This is brittle in a few ways, particularly when a single client is expected to broadcast large amounts of data to many workers, but we might be able to solve that in the future if it arises.

So to be clear there are two topics here:

  1. A pub-sub model of inter-client communication
  2. Direct worker/client-worker/client communication, bypassing the scheduler

cc @MLnick

@mrocklin
Copy link
Member Author

also cc @minrk

@stsievert
Copy link
Member

stsievert commented Jun 5, 2018

I think this issue is closed by #1999, and there's a similar issue at dask/dask-ml#171

@mrocklin mrocklin closed this as completed Jun 5, 2018
@mrocklin
Copy link
Member Author

mrocklin commented Jun 5, 2018

True, Thanks. Closed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants