Discussion: Blocking operations #1961

Open · NickCraver opened this issue Jan 24, 2022 · 5 comments

@NickCraver
Collaborator

We've gotten asks for this in things like #1474, #1593, and #1675.

The general ask is support for blocking operations like WAIT, but also others like XREAD. The general problem is that on a multiplexed connection, every command queued behind the blocking command waits on it for an indefinite time period, so this is a very easy footgun to wield.

Proposals

One idea has been something like an IBlockingDatabase or similar, where a user would have to explicitly cast to an interface to use it, but the more I think about it, maybe that's not the correct abstraction. For one thing: these operations are per-client, per-database, per-stream, and so on - so there would be multiple interfaces. But overall, the effect is per connection (or, specifically in our internal nomenclature, per bridge).

What if instead this were an option on ConfigurationOptions? We could have one bool there that allows that multiplexer to perform blocking operations. Any attempt to call a blocking operation without it enabled would throw an informative error. As these would be new commands or overloads not in play today, there wouldn't be any API break. The error message could point to our GitHub Pages help explaining what this means and how to be careful.
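
For shape only, a minimal sketch of what the opt-in could look like; AllowBlockingCommands is hypothetical and nothing below exists in the library today:

// Hypothetical: the AllowBlockingCommands flag is the proposed addition, not current API.
var options = ConfigurationOptions.Parse("localhost:6379");
options.AllowBlockingCommands = true; // "yes, I accept the disclaimer"
var muxer = await ConnectionMultiplexer.ConnectAsync(options);
var db = muxer.GetDatabase();

// Without the flag, a blocking overload would throw something along these lines:
// InvalidOperationException: "Blocking commands are not enabled on this multiplexer;
//   set AllowBlockingCommands=true - see the docs for what this means for head-of-line blocking."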

I have worries about connection pooling with respect to ordering, but I believe these use cases are completely orthogonal to ordering, so if we get pooling, this same options approach could be used to spin up n bridges or whatever approach we take there. Today we wouldn't want to do that by default because many users have connection limits and don't want the subscriber connection either (which isn't something we can disable today).

I'm curious what others think of this: basically, an option that says "yes, I accept the disclaimer" to allow blocking commands?

Downstream problems

Of note, though: there is a downstream delineation of impact here because we're on sockets that have TTLs. For something like XREAD we could have an IAsyncEnumerable API with stream results streaming out, but internally we could get X items every N ms (e.g. XREAD COUNT 50 BLOCK 200), only block for that long, then try again in a while loop - basically asynchronously blocking that multiplexer. This is fairly straightforward if the N ms is under 1,000.
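
As a sketch only - the method name and result parsing here are mine, not library API, and each short XREAD ... BLOCK 200 call still pauses this multiplexer's pipe for up to 200 ms - the loop could look roughly like this using today's ExecuteAsync escape hatch:

using System.Collections.Generic;
using System.Runtime.CompilerServices;
using System.Threading;
using StackExchange.Redis;

// Illustrative only: emulate a long-running stream read with short blocking reads in a loop.
static async IAsyncEnumerable<RedisResult> ReadStreamAsync(
    IDatabase db,
    string stream,
    [EnumeratorCancellation] CancellationToken cancellationToken = default)
{
    var lastId = "$"; // start from "new entries only"
    while (!cancellationToken.IsCancellationRequested)
    {
        // XREAD COUNT 50 BLOCK 200 STREAMS <stream> <lastId>
        // Each call blocks the connection for at most 200 ms, staying under the heartbeat window.
        var reply = await db.ExecuteAsync(
            "XREAD", "COUNT", "50", "BLOCK", "200", "STREAMS", stream, lastId);

        if (reply.IsNull) continue; // nothing arrived in this window; go block again

        var streams = (RedisResult[])reply;               // one element per stream requested
        var streamAndEntries = (RedisResult[])streams[0]; // [stream name, entries]
        foreach (RedisResult entry in (RedisResult[])streamAndEntries[1])
        {
            var idAndFields = (RedisResult[])entry;       // [entry id, [field, value, ...]]
            lastId = (string)idAndFields[0];              // resume after what we've yielded
            yield return entry;
        }
    }
}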

Beyond that 1,000 ms, we run into pile-up problems with the heartbeats that keep the connection alive. The problem set splits into < 1,000 ms and > 1,000 ms, with the latter being "up to indefinite". Indefinite is a problem I don't have a solution for, because we know that many environments, including paid hosting, will drop the socket if no data is sent for too long a period. System socket TTLs are not something we can readily work around, to my knowledge. We could have the client send noise to the Redis server that piles up for execution behind the WAIT, but nothing is ensured to come back within any defined time period, as the timeout is indefinite and the pile-up itself is also unbounded.

We could back off if we know the pending/current command is a block (with a flag?) and only send a KeepAlive once a minute or so to lower the load, but again, that only solves socket death in one direction. For limited blocks that take a millisecond duration specifier, like XREAD, this isn't something we have to solve in order to unlock them. Even without connection pooling or unbounded numbers of bridges, users could create a dedicated multiplexer with this option to read a stream using the proposal above. There could be 10 other problems with this; I just wanted to get the discussion into a single issue!

cc @mgravell @philon-msft @davidfowl

@JesperTreetop

(Rando alert: Very sporadic StackExchange.Redis user who has been watching Redis Streams with interest.)

It seems to me that the safest and easiest-to-explain thing is to put all the troublesome operations that would ruin/block an entire connection in a completely different part of the API space. So, something like:

var db = /* IDatabase */;

// fork off an equivalent, new connection to the same place
var blocking = db.GetBlockingOperations(); 

await foreach (var xReadItems in blocking.XRead(...)) {
    // ...
}

If what you want is blocking XREAD and you're handling it in the intended way, you're already resigned to spinning up a dedicated connection for it. There's no reason the library shouldn't do that in the best way it knows how, and fence that stuff off. Potentially, it could also have some sort of collision detection - "hey, you started a new blocking XRead when one is already going and hasn't ended or been cancelled" - which would be extremely surprising in the regular API, but would guide you towards the pit of success for the blocking use case.

@mgravell
Collaborator

Not wanting to derail, but I'd also invite thoughts on a secondary use-case here.

To implement blocking operation support, we'd (as alluded to) need to implement a pool under the bonnet; this also enables two other performance-related improvements. At the moment I'm seeing a lot of evidence of head-of-line blocking due to the single reader, especially in non-cluster scenarios (since all the load is effectively forced through one reader). If we have multiple connections, we also gain the ability to shard traffic between multiple backends.

To avoid semantic changes (in particular for non-awaited operations), we could effectively have multiple calls to GetDatabase() return tokens that default to different underlying connections (but as a weak binding - if one becomes unavailable: meh). Or, if we accept the semantic change (re unawaited operations), we could allow different operations on the same IDatabase to go to different underlying connections, which gives much richer load sharding, but with a huge caveat along the lines of "if you don't await operations (or use a batch/transaction), order is not guaranteed".
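
Purely as an illustration of the "weak binding" flavour (not a proposal for the actual API), something of roughly that shape can already be approximated by round-robining GetDatabase() across several multiplexers; pool sizing, subscriber connections and failure handling are all hand-waved here:

using System.Threading;
using System.Threading.Tasks;
using StackExchange.Redis;

// Illustrative sketch: multiple GetDatabase() calls bound to different underlying connections.
public sealed class MultiplexerPool : IAsyncDisposable
{
    private readonly ConnectionMultiplexer[] _connections;
    private int _next = -1;

    private MultiplexerPool(ConnectionMultiplexer[] connections) => _connections = connections;

    public static async Task<MultiplexerPool> ConnectAsync(string configuration, int size)
    {
        var connections = new ConnectionMultiplexer[size];
        for (var i = 0; i < size; i++)
        {
            connections[i] = await ConnectionMultiplexer.ConnectAsync(configuration);
        }
        return new MultiplexerPool(connections);
    }

    // Weak binding: each call hands back a database that defaults to a different connection.
    public IDatabase GetDatabase(int db = -1)
    {
        var index = (uint)Interlocked.Increment(ref _next) % (uint)_connections.Length;
        return _connections[index].GetDatabase(db);
    }

    public async ValueTask DisposeAsync()
    {
        foreach (var connection in _connections)
        {
            await connection.CloseAsync();
        }
    }
}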

@wj8400684

Is it supported now?

@mgravell
Collaborator

mgravell commented Mar 27, 2023 via email

@josh-endries

Dang, just ran into this. Off to find another lib. :(
