Comprehensively retry poll/register even on unforeseen errors #563

gnprice · 2024-03-12T18:52:50Z

This is a followup to:

Hopefully with #556 (fixed by #561) we're done with needing to add try/catch blocks and retries for conditions that can happen for purely operational reasons like the device going to sleep. But there might be another such condition we've missed; and there's also always the possibility of bugs in our code. In particular the event-poll loop calls PerAccountStore.handleEvent, which can reach quite a lot of our model code because of the variety of different events, so it's probable that at some point we'll have a bug that causes that to throw an exception.

For the bulk of our code, if a bug causes an exception, then the best thing to do is to let it propagate out of our code uncaught — it'll mean that that widget doesn't get built (causing a gray rectangle in release builds), or that user gesture doesn't get acted on, but the user can pick up from there and try again. But the event loop, or UpdateMachine, is different: it's the one spot in our code where we have a persistent thread of control flow of our own. It needs to keep itself going no matter what happens, because nothing else will resume it for us.

I have a rough draft for this, which I wrote while working on #556/#561. I put it aside in order to get #561 out the door promptly, but I'll plan to pick it up later. The draft also comes with some user feedback, a form of #555.

Related issues, some of which might end up getting handled at the same time as this one:

The text was updated successfully, but these errors were encountered:

PIG208 · 2024-08-05T21:24:44Z

The referenced draft seems to be https://github.com/gnprice/zulip-flutter/tree/dev-retry. I can pick this up to work on #555.

gnprice · 2024-08-16T00:49:19Z

This came up again at #809 (comment) (though that happened to be a false alarm). As I wrote there, restating the main substance of this issue but more briefly:

When we throw an exception from within PerAccountStore.handleEvent, that represents a bug but we can and should recover from it:

we should show the user an error dialog (for the beta, i.e. Show detailed poll-failure feedback, in beta #555),
and then replace the store the same way we do when the event queue expires (i.e. this issue Comprehensively retry poll/register even on unforeseen errors #563).

…ackoff This fixes zulip#563. We'll follow up with a few more commits that give more-informative errors in some cases, or change the handling of others from retrying getEvents to reloading the data from scratch. The if-disposed and store.isLoading lines have no effect in the case of a BAD_EVENT_QUEUE_ID error, because those can only come from the getEvents request, and then the inner catch block will have already taken the same steps. Fixes: zulip#563

gnprice added the a-api Implementing specific parts of the Zulip server API label Mar 12, 2024

gnprice added this to the Beta 2 milestone Mar 12, 2024

gnprice self-assigned this Mar 12, 2024

github-project-automation bot added this to Flutter app Mar 12, 2024

gnprice modified the milestones: B2: pre-summer, B2: Summer 2024 May 9, 2024

gnprice mentioned this issue Jul 13, 2024

Slow sometimes to replace event queue after expiry #809

Open

gnprice mentioned this issue Nov 16, 2024

Comprehensively retry event-poll failures #1063

Merged

gnprice closed this as completed in 2595cb0 Nov 20, 2024

github-project-automation bot moved this to Done in Flutter app Nov 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comprehensively retry poll/register even on unforeseen errors #563

Comprehensively retry poll/register even on unforeseen errors #563

gnprice commented Mar 12, 2024

PIG208 commented Aug 5, 2024

gnprice commented Aug 16, 2024

Comprehensively retry poll/register even on unforeseen errors #563

Comprehensively retry poll/register even on unforeseen errors #563

Comments

gnprice commented Mar 12, 2024

PIG208 commented Aug 5, 2024

gnprice commented Aug 16, 2024