Enable autotrimming #11
Conversation
testing this on prod and it's working like a charm
Awesome. Wonder if we should gel the domain language around trimming vs pruning?
@djmb Could you have a look?
I've renamed everything from prune to trim. Is that what you mean?
Ah great. Yes 👍
@npezza93 what do you think about triggering the trimming according to send activity, rather than unsubscribes? We could trigger a trim every n messages (by keeping a counter, or just using a random check on each write that's weighted according to that n, which averages out to the same thing). That way the trimming workload would be balanced with the write workload, rather than being dependent on how often clients unsubscribe. I think that's a better match for the work trimming has to do: the more messages you send, the more of them you'll have to trim.
I'd recommend a random check rather than a counter: you don't need to store the counter state, and you avoid a thundering herd from a bunch of processes booted together.
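A minimal sketch of that random check, assuming a hypothetical `TRIM_CHANCE` constant and `trim_later` hook on the write path (illustrative names, not the actual Solid Cable API):

```ruby
# Roughly one in TRIM_CHANCE broadcasts enqueues a trim, so trimming
# work scales with write volume and needs no shared counter state.
TRIM_CHANCE = 100

def broadcast(channel, payload)
  ::SolidCable::Message.create!(channel: channel, payload: payload)

  # rand < 1.0 / TRIM_CHANCE is true about once per TRIM_CHANCE calls,
  # which averages out to "trim every n messages" without a counter.
  trim_later if rand < 1.0 / TRIM_CHANCE
end
```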
```diff
 def perform
-  ::SolidCable::Message.prunable.delete_all
+  ::SolidCable::Message.trimmable.delete_all
```
While this should work pretty well with SQLite, I have some worries about how this would behave on MySQL or PostgreSQL.
It's deleting an unbounded number of messages, so it could hold locks for a fair amount of time. If the database is being replicated, it could also cause replication lag as the deletes are processed.
There could also be locking issues between concurrent jobs attempting to run the same query.
The approach solid_cache takes is to delete small amounts of data but do it often.
- Every N / 2 writes we trigger an expiration task (a job or just in a thread).
- The task will try to expire up to N records.
We expire up to N records but trigger the expiration after N/2 inserts, so there's downward pressure on the cache size when it's too large. But we don't try to clear everything out at once, as that could be millions and millions of records. A rough sketch of the counter-based variant follows this list.
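For illustration, this is roughly what that could look like (the in-process counter and `expire_later` job are assumptions, not Solid Cache's actual code):

```ruby
EXPIRY_BATCH_SIZE = 100 # "N" above: expire up to N records per task

def track_writes(count)
  # Trigger expiry every N/2 writes. Expiring N records while
  # triggering at N/2 keeps downward pressure on the table size
  # without trying to delete everything in one pass.
  @writes ||= 0
  @writes += count
  if @writes >= EXPIRY_BATCH_SIZE / 2
    @writes = 0
    expire_later(limit: EXPIRY_BATCH_SIZE)
  end
end
```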
Solid Cache then has a slightly complicated process for deleting records in a concurrency-safe manner, but I think here we could just rely on `SKIP LOCKED` instead. That requires at least MySQL 8.0, but Solid Queue already requires that, so I don't think it would be an issue for Solid Cable to do the same.
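A bounded, concurrency-friendly trim using `SKIP LOCKED` could look roughly like this (the batch size is an assumption; `trimmable` is the scope from the diff above):

```ruby
def trim_batch(batch_size = 100)
  ::SolidCable::Message.transaction do
    # Lock a bounded batch of trimmable rows, skipping rows another
    # job has already locked, so concurrent trims don't block each other.
    ids = ::SolidCable::Message.trimmable
                               .limit(batch_size)
                               .lock("FOR UPDATE SKIP LOCKED")
                               .pluck(:id)

    ::SolidCable::Message.where(id: ids).delete_all
  end
end
```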
Put up #15 which should address this. Let me know what you think!
Fixes #10