
Conversation

@rnewson (Member) commented Sep 15, 2025

Overview

Add a plugin to automatically purge deleted documents after a configurable interval has elapsed. The interval should be set high enough that all external consumers of the changes feed will have seen and processed the deleted documents. The plugin uses CouchDB's purge facility, which ensures internal replication and indexes have processed the deletions before the deleted document is purged.
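Not part of the PR, just a sketch for orientation: assuming the section and option names from the docs hunk reviewed below, the plugin's interval lookup could look roughly like this. The module and function names, the seconds unit, and the disabled-when-unset behaviour are all assumptions, not the PR's actual code.

```erlang
-module(auto_purge_sketch).
-export([deleted_document_ttl/0]).

%% Sketch only: read the configured TTL for the auto purge plugin.
%% The section and option names come from the docs change in this PR;
%% treating the value as seconds and unset-as-disabled are assumptions.
deleted_document_ttl() ->
    case config:get("couch_auto_purge_plugin", "deleted_document_ttl") of
        undefined -> disabled;
        Value -> list_to_integer(Value)
    end.
```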

Testing recommendations

Covered by eunit tests.

Related Issues or Pull Requests

N/A

Checklist

  • Code is written and works correctly
  • Changes are covered by tests
  • Any new configurable parameters are documented in rel/overlay/etc/default.ini
  • Documentation changes were made in the src/docs folder
  • Documentation changes were backported (separated PR) to affected branches

@rnewson marked this pull request as draft September 15, 2025 11:13
@rnewson (Member, Author) commented Sep 15, 2025

Draft as I still need to do a better database-level config for the override.


```rst
.. config:section:: couch_auto_purge_plugin :: Configure the Auto Purge plugin
.. config:option:: deleted_document_ttl
```
Contributor commented:
I like the name. Previously when discussing this we used the term tombstones, which, while descriptive, would have been a new term for users to learn.

In the future this would also make it easy to add a deleted_conflict_ttl, which could be handled by the same purge mechanism.

@rnewson (Member, Author) replied:

Yes, I like the new name, and it leaves open the possibility of a document_ttl too (for non-deleted documents).

@nickva (Contributor) commented Sep 15, 2025

This looks pretty compact!

> Draft as I still need to do a better database-level config for the override.

I'll look into adding a way to set and get the database-level config. Hopefully as properties in the shard docs. We have some precedent in resharding when we update the shard map:

```erlang
update_shard_map(#job{source = Source, target = Target} = Job) ->
```

@rnewson force-pushed the auto-delete-tseq branch 3 times, most recently from 13ce28b to aeb78a7 on September 17, 2025 13:29
@rnewson (Member, Author) commented Sep 18, 2025

I have added a commit that does the get/set of a database-level override in the _dbs document. I've placed it outside of the "props" object for now, but that's open for discussion. I also deliberately don't add this property to the #shard{} records that mem3 would return, as I'm trying to establish the database-level property as a single value.
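For orientation only, a hypothetical sketch of how such a _dbs shard-map document might look with the override kept outside the "props" object. The key name, its placement, and all values here are illustrative guesses, not necessarily what the commit actually writes.

```erlang
-module(dbs_doc_sketch).
-export([example_dbs_doc/0]).

%% Hypothetical _dbs shard-map document (EJSON) with the database-level
%% override stored outside the "props" object, as described in the comment
%% above. Field values and the override's key name are illustrative only.
example_dbs_doc() ->
    {[
        {<<"_id">>, <<"mydb">>},
        {<<"by_node">>, {[{<<"node1@127.0.0.1">>, [<<"00000000-ffffffff">>]}]}},
        {<<"by_range">>, {[{<<"00000000-ffffffff">>, [<<"node1@127.0.0.1">>]}]}},
        {<<"props">>, {[]}},
        {<<"deleted_document_ttl">>, 86400}
    ]}.
```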

@rnewson marked this pull request as ready for review September 18, 2025 09:11
@rnewson (Member, Author) commented Sep 18, 2025

Noting that get deliberately reads only the local, unsharded _dbs database for its answer, while set tries to ensure that the same node in the cluster (the lowest live node) performs the updates, to avoid conflicts. Any update to _dbs is replicated in a ring to all nodes. I return a 202 status code as a hint that the write was made but is not yet redundantly stored.
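A minimal sketch of the "lowest live node" idea described above, assuming mem3:nodes/0 as the source of cluster membership; the module and helper names are invented for illustration and are not the PR's code.

```erlang
-module(dbs_coordinator_sketch).
-export([lowest_live_node/0]).

%% Sketch: route all _dbs override writes through one deterministic
%% coordinator so that concurrent updates don't create conflicting revisions.
%% mem3:nodes/0 lists the configured cluster nodes; intersecting with the
%% currently connected nodes (plus this node) keeps only live ones.
lowest_live_node() ->
    Live = [node() | nodes()],
    hd(lists:sort([N || N <- mem3:nodes(), lists:member(N, Live)])).
```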

@rnewson force-pushed the auto-delete-tseq branch 2 times, most recently from b1172e3 to 60ddb11 on September 18, 2025 13:24
@nickva (Contributor) commented Sep 18, 2025

Yeah, something like get/set can work, and I agree: I'm not a fan of how props is spread over all the #shard{} copies in the shards cache; ideally it should be something like dbname -> props as a new mem3 cache (an ets table; see the sketch below). But it would be good, I think, to have a general, well-working props mechanism, and maybe, as we discussed in the CouchDB meeting, even use security for it and get a nice optimization boost from not having to deal with get_db any longer.
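A rough sketch of that dbname -> props cache idea; the module, table, and function names are invented for illustration, and nothing like this exists in the PR.

```erlang
-module(mem3_props_cache_sketch).
-export([init_props_cache/0, lookup_props/1, cache_props/2]).

%% Invented-for-illustration sketch of a per-database props cache: one ETS
%% row per database name rather than copying props into every #shard{} record.
init_props_cache() ->
    ets:new(mem3_db_props, [named_table, set, public, {read_concurrency, true}]).

lookup_props(DbName) ->
    case ets:lookup(mem3_db_props, DbName) of
        [{_DbName, Props}] -> {ok, Props};
        [] -> not_found
    end.

cache_props(DbName, Props) ->
    true = ets:insert(mem3_db_props, {DbName, Props}),
    ok.
```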

Props also has the extra benefit that it will automatically show up in the dbs_info result, so users can inspect the flags.

So far I have been trying to extract the "update shard map" bit from mem3_reshard and make it a general utility. It's got some resharding-specific bits in there, and perhaps some extra belt-and-suspenders steps, for instance:

  • before making the change, the leader (first live node) pulls changes from the other nodes
  • after the change, it force-pushes the changes to all the nodes
  • there is a wait-to-propagate step where we wait for the change to take effect on the other nodes

Some of these are there to protect against creating conflicts or to handle the case where the ring may have just broken, but maybe some of it is overkill, too.
