Breaking change: Improve save() performance by skipping index creation #2702

ShaneHarvey · 2022-10-27T19:58:25Z

Indexes are now only created when a Model is first used (e.g. on the first call to save() or to _get_collection()), or on every save() when the new meta["auto_create_index_on_save"] option is set to True. This is a minor breaking change for some applications. As a workaround, apps can explicitly call ensure_indexes() or set meta["auto_create_index_on_save"] to True.

Note that with the default settings (auto_create_index=True + auto_create_index_on_save=False) indexes are still created on a Document's first use, or on its first use after Document.drop_collection().
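To make the policy above concrete, here is a small stdlib-only toy model. It is not MongoEngine's implementation: `ToyModel`, `ToyCollection`, and `count_index_calls` are hypothetical names invented for illustration, and the "collection" only counts how often index creation is requested. The option names mirror the meta settings described in this PR.

```python
class ToyCollection:
    """Stand-in for a MongoDB collection; it only counts index-creation calls."""

    def __init__(self):
        self.ensure_index_calls = 0
        self.docs = []


class ToyModel:
    """Hypothetical model mimicking the index-creation policy described above."""

    def __init__(self, auto_create_index=True, auto_create_index_on_save=False):
        self.auto_create_index = auto_create_index
        self.auto_create_index_on_save = auto_create_index_on_save
        self.backend = ToyCollection()
        self._collection = None  # stays None until the model is first used

    def ensure_indexes(self):
        # In a real driver this would be a round trip to the server.
        self.backend.ensure_index_calls += 1

    def _get_collection(self):
        # First use (or first use after drop_collection): create indexes once.
        if self._collection is None:
            self._collection = self.backend
            if self.auto_create_index:
                self.ensure_indexes()
        return self._collection

    def save(self, doc):
        collection = self._get_collection()
        # Opt-in escape hatch: request index creation on every save.
        if self.auto_create_index_on_save:
            self.ensure_indexes()
        collection.docs.append(doc)

    def drop_collection(self):
        # Forget the cached collection so the next use recreates indexes.
        self.backend.docs.clear()
        self._collection = None


def count_index_calls(n_saves, **options):
    """Save n_saves toy documents and return the number of index-creation calls."""
    model = ToyModel(**options)
    for i in range(n_saves):
        model.save({"n": i})
    return model.backend.ensure_index_calls


if __name__ == "__main__":
    print("default settings:", count_index_calls(100))
    print("on-save enabled: ", count_index_calls(100, auto_create_index_on_save=True))
```

In this sketch the default settings request index creation once no matter how many documents are saved, while enabling the on-save option adds a request for every save. That per-save round trip is the cost the benchmarks in this PR measure.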

I hope this PR can finally resolve #1446. I reviewed that issue, the previous attempts to fix it (#1457 and #1511), and the original issue that added this behavior (#812). I believe the solution in this PR is a good compromise: it gives better default behavior while still offering an escape hatch for the old behavior (via auto_create_index_on_save).

Here are the benchmark results:

$ python benchmarks/test_save_with_indexes.py
--------------------------------------------------------------------------------
Save 10000 documents with 0 indexes.
2.8389482499987935s
--------------------------------------------------------------------------------
Save 10000 documents with 1 index.
2.782498458000191s
--------------------------------------------------------------------------------
Save 10000 documents with 2 indexes.
2.7451970830006758s
--------------------------------------------------------------------------------
Save 10000 documents with 1 index (auto_create_index_on_save=True).
4.725924582999141s
--------------------------------------------------------------------------------
Save 10000 documents with 2 indexes (auto_create_index_on_save=True).
4.777219208997849s

4.72/2.78 ≈ 1.7, so this is a significant 1.7x speedup for save() on a document with one field and one index.
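The ratio can be rechecked from the raw timings printed above. This is a small sketch: the numbers are copied from the benchmark output in this PR, while the `timings` dictionary and the `speedup` helper are hypothetical names used only for this calculation.

```python
# Wall-clock seconds for saving 10000 documents, copied from the output above.
# Keys are (number of indexes, auto_create_index_on_save).
timings = {
    (1, False): 2.782498458000191,
    (2, False): 2.7451970830006758,
    (1, True): 4.725924582999141,
    (2, True): 4.777219208997849,
}


def speedup(n_indexes):
    """Ratio of the old per-save behavior to the new default for n_indexes."""
    return timings[(n_indexes, True)] / timings[(n_indexes, False)]


if __name__ == "__main__":
    for n in (1, 2):
        print(f"{n} index(es): {speedup(n):.2f}x faster without index creation on save")
```

The two-index case gives a slightly larger ratio than the one-index case, though a single run per configuration is too little data to read much into that difference.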

Note: I've made some relatively minor changes to the other benchmarks as well, mainly to explicitly use a w=1 write concern. This change significantly improves the runtime (and the effectiveness) of the benchmarks; otherwise they end up using the server's default writeConcern, which is now w:majority. I can move this to a new PR if desired.


ShaneHarvey · 2022-10-27T20:04:23Z

The tests pass locally. @bagerard, this is ready for your consideration. Could you approve the GitHub workflow?

stlucasgarcia

I'm hoping this will be merged soon 🙏🏻

bagerard · 2022-12-30T20:16:29Z

Thanks for resolving this @ShaneHarvey, much appreciated!

ShaneHarvey · 2023-01-10T21:00:18Z

Glad to see this merged, thanks @bagerard!!

stlucasgarcia approved these changes Dec 20, 2022


bagerard mentioned this pull request Dec 29, 2022

[Clone] Breaking change: Improve save() performance by skipping index creation #2719

Merged

bagerard merged commit 976502c into MongoEngine:master Dec 30, 2022
