✨(worker) switch from Celery to Dramatiq #390

sylvinus · 2025-10-20T23:50:42Z

12 years later, everything old is new again: https://www.slideshare.net/slideshow/why-and-how-pricing-assistant-migrated-from-celery-to-rq-parispy-2/24526592

We are facing the same issues: inability to properly monitor and clean failing tasks, complex dependency introduced in the codebase.

Dramatiq seems to check a few boxes: Good redis support, reliable tasks, simple API, simple dashboard, understandable code. Let's try it!

Summary by CodeRabbit

Release Notes

New Features
- Added Dramatiq Dashboard for task monitoring in the admin panel
Chores
- Migrated task management system from Celery to Dramatiq
- Updated container image versions and dependencies
- Removed search reindex management command
- Updated worker and scheduler deployment configuration
- Configured scheduled tasks for system maintenance (selfcheck and message retry)

coderabbitai · 2025-10-20T23:50:52Z

Walkthrough

This PR migrates the backend message queue system from Celery to Dramatiq. Key changes include replacing task decorators and invocation methods (.delay() → .send()), updating result tracking (.id → .message_id), implementing a new progress reporting system via set_task_progress, updating Docker and deployment configurations, and adding a Dramatiq dashboard to Django admin.

Changes

Cohort / File(s)	Summary
Configuration & Infrastructure `Makefile`, `Procfile`, `compose.yaml`, `env.d/development/backend.defaults`, `src/backend/Dockerfile`, `docs/env.md`	Updated deployment and development configurations: pip-audit ignore flag in Makefile, Procfile worker/scheduler commands using Django management instead of Celery, docker-compose service renames (celery-dev → worker-dev, celery-ui → scheduler-dev) with new commands and dependencies, Docker base image and Poetry version bumps, Dramatiq environment variables, and documentation updates reflecting broker URL changes (CELERY_BROKER_URL → DRAMATIQ_BROKER_URL).
Core Task Framework `src/backend/core/utils.py`, `src/backend/core/worker_setup.py`, `src/backend/core/management/commands/worker.py`	Introduced new Dramatiq task utilities: `register_task()` wrapper, `set_task_progress()` for progress tracking, `get_task_progress()` for retrieval; added worker setup module for Django initialization; created custom worker command extending django_dramatiq's RunDramatiqCommand.
Task Definitions Migration `src/backend/core/mda/tasks.py`, `src/backend/core/services/importer/tasks.py`, `src/backend/core/services/search/tasks.py`	Migrated from Celery decorators to @register_task decorator; removed `self` parameter from task signatures; replaced Celery state updates with explicit `set_task_progress()` calls; added cron scheduling for selfcheck and retry tasks; removed error-handling try/except blocks in favor of direct execution flow.
Task Invocation Updates `src/backend/core/api/viewsets/send.py`, `src/backend/core/services/importer/service.py`, `src/backend/core/signals.py`	Changed task invocation from `.delay()` to `.send()`; updated result handling from `task.id` to `task.message_id`; preserved surrounding business logic and error handling.
Task Status & Progress Tracking `src/backend/core/api/viewsets/task.py`, `src/backend/core/services/importer/imap.py`	Replaced Celery AsyncResult-based status checks with Dramatiq result backend queries; added new DRAMATIQ_STATES mapping; implemented progress-driven response with progress/message/timestamp fields; removed task_instance parameter from process_folder_messages signature.
Admin Interface `src/backend/core/admin.py`, `src/backend/core/templates/admin/index.html`, `src/backend/messages/urls.py`	Integrated Dramatiq dashboard into Django admin: new CoreAdminSite with get_urls override exposing `/dramatiq/` endpoint; staff-only dashboard view via WSGI synthetic environment; admin index template link to Task Management dashboard.
Settings & Configuration `src/backend/messages/settings.py`, `src/backend/pyproject.toml`	Removed all Celery configuration (CELERY_BROKER_URL, CELERY_RESULT_BACKEND, django_celery_beat, etc.); added Dramatiq configuration blocks (DRAMATIQ_BROKER, DRAMATIQ_RESULT_BACKEND, DRAMATIQ_CRONTAB) across all environments; updated dependencies: removed celery[redis], django-celery-beat, django-celery-results; added django-dramatiq, dramatiq[redis], dramatiq-dashboard, dramatiq-crontab[sentry].
Removed/Deleted `src/backend/core/management/commands/search_reindex.py`, `src/backend/core/services/dns/tasks.py`, `src/backend/messages/celery_app.py`	Removed OpenSearch reindex management command, DNS Celery task scaffold, and Celery app configuration module.
Module/Package Updates `src/backend/core/tasks.py`, `src/backend/core/services/search/__init__.py`, `src/backend/core/services/search/index.py`, `src/backend/messages/__init__.py`, `src/backend/core/management/commands/run_task.py`	Commented out task imports in core/tasks.py; removed reindex_all/reindex_mailbox/reindex_thread exports and implementations; removed celery_app export; updated run_task.py terminology and task discovery logic to search multiple task modules in priority order.
Test Suite Updates `src/backend/core/tests/conftest.py`, `src/backend/core/tests/api/test_.py`, `src/backend/core/tests/importer/test_.py`, `src/backend/core/tests/search/test_*.py`	Added dramatiq worker and worker_broker fixtures; created comprehensive TaskDetailViewTest with progress/status scenarios; updated all task mocks from `.delay()` to `.send()` and result references from `.id` to `.message_id`; refactored progress mocking to use set_task_progress; added integration test for task API; removed celery_app backend patching.

Sequence Diagram(s)

sequenceDiagram
    participant Client as Client/API
    participant Handler as View/Signal Handler
    participant Queue as Dramatiq Queue
    participant Worker as Dramatiq Worker
    participant Cache as Django Cache
    participant DB as Database

    alt Old Celery Flow
        Client->>Handler: Request/Event
        Handler->>Queue: task.delay() 
        Queue->>Worker: Get task
        Worker->>DB: Update via self.update_state()
        Client->>Handler: Poll task.id in Celery backend
        Handler-->>Client: Status
    end

    alt New Dramatiq Flow
        Client->>Handler: Request/Event
        Handler->>Queue: task.send()
        Queue->>Worker: Get task
        Worker->>Cache: set_task_progress(progress, metadata)
        Cache-->>Cache: Store {progress, message, timestamp}
        Client->>Handler: Poll task.message_id
        Handler->>Cache: get_task_progress(task_id)
        Cache-->>Handler: {progress, message, timestamp}
        Handler-->>Client: Status with progress data
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

This migration spans multiple architectural layers with high heterogeneity: configuration changes, decorator/invocation pattern updates across multiple task modules, new progress tracking infrastructure, admin integration, and comprehensive test updates. While many changes follow predictable patterns (.delay() → .send()), the task definition logic, progress handling, and state management require separate reasoning for each affected module. The scope is broad (30+ files), with dense logic changes in task handlers and API endpoints.

Possibly related PRs

♻️(import) fix naming after refactor #365: Modifies the same src/backend/core/services/importer/imap.py module—this PR changes progress reporting mechanism while the retrieved PR further refactors the importer function signatures and promotion.
✨(mta) add prototype integrated MTA out #285: Modifies task/worker codepaths (src/backend/core/mda/tasks.py, task scheduling)—this PR replaces Celery infrastructure while the retrieved PR adds complementary Celery-based MTA selfcheck and SMTP outbound functionality.
✨(import) Enhance message import through multipart upload #372: Modifies message import codepaths (src/backend/core/api/viewsets/import_message.py, src/backend/core/services/importer/service.py, src/backend/core/services/importer/tasks.py)—both PRs affect the same import task signatures and invocation patterns.

Suggested reviewers

sdemagny
Nastaliss

Poem

🐰 From Celery's bloom to Dramatiq's beat,
Tasks now send forth in rhythmic fleet,
Progress whispers through the cache so bright,
Workers dance through the queue-less night!
No more delays—just messages true,
A hop, skip, and queue-jump right through! 🎪

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check	✅ Passed	The pull request title "✨(worker) switch from Celery to Dramatiq" directly and clearly summarizes the primary change across the entire changeset. The raw summary shows extensive modifications throughout the codebase focused on migrating from Celery to Dramatiq, including updates to task definitions (core/mda/tasks.py, core/services/importer/tasks.py, core/services/search/tasks.py), configuration files (messages/settings.py), dependencies (pyproject.toml), and management commands. The title accurately reflects this main objective stated in the PR description and uses concise, specific language that a teammate scanning the history would immediately understand.
Docstring Coverage	✅ Passed	Docstring coverage is 97.53% which is sufficient. The required threshold is 80.00%.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch dramatiq

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 Pylint (4.0.1)

src/backend/core/admin.py

src/backend/core/api/viewsets/task.py

src/backend/core/api/viewsets/send.py

23 others

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 13

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (9)

src/backend/core/tests/search/test_search.py (2)

136-150: Fix incorrect assertion keys to match task return structure.

The task function reindex_all_task returns a dictionary with a "success" key, not "status". Line 145 will fail.

Apply this diff to fix the assertion:
-    assert result["status"] == "success"
+    assert result["success"] is True
152-164: Fix incorrect assertion keys to match task return structure.

The task function reindex_mailbox_task returns "success" and "mailbox_id" keys, not "status" and "mailbox". Lines 162-163 will fail.

Apply this diff to fix the assertions:
-    assert result["status"] == "success"
-    assert result["mailbox"] == str(test_mailbox.id)
+    assert result["success"] is True
+    assert result["mailbox_id"] == str(test_mailbox.id)

src/backend/core/services/importer/tasks.py (1)

184-225: MBOX streaming loads the entire file into memory (risk of OOM).

Collecting every message then reversing defeats the “without loading everything into memory” goal and will blow up on large mbox files. Use offsets with a spooled temp file to reverse without buffering message bodies.

-def stream_mbox_messages(file) -> Generator[bytes, None, None]:
+def stream_mbox_messages(file) -> Generator[bytes, None, None]:
@@
-    current_message = []
-    in_message = False
-    messages = []
-
-    # Read line by line to avoid loading entire file into memory at once
-    # We still need to collect messages for reversing due to mbox format
-    for line in file:
-        # Check for mbox message separator
-        if line.startswith(b"From "):
-            if in_message and current_message:
-                # End of previous message - store it
-                messages.append(b"".join(current_message))
-                current_message = []
-            in_message = True
-            # Skip the mbox From line
-            continue
-
-        if in_message:
-            current_message.append(line)
-
-    # Add the last message if there is one
-    if current_message:
-        messages.append(b"".join(current_message))
-
-    # Yield messages in reverse order to treat replies correctly
-    # (mbox format stores newest messages first)
-    for message in reversed(messages):
-        yield message
+    # Spool to disk, track offsets of message boundaries, then slice in reverse.
+    import io
+    import tempfile
+
+    with tempfile.SpooledTemporaryFile(max_size=32 * 1024 * 1024) as tmp:
+        # Copy stream to a seekable tmp file in chunks
+        for chunk in iter(lambda: file.read(1024 * 1024), b""):
+            tmp.write(chunk)
+        tmp.flush()
+        tmp.seek(0)
+
+        offsets: list[tuple[int, int]] = []  # (start, end)
+        start = None
+        pos = 0
+        while True:
+            line = tmp.readline()
+            if not line:
+                # EOF
+                if start is not None:
+                    offsets.append((start, pos))
+                break
+            if line.startswith(b"From "):
+                if start is not None:
+                    offsets.append((start, pos))
+                start = pos + len(line)  # skip the mbox "From " separator
+            pos = tmp.tell()
+
+        for start, end in reversed(offsets):
+            tmp.seek(start)
+            remaining = end - start
+            buf = io.BytesIO()
+            while remaining > 0:
+                chunk = tmp.read(min(1024 * 1024, remaining))
+                if not chunk:
+                    break
+                buf.write(chunk)
+                remaining -= len(chunk)
+            yield buf.getvalue()

If S3 storage returns a non-seekable stream, this approach still works since we spool locally first. Based on learnings.

src/backend/core/tests/importer/test_imap_import.py (3)

174-235: Tests still assume Celery’s update_state; switch to set_task_progress or assert only results.

The task no longer calls update_state; these assertions will never run. Replace with progress assertions via set_task_progress or drop progress checks.

-@patch("imaplib.IMAP4_SSL")
-def test_imap_import_task_success(
-    mock_imap4_ssl, mailbox, mock_imap_connection, sample_email
-):
+@patch("imaplib.IMAP4_SSL")
+@patch("core.services.importer.tasks.set_task_progress")
+def test_imap_import_task_success(
+    mock_set_task_progress, mock_imap4_ssl, mailbox, mock_imap_connection, sample_email
+):
@@
-    # Create a mock task instance
-    mock_task = MagicMock()
-    mock_task.update_state = MagicMock()
-
-    with patch.object(
-        import_imap_messages_task, "update_state", mock_task.update_state
-    ):
-        # Run the task
-        task = import_imap_messages_task(
+    # Run the task directly (actor is callable)
+    task = import_imap_messages_task(
             imap_server="imap.example.com",
@@
-        # Verify progress updates were called correctly
-        assert mock_task.update_state.call_count == 4  # 3 PROGRESS + 1 SUCCESS
-
-        # Verify progress updates
-        for i in range(1, 4):
-            mock_task.update_state.assert_any_call(
-                state="PROGRESS",
-                meta={
-                    "result": {
-                        "message_status": f"Processing message {i} of 3",
-                        "total_messages": 3,
-                        "success_count": i,  # Current message was successful
-                        "failure_count": 0,
-                        "type": "imap",
-                        "current_message": i,
-                    },
-                    "error": None,
-                },
-            )
-
-        # Verify success update
-        mock_task.update_state.assert_any_call(
-            state="SUCCESS",
-            meta=task,
-        )
+    # Optional: verify at least initial progress was reported
+    mock_set_task_progress.assert_any_call(0, {"message": "Connecting to IMAP server"})

259-303: Remove update_state expectations in login failure test.

No update_state is called; assert the returned failure payload instead.

-    # Create a mock task instance
-    mock_task = MagicMock()
-    mock_task.update_state = MagicMock()
-
-    # Mock IMAP connection to raise an error on login
-    with (
-        patch.object(import_imap_messages_task, "update_state", mock_task.update_state),
-        patch("core.services.importer.imap.imaplib.IMAP4_SSL") as mock_imap,
-    ):
+    # Mock IMAP connection to raise an error on login
+    with patch("core.services.importer.imap.imaplib.IMAP4_SSL") as mock_imap:
@@
-        # Verify only failure update was called
-        assert mock_task.update_state.call_count == 1
-        mock_task.update_state.assert_called_once_with(
-            state="FAILURE",
-            meta={
-                "result": task_result["result"],
-                "error": task_result["error"],
-            },
-        )
+        # No progress calls required here; just ensure failure payload is correct

308-371: Fix mocked progress and assertions; current test shadows the patched mock and uses Celery-style args.

You overwrite mock_set_task_progress with a new MagicMock(), so no calls are captured; also the signature should be set_task_progress(progress: int, metadata: dict).

-@patch("imaplib.IMAP4_SSL")
-@patch("core.services.importer.tasks.set_task_progress")
-def test_imap_import_task_message_fetch_failure(
-    mock_set_task_progress, mock_imap4_ssl, mailbox
-):
+@patch("imaplib.IMAP4_SSL")
+@patch("core.services.importer.tasks.set_task_progress")
+def test_imap_import_task_message_fetch_failure(
+    mock_set_task_progress, mock_imap4_ssl, mailbox
+):
@@
-    mock_set_task_progress = MagicMock()
+    # Use the patched mock_set_task_progress injected by the decorator
@@
-    # Verify progress updates were called correctly
-    assert mock_set_task_progress.call_count == 3  # 3 PROGRESS
+    # Verify progress updates were emitted (count depends on implementation)
+    assert mock_set_task_progress.call_count >= 1
@@
-    for i in range(1, 4):
-        mock_set_task_progress.assert_any_call(
-            state="PROGRESS",
-            meta={
-                "result": {
-                    "message_status": f"Processing message {i} of 3",
-                    "total_messages": 3,
-                    "success_count": 0,
-                    "failure_count": i,  # Current message failed
-                    "type": "imap",
-                    "current_message": i,
-                },
-                "error": None,
-            },
-        )
+    # Example: assert one of the calls contained the expected metadata message
+    messages = [
+        kwargs.get("metadata", {}).get("message")
+        if kwargs else (args[1].get("message") if len(args) >= 2 else None)
+        for args, kwargs in (call.args, call.kwargs) for call in mock_set_task_progress.call_args_list
+    ]
+    assert any("Processing message" in (m or "") for m in messages)

src/backend/core/api/viewsets/task.py (1)

37-45: Schema omits PROGRESS fields actually returned.

Add progress, message, and timestamp to OpenAPI schema to avoid drift.

-            fields={
-                "status": drf_serializers.ChoiceField(choices=sorted(DRAMATIQ_STATES)),
-                "result": drf_serializers.JSONField(allow_null=True),
-                "error": drf_serializers.CharField(allow_null=True),
-            },
+            fields={
+                "status": drf_serializers.ChoiceField(choices=sorted(DRAMATIQ_STATES)),
+                "result": drf_serializers.JSONField(allow_null=True),
+                "error": drf_serializers.CharField(allow_null=True),
+                # Present when status == "PROGRESS"
+                "progress": drf_serializers.IntegerField(required=False),
+                "message": drf_serializers.CharField(required=False, allow_blank=True),
+                "timestamp": drf_serializers.FloatField(required=False),
+            },

src/backend/core/mda/tasks.py (1)

113-122: Set final progress on early error paths

When the message doesn’t exist or is a draft, progress never reaches 100%, leaving the UI in limbo. Set progress to 100 with an error message before returning.

         except models.Message.DoesNotExist:
             error_msg = f"Message with ID '{message_id}' does not exist"
-            return {"success": False, "error": error_msg}
+            set_task_progress(100, {"message": error_msg})
+            return {"success": False, "error": error_msg}
 
         if message.is_draft:
             error_msg = f"Message '{message_id}' is still a draft and cannot be sent"
-            return {"success": False, "error": error_msg}
+            set_task_progress(100, {"message": error_msg})
+            return {"success": False, "error": error_msg}

src/backend/core/services/search/tasks.py (1)

103-125: Handle nonexistent mailbox gracefully

Mirror your thread/message handlers: catch Mailbox.DoesNotExist and return a structured error.

-    threads = models.Mailbox.objects.get(id=mailbox_id).threads_viewer
+    try:
+        threads = models.Mailbox.objects.get(id=mailbox_id).threads_viewer
+    except models.Mailbox.DoesNotExist:
+        logger.error("Mailbox %s does not exist", mailbox_id)
+        return {"mailbox_id": str(mailbox_id), "success": False, "error": "mailbox_not_found"}

🧹 Nitpick comments (11)

src/backend/core/worker_setup.py (1)

6-11: Consider adding error handling and logging for worker initialization.

The module-level execution of install() and django.setup() lacks error handling and observability. If initialization fails, workers may start with incomplete Django configuration, leading to cryptic task failures.

Apply this diff to add error handling and logging:
+import logging
 import django

 from configurations.importer import install

+logger = logging.getLogger(__name__)
+
+try:
+    logger.info("Initializing Django configuration for Dramatiq worker...")
-install(check_options=True)
-django.setup()
+    install(check_options=True)
+    django.setup()
+    logger.info("Django configuration initialized successfully")
+except Exception as e:
+    logger.exception("Failed to initialize Django configuration for worker: %s", e)
+    raise
As per coding guidelines.

src/backend/core/utils.py (2)

39-57: Clamp progress and use timezone-aware timestamps.

Avoid invalid values and prefer timezone-aware timestamps for consistency across services.

-from typing import Any, Dict, Optional
+from typing import Any, Dict, Optional
+from django.utils import timezone
@@
-def set_task_progress(progress: int, metadata: Optional[Dict[str, Any]] = None) -> None:
+def set_task_progress(progress: int, metadata: Optional[Dict[str, Any]] = None) -> None:
@@
-    task_id = current_message.message_id
+    task_id = current_message.message_id
+    # Defensive clamp to [0, 100]
+    try:
+        progress = max(0, min(100, int(progress)))
+    except (TypeError, ValueError):
+        progress = 0
     progress_data = {
         "progress": progress,
-        "timestamp": time.time(),
+        "timestamp": timezone.now().timestamp(),
         "metadata": metadata or {},
     }

60-69: LGTM.

Simple cache getter; consider adding a small, module-level constant for the cache key prefix to avoid typos across modules.

src/backend/core/services/importer/tasks.py (2)

97-115: Guard against zero-division and over-update.

If total_messages is 0, the loop won’t run, but being explicit avoids future refactors breaking this. Also, avoid computing i/total_messages when total_messages is small.

-                if i % 100 == 0 or i == total_messages:
-                    progress_percentage = min(20 + (i / total_messages) * 70, 90)
+                if total_messages > 0 and (i % 100 == 0 or i == total_messages):
+                    progress_percentage = min(20 + (i / total_messages) * 70, 90)

366-379: Consider emitting a final 100% progress update on success.

MBOX task sets 100% at completion; mirror this for EML to keep UX consistent.

-        if success:
-            return {
+        if success:
+            set_task_progress(100, {"message": "EML processing completed successfully"})
+            return {
                 "status": "SUCCESS",
                 "result": result,
                 "error": None,
             }

src/backend/core/tests/importer/test_imap_import.py (1)

373-454: Drop update_state patch in duplicate recipients test; keep result assertions only.

The actor doesn’t call update_state.

-    # Create a mock task instance
-    mock_task = MagicMock()
-    mock_task.update_state = MagicMock()
-
-    with patch.object(
-        import_imap_messages_task, "update_state", mock_task.update_state
-    ):
-        # Run the task
-        task = import_imap_messages_task(
+    # Run the task
+    task = import_imap_messages_task(
             imap_server="imap.example.com",
@@
-        # Critical: Verify that no validation errors were logged
-        # This ensures the deduplication logic works correctly
+        # Critical: Verify that no validation errors were logged
+        # This ensures the deduplication logic works correctly
         mock_logger.error.assert_not_called()

src/backend/core/api/viewsets/task.py (1)

65-65: Prefer the project’s IsAuthenticated to avoid DB hits.

Swap DRF’s permission for core.api.permissions.IsAuthenticated which short-circuits on tokens.
-from rest_framework import permissions
+from rest_framework import permissions
+from core.api.permissions import IsAuthenticated as CoreIsAuthenticated
@@
-    permission_classes = [permissions.IsAuthenticated]
+    permission_classes = [CoreIsAuthenticated]

src/backend/core/mda/tasks.py (1)

64-70: Return archive outcome in task result

Expose whether archiving actually happened to callers.

-    result = {
-        "message_id": str(message_id),
-        "success": True,
-    }
+    result = {
+        "message_id": str(message_id),
+        "success": True,
+        "archived": bool(must_archive),
+    }

src/backend/core/tests/api/test_task.py (1)

132-138: Create a valid Message (or use a harmless actor) for integration

send_message_task.send("test-message-id") will likely raise Message.DoesNotExist when the worker processes it. Either:

Build a minimal sendable Message via factories, then pass its id; or

Replace with a no-op/cheap actor for this integration check.

Example (adjust to your factories/models):
-    result = send_message_task.send("test-message-id")
+    # e.g., using a MessageFactory or minimal model setup:
+    # msg = factories.MessageFactory.create(is_draft=False, is_sender=True, ...)
+    # result = send_message_task.send(str(msg.id))

src/backend/core/services/search/tasks.py (2)

31-51: Stream large querysets to reduce memory footprint

For big datasets, use iterator() and compute total via .count() to avoid caching the entire queryset.

-    threads = models.Thread.objects.all()
-    total = threads.count()
+    total = models.Thread.objects.count()
+    threads = models.Thread.objects.all().iterator(chunk_size=1000)
 
-    for i, thread in enumerate(threads):
+    for i, thread in enumerate(threads, start=1):
         try:
             if index_thread(thread):
                 success_count += 1
             else:
                 failure_count += 1
 ...
-        if i % 100 == 0:
+        if i % 100 == 0:
             logger.debug("Progress for all threads: %s of %s", i, total)

121-124: Optional: expose progress via set_task_progress

If you want UI progress parity with MDA tasks, call set_task_progress at start/periodically/finish.

-from core.utils import register_task
+from core.utils import register_task, set_task_progress
@@
-    # Ensure index exists first
+    # Ensure index exists first
     create_index_if_not_exists()
+    set_task_progress(0, {"message": "Reindex mailbox started", "mailbox_id": str(mailbox_id)})
@@
-        if i % 50 == 0:
-            logger.debug(
-                "Updating progress for mailbox %s: %s of %s", mailbox_id, i, total
-            )
+        if i % 50 == 0:
+            pct = min(int(i / max(total, 1) * 100), 99)
+            set_task_progress(pct, {"message": f"Mailbox {mailbox_id} {i}/{total}"})
@@
-    return {
+    set_task_progress(100, {"message": f"Mailbox {mailbox_id} reindex complete"})
+    return {

Also applies to: 49-51

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between bf77b07 and ce0791e.

⛔ Files ignored due to path filters (1)

src/backend/poetry.lock is excluded by !**/*.lock

📒 Files selected for processing (40)

Makefile (1 hunks)
Procfile (1 hunks)
compose.yaml (2 hunks)
docs/env.md (1 hunks)
env.d/development/backend.defaults (1 hunks)
src/backend/Dockerfile (2 hunks)
src/backend/core/admin.py (2 hunks)
src/backend/core/api/viewsets/import_message.py (2 hunks)
src/backend/core/api/viewsets/send.py (2 hunks)
src/backend/core/api/viewsets/task.py (4 hunks)
src/backend/core/management/commands/run_task.py (5 hunks)
src/backend/core/management/commands/search_reindex.py (0 hunks)
src/backend/core/management/commands/worker.py (1 hunks)
src/backend/core/mda/outbound.py (1 hunks)
src/backend/core/mda/tasks.py (6 hunks)
src/backend/core/services/dns/tasks.py (0 hunks)
src/backend/core/services/importer/imap.py (2 hunks)
src/backend/core/services/importer/service.py (2 hunks)
src/backend/core/services/importer/tasks.py (10 hunks)
src/backend/core/services/search/__init__.py (0 hunks)
src/backend/core/services/search/index.py (0 hunks)
src/backend/core/services/search/tasks.py (8 hunks)
src/backend/core/signals.py (3 hunks)
src/backend/core/tasks.py (1 hunks)
src/backend/core/templates/admin/index.html (1 hunks)
src/backend/core/tests/api/test_messages_import.py (15 hunks)
src/backend/core/tests/api/test_send_message_signature.py (11 hunks)
src/backend/core/tests/api/test_task.py (1 hunks)
src/backend/core/tests/conftest.py (1 hunks)
src/backend/core/tests/importer/test_file_import.py (2 hunks)
src/backend/core/tests/importer/test_imap_import.py (4 hunks)
src/backend/core/tests/importer/test_import_service.py (8 hunks)
src/backend/core/tests/search/test_search.py (3 hunks)
src/backend/core/utils.py (2 hunks)
src/backend/core/worker_setup.py (1 hunks)
src/backend/messages/__init__.py (0 hunks)
src/backend/messages/celery_app.py (0 hunks)
src/backend/messages/settings.py (4 hunks)
src/backend/messages/urls.py (1 hunks)
src/backend/pyproject.toml (2 hunks)

💤 Files with no reviewable changes (6)

src/backend/messages/init.py
src/backend/core/services/search/index.py
src/backend/core/services/search/init.py
src/backend/core/management/commands/search_reindex.py
src/backend/core/services/dns/tasks.py
src/backend/messages/celery_app.py

🧰 Additional context used

📓 Path-based instructions (6)

src/backend/**/*.py