Unexpected district duplication in ReCom chains #426

@proebsting


We are seeing two unexpected behaviors from ReCom/GerryChain:

  1. A chain yields the identical plan twice in a row with some frequency. Our mental model is that two successive plans should share N-2 identical districts and differ in two new districts (from the pairwise recombination/split).
  2. Some district configurations get "reincarnated" with some frequency: a district in the current plan was absent from the immediate predecessor plan in the chain but present in one of the earlier plans. Our mental model here is that generated districts are random and drawn from an astronomically large set, so the same one reappearing should be extraordinarily rare.
    Put another way, we have evidence that some districts are being created by multiple independent steps of the chain. Even more concerning, some districts are being created many independent times over the course of a single chain. (We had one district appear from 42 different recombination/split steps in a 10,000-long chain.)
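
For reference, here is a minimal sketch of how both phenomena can be counted. It assumes each plan in the chain is available as a plain node-to-district mapping, and it compares districts by their node sets rather than by label (so a relabeled district still counts as the same district). The names `districts_of` and `analyze_chain` are hypothetical helpers for illustration, not part of GerryChain and not necessarily what the attached bundle does.

```python
from collections import Counter


def districts_of(assignment):
    """Return a plan as a frozenset of districts, each a frozenset of nodes."""
    groups = {}
    for node, label in assignment.items():
        groups.setdefault(label, set()).add(node)
    return frozenset(frozenset(nodes) for nodes in groups.values())


def analyze_chain(assignments):
    """Count identical consecutive plans and districts created more than once.

    `assignments` is an iterable of node -> district-label mappings,
    one per plan, in chain order (an assumption of this sketch).
    """
    repeated_plans = 0
    creations = Counter()  # district -> number of steps at which it appeared anew
    previous = None
    for assignment in assignments:
        current = districts_of(assignment)
        if previous is None:
            # Districts of the initial plan count as created once.
            creations.update(current)
        else:
            if current == previous:
                repeated_plans += 1
            # Districts present now but absent from the immediate predecessor.
            creations.update(current - previous)
        previous = current
    reincarnated = {d: n for d, n in creations.items() if n > 1}
    return repeated_plans, reincarnated


# Tiny hand-made chain on four nodes to exercise both counters.
plans = [
    {0: "A", 1: "A", 2: "B", 3: "B"},
    {0: "A", 1: "A", 2: "B", 3: "B"},  # identical to its predecessor
    {0: "A", 1: "B", 2: "A", 3: "B"},  # two new districts
    {0: "A", 1: "A", 2: "B", 3: "B"},  # both original districts reappear
]
repeated_plans, reincarnated = analyze_chain(plans)
```

In this toy chain, `repeated_plans` counts the one back-to-back duplicate, and `reincarnated` maps each of the two original districts to a creation count of 2.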

I have attached a bundle (bundle.zip) that reproduces both phenomena. The quickest reproduction uses the 20x30 uniform grid.

The reproduction with a 40x40 uniform grid with 4 districts is particularly noteworthy given that each step is splitting 800 nodes in half, which ought to have an astronomical number of unique possibilities.
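
To put a rough number on "astronomical": the sketch below computes a loose upper bound on the number of ways to split 800 nodes into two equal halves. It deliberately ignores contiguity and says nothing about how ReCom weights the splits it can actually produce, so it is only an upper bound on the count, not an estimate of the sampling distribution.

```python
import math

merged_nodes = 800          # two 400-node districts merged on the 40x40 grid
half = merged_nodes // 2

# Unordered balanced bipartitions of the merged region, ignoring contiguity.
upper_bound = math.comb(merged_nodes, half) // 2

print(f"upper bound has {len(str(upper_bound))} decimal digits")
```

The number of contiguous balanced splits is far smaller than this bound, but it is what the "astronomical" intuition above rests on.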

(Note, this happens with both the Kruskal and Wilson spanning tree algorithms.)
