Merge pull request #17 from astronomy-commons/pixel_margin_math

add get_margin and get_edge to pixel_math + create a Pixel Math Appen…
astronomy-commons · Feb 6, 2023 · e901d9e · e901d9e
2 parents 736c970 + 2040959
commit e901d9e
Show file tree

Hide file tree

Showing 4 changed files with 341 additions and 0 deletions.
diff --git a/src/hipscat/pixel_math/README.md b/src/hipscat/pixel_math/README.md
@@ -0,0 +1,151 @@
+# Pixel Math Appendix
+A reference document for the various utility functions of `hipscat/pixel_math`.
+
+## Pixel Margins
+The functions made to find the pixels that make up the border region of a given healpixel. Primarly used as a way to speed up the neighbor/margin caching code for [hipscat-import](https://github.com/astronomy-commons/hipscat-import/). Code originally created by Mario Juric for HIPS, found [here](https://github.com/mjuric/HIPS/blob/feature/multiprocess/hipscat/healpix.py).
+
+### get_edge
+Given a pixel pix at some order, return all
+pixels order dk _higher_ than pix's order that line
+pix's edge (or a corner).
+
+pix: the pixel ID for which the margin is requested
+dk: the requested order of edge pixel IDs, relative to order of pix
+edge: which edge/corner to return (NE edge=0, E corner=1, SE edge = 2, ....)
+
+#### Algorithm
+
+If you look at how the NEST indexing scheme works, a pixel at some order is
+subdivided into four subpixel at every subsequent order in such a way that the south
+subpixel index equals `4*pix`, east is `4*pix+1`, west is `4*pix+2`, north is `4*pix+3`:
+
+```
+                4*pix+3
+pix ->     4*pix+2    4*pix+1
+                4*pix
+```
+
+Further subdivisions split up each of those sub pixels accordingly. For example,
+the eastern subpixel (`4*pix+1`) gets divided up into four more:
+
+```
+S=4*(4*pix+1), E=4*(4*pix+1)+1, W=4*(4*pix+1)+2, N=4*(4*pix+1)+3
+
+                                                            4*(4*pix+3)+3
+                4*pix+3                             4*(4*pix+3)+2   4*(4*pix+3)+1
+                                            4*(4*pix+2)+3   4*(4*pix+3)   4*(4*pix+1)+3
+pix ===>  4*pix+2    4*pix+1  ===>  4*(4*pix+2)+2   4*(4*pix+2)+1   4*(4*pix+1)+2   4*(4*pix+1)+1
+                                            4*(4*pix+2)    4*(4*pix)+3    4*(4*pix+1)
+                4*pix                                4*(4*pix)+2   4*(4*pix)+1
+                                                                4*(4*pix)
+```
+etcetera...
+
+We can see that the edge indices follow a pattern. For example, after two
+subdivisions the south-east edge would consist of pixels:
+```
+4*(4*pix), 4*(4*pix)+1
+4*(4*pix+1), 4*(4*pix+1)+1
+```
+with the top coming from subdividing the southern, and bottom the eastern pixel.
+and so on, recursively, for more subdivisions. Similar patterns are identifiable
+with other edges.
+
+This can be compactly written as:
+
+```
+4*(4*(4*pix + i) + j) + k ....
+```
+
+with i, j, k, ... being indices whose choice specifies which edge we get.
+For example iterating through, i = {0, 1}, j = {0, 1}, k = {0, 1} generates indices
+for the 8 pixels of the south-east edge for 3 subdivisions. Similarly, for
+the north-west edge the index values would loop through {2, 3}, etc.
+
+This can be expanded as:
+
+```
+4**dk*pix  +  4**(dk-1)*i + 4**(dk-2)*j + 4**(dk-3) * k + ...
+```
+
+where dk is the number of subdivisions. E.g., for dk=3, this would equal:
+
+```
+4**3*pix + 4**2*i + 4**1*j + k
+```
+
+When written with bit-shift operators, another interpretation becomes clearer:
+
+```
+pix << 6 + i << 4 + j << 2 + k
+```
+
+or if we look at the binary representation of the resulting number:
+
+```
+    [pix][ii][jj][kk]
+```
+
+Where ii, jj, kk are the bits of _each_ of the i,j,k indices. That is, to get the
+list of subpixels on the edge, we bit-shift the base index pix by 2*dk to the left,
+and fill in the rest with all possible combinations of ii, jj, kk indices. For example,
+the northeast edge has index values {2, 3} = {0b10, 0b11}, so for (say) pix=8=0b1000, the
+northeast edge indices after two subdivisions are equal to:
+
+```
+0b1000 10 10 = 138
+0b1000 10 11 = 139
+0b1000 11 10 = 148
+0b1000 11 11 = 143
+```
+
+Note that for a given edge and dk, the suffix of each edge index does not depend on
+the value of pix (pix is bit-shifted to the right and added). This means it can be
+precomputed & stored for repeated reuse.
+
+#### Implementation
+
+The implementation is based on the binary digit interpretation above. For a requested
+edge and dk subdivision level, we generate (and cache) the suffixes. Then, for a given
+pixel pix, the edge pixels are readily computed by left-shifting pix by 2*dk places,
+and summing (== or-ing) it with the suffix array.
+
+### get_margin
+Returns the list of indices of pixels of order (kk+dk) that
+border the pixel pix of order kk.
+
+#### Algorithm
+Given a pixel pix, find all of its neighbors. Then find the
+edge at level (kk+dk) for each of the neighbors.
+
+This is relatively straightforward in the equatorial faces of the Healpix
+sphere -- e.g., one takes the SW neighbor and requests its NE edge to get
+the margin on that side of the pixel, then the E corner of the W neighbor,
+etc...
+
+This breaks down on pixels that are at the edge of polar faces. There,
+the neighbor's sense of direction _rotates_ when switching from face to
+face. For example at order=2, pixel 5 is bordered by pixel 26 to the
+northeast (see the Fig3, bottom, https://healpix.jpl.nasa.gov/html/intronode4.htm).
+But to pixel 5 that is the **northwest** edge (and not southwest, as you'd
+expect; run `hp.get_all_neighbours(4, 26, nest=True)` and
+`hp.get_all_neighbours(4, 5, nest=True)` to verify these claims).
+
+This is because directions rotate 90deg clockwise when moving from one
+polar face to another in easterly direction (e.g., from face 0 to face 1).
+We have to identify when this happens, and change the edges we request
+for such neighbors. Mathematically, this rotation is equal to adding +2
+(modulo 8) to the requested edge in get_edge(). I.e., for the
+pixel 5 example, rather than requesting SE=4 edge of pixel 26,
+we request SE+2=6=NE edge (see the comments in the definition of _edge_vectors
+near get_edge()).
+
+This index shift generalizes to `2 * (neighbor_face - pix_face)` for the case
+when _both_ faces are around the pole (either north or south). In the south,
+the rotation is in the opposite direction (ccw) -- so the sign of the shift
+changes. The generalized formula also handles the pixels whose one vertex
+is the pole (where all three neighbors sharing that vertex are on different
+faces).
+
+#### Implementation
+Pretty straightforward implementation of the algorithm above.
diff --git a/src/hipscat/pixel_math/__init__.py b/src/hipscat/pixel_math/__init__.py
@@ -6,3 +6,8 @@
     generate_destination_pixel_map,
     generate_histogram,
 )
+
+from .pixel_margins import (
+    get_edge,
+    get_margin
+)
diff --git a/src/hipscat/pixel_math/pixel_margins.py b/src/hipscat/pixel_math/pixel_margins.py
@@ -0,0 +1,111 @@
+#
+# Healpix utilities
+#
+
+import numpy as np
+import healpy as hp
+
+# see the documentation for get_edge() about this variable
+_edge_vectors = [
+    np.asarray([1, 3]), # NE edge
+    np.asarray([1]),    # E corner
+    np.asarray([0, 1]), # SE edge
+    np.asarray([0]),    # S corner
+    np.asarray([0, 2]), # SW edge
+    np.asarray([2]),    # W corner
+    np.asarray([2, 3]), # NW edge
+    np.asarray([3])     # N corner
+]
+
+# cache used by get_margin()
+_suffixes = {}
+
+def get_edge(d_order, pix, edge):
+    """Get all the pixels at order kk+dk bordering pixel pix.
+    See hipscat/pixel_math/README.md for more info.
+
+    Args:
+        dk (int): the change in k that we wish to find the margins for.
+        pix (int): the healpix pixel to find margin pixels of.
+        edge (int): the edge we want to find for the given pixel. (0-7)
+            0: NE edge
+            1: E corner
+            2: SE edge
+            3: S corner
+            4: SW edge
+            5: W corner
+            6: NW edge
+            7: N corner
+    Returns:
+        one-dimensional numpy array of long integers, filled with the healpix pixels 
+            at order kk+dk that border the given edge of pix.
+    """
+    if edge < 0 or edge > 7:
+        raise ValueError("edge can only be values between 0 and 7 (see docstring)")
+
+    if (edge, d_order) in _suffixes.keys():
+        pixel_edge = _suffixes[(edge, d_order)]
+    else:
+        # generate and cache the suffix:
+
+        # generate all combinations of i,j,k,... suffixes for the requested edge
+        # See https://stackoverflow.com/a/35608701
+        grid = np.array(np.meshgrid(*[_edge_vectors[edge]]*d_order))
+        pixel_edge = grid.T.reshape(-1, d_order)
+        # bit-shift each suffix by the required number of bits
+        pixel_edge <<= np.arange(2*(d_order-1),-2,-2)
+        # sum them up row-wise, generating the suffix
+        pixel_edge = pixel_edge.sum(axis=1)
+        # cache for further reuse
+        _suffixes[(edge, d_order)] = pixel_edge
+
+    # append the 'pix' preffix
+    pixel_edge = (pix << 2*d_order) + pixel_edge
+    return pixel_edge
+
+def get_margin(order, pix, d_order):
+    """Get all the pixels at order order+dk bordering pixel pix.
+    See hipscat/pixel_math/README.md for more info.
+
+    Args:
+        order (int): the healpix order of pix.
+        pix (int): the healpix pixel to find margin pixels of.
+        d_order (int): the change in k that we wish to find the margins for. 
+            Must be greater than kk.
+    Returns:
+        one-dimensional numpy array of long integers, filled with the healpix pixels 
+            at order kk+dk that border pix.
+    """
+    if d_order < 1:
+        raise ValueError("dk must be greater than order")
+
+    nside = hp.order2nside(order)
+
+    # get all neighboring pixels
+    neighbors = hp.get_all_neighbours(nside, pix, nest=True)
+
+    # get the healpix faces IDs of pix and the neighboring pixels
+    _, _, pix_face = hp.pix2xyf(nside, pix, nest=True)
+    _, _, faces  = hp.pix2xyf(nside, neighbors, nest=True)
+
+    # indices which tell get_edge() which edge/verted to return
+    # for a given pixel. The default order is compatible with the
+    # order returned by hp.get_all_neighbours().
+    which = np.arange(8)
+    if pix_face < 4: # northern hemisphere; 90deg cw rotation for every +1 face increment
+        mask = (faces < 4)
+        which[mask] += 2*(faces-pix_face)[mask]
+        which %= 8
+    elif pix_face >= 8: # southern hemisphere; 90deg ccw rotation for every +1 face increment
+        mask = (faces >= 8)
+        which[mask] -= 2*(faces-pix_face)[mask]
+        which %= 8
+
+    # get all edges/vertices 
+    # (making sure we skip -1 entries, for pixels with seven neighbors)
+    margins = []
+    for edge, pixel in zip(which, neighbors):
+        if pixel != -1:
+            margins.append(get_edge(d_order, pixel, edge))
+    margins = np.concatenate(margins)
+    return margins
diff --git a/tests/hipscat/pixel_math/test_pixel_margins.py b/tests/hipscat/pixel_math/test_pixel_margins.py
@@ -0,0 +1,74 @@
+"""Tests of pixel margin utility functions"""
+
+import pytest
+import numpy as np
+import numpy.testing as npt
+
+import hipscat.pixel_math as pm
+
+
+def test_get_margin():
+    """Check that the code works in the standard case."""
+    margins = pm.get_margin(2, 2, 2)
+
+    expected = np.array([1141, 1143, 1149, 1151, 1237,  
+                          128,  129,  132,  133,  144,   
+                           48,   50,   56,   58,   26,   
+                           10,   11,   14,   15, 1119])
+
+    assert len(margins) == 20
+
+    npt.assert_array_equal(margins, expected)
+
+def test_zero_dk():
+    """Check that the code fails when trying to find margins at same order as the pixel."""
+    with pytest.raises(ValueError) as ve:
+        margins = pm.get_margin(2, 2, 0)
+
+    assert str(ve.value) == "dk must be greater than order"
+
+def test_negative_dk():
+    """Check that the code fails when trying to find margins at a higher order than the pixel."""
+    with pytest.raises(ValueError) as ve:
+        margins = pm.get_margin(2, 2, -1)
+
+    assert str(ve.value) == "dk must be greater than order"
+
+def test_polar_edge():
+    """Check that the code works when trying to find margins around the north pole."""
+    margins = pm.get_margin(2, 5, 2)
+
+    expected = np.array([  69,   71,   77,   79,  101,  
+                          112,  113,  116,  117,  442,  
+                          426,  427,  430,  431, 1530, 
+                         1531, 1534, 1535, 1519])
+
+    assert len(margins) == 19
+
+    npt.assert_array_equal(margins, expected)
+
+def test_polar_edge_south():
+    """Check that the code works when trying to find margins around the south pole."""
+    margins = pm.get_margin(1, 35, 2)
+
+    expected = np.array([549, 551, 557, 559, 261, 
+                         272, 273, 276, 277,   0, 
+                         352, 354, 360, 362, 330, 
+                         538, 539, 542, 543, 527])
+    assert len(margins) == 20
+
+    npt.assert_array_equal(margins, expected)
+
+def test_edge_negative_value():
+    """Check to make sure get_edge fails when passed a negative edge value."""
+    with pytest.raises(ValueError) as ve:
+        edge = pm.get_edge(2, 5, -1)
+
+    assert str(ve.value) == "edge can only be values between 0 and 7 (see docstring)"
+
+def test_edge_greater_than_7():
+    """Check to make sure get_edge fails when passed an edge value greater than 7."""
+    with pytest.raises(ValueError) as ve:
+        edge = pm.get_edge(2, 5, 8)
+
+    assert str(ve.value) == "edge can only be values between 0 and 7 (see docstring)"