|
| 1 | +--- |
| 2 | +title: Introducing the FlaSH Group |
| 3 | +author: Ananya Joshi, Nolan Gormley, Richa Gadgil, Tina Townes |
| 4 | +date: 2024-01-01 |
| 5 | +tags: |
| 6 | + - flash |
| 7 | +authors: |
| 8 | + - ajoshi |
| 9 | + - nolan |
| 10 | + - richa |
| 11 | + - tina |
| 12 | +heroImage: flash_long.png |
| 13 | +heroImageThumb: flash_logo.png |
| 14 | +summary: | |
| 15 | + Delphi's FlaSH group works to alert experts about data that suggests quality issues or changes in disease dynamics from the millions of data points they publish daily to improve the efficacy of Delphi's data. |
| 16 | +output: |
| 17 | + blogdown::html_page: |
| 18 | + toc: true |
| 19 | +--- |
| 20 | + |
| 21 | + |
| 22 | + |
| 23 | +<p>Delphi publishes millions of public-health-related data points per day, including the total number of daily influenza cases, hospitalizations, and deaths per county and state in the United States (US). This data helps public health practitioners, data professionals, and members of the public make important, informed decisions relating to health and well-being.</p> |
| 24 | +<p>Yet, as data volumes continue to grow quickly (Delphi’s data volume expanded 1000x in just 3 years), it is infeasible for data reviewers to inspect every one of these data points for subtle changes in</p> |
| 25 | +<ul> |
| 26 | +<li>quality (like those resulting from data delays) or</li> |
| 27 | +<li>disease dynamics (like an outbreak).</li> |
| 28 | +</ul> |
| 29 | +<p>These issues, if undetected, can have critical downstream ramifications on data users (as shown by the example in Fig 1).</p> |
| 30 | +<div class="float"> |
| 31 | +<img src="/blog/2024-01-01-flash-intro/forecast.jpg" alt="Fig 1. Data quality changes in case counts, shown by the large spikes in March and July 2022, when cases were trending down, resulted in similar spikes for predicted counts (red) from multiple forecasts that were then sent to the US CDC. A weekly forecast per state, for cases, hospitalizations, and deaths, up to 4 weeks in the future means that modeling teams would have to review 600 forecasts per week and may not have been able to catch the upstream data issue." /> |
| 32 | +<div class="figcaption">Fig 1. Data quality changes in case counts, shown by the large spikes in March and July 2022, when cases were trending down, resulted in similar spikes for predicted counts (red) from multiple forecasts that were then sent to the US CDC. A weekly forecast per state, for cases, hospitalizations, and deaths, up to 4 weeks in the future means that modeling teams would have to review 600 forecasts per week and may not have been able to catch the upstream data issue.</div> |
| 33 | +</div> |
| 34 | +<p>We care about finding data issues like these so that we can alert downstream data users accordingly. That is why our goal in the FlaSH team (Flagging Anomalies in Streams related to public Health) is to quickly identify data points that warrant human inspection and create tools to support data review. Towards this goal, our team of researchers, engineers, and data reviewers iterate on our deployed interdisciplinary approach. In this blog series, we will cover the different methods and perspectives of the FlaSH project.</p> |
| 35 | +<p>Members: Ananya Joshi, Nolan Gormley, Richa Gadgil, Tina Townes </p> |
| 36 | +<p>Former Members: Luke Neurieter, Katie Mazaitis </p> |
| 37 | +<p>Advisors: Peter Jhon, Roni Rosenfeld, Bryan Wilder</p> |
0 commit comments