Releases: FullFact/health-misinfo-shared

v1.1.0

01 Apr 15:15
f5abd8a

Full Changelog: v1.0.0...v1.1.0

The One Where We Do YouTube

19 Aug 12:02
51671af

What's Changed

  • feat: change database schema by @ff-dh in #82
  • refactor: replace react with a flask-only version by @andylolz in #78
  • 70 change main inference model to multi label version by @dcorney in #76
  • fix: more accurate chunk start and end offsets by @andylolz in #87
  • 73-write-an-evaluation-script-using-promptfoo by @dearden in #83
  • 95 Ask for a direct quote as well as a paraphrase in multi label prompt by @dearden in #96
  • dev -> main. Schema changes; switch to flask-only; multi-label model by @andylolz in #84
  • feat: COPY data in dockerfile to have training claims by @ff-dh in #97
  • fix: store the sentence text, not the chunk by @andylolz in #98
  • feat: refine claim offsets by @andylolz in #102
  • fix: limit history to 20 items by @andylolz in #109
  • feat: show transcript alongside claims by @andylolz in #110
  • fix: update captions regex by @andylolz in #115
  • fix: pass raw transcript text to LLM by @andylolz in #108
  • 116 improve json parsing of generated output by @dearden in #117
  • Evaluation assertions by @dearden in #112
  • 81 evaluation write function to run evaluation code during promptfoo evaluation by @dearden in #94
  • Evaluation test for checking if the quote appears in the chunk by @dearden in #99
  • Add a test for checking the format of the LLM output by @dearden in #105
  • feat: user registration routes by @ff-dh in #121
  • fix: use user_id in runs & rename disable route by @ff-dh in #123
  • chore: update readme for new users handling by @ff-dh in #128
  • 131 remove evaluationpy by @dearden in #132
  • fix: only call google.auth.default() from inside a function by @andylolz in #134
  • fix: remove text from labels by @andylolz in #126
  • fix: check end_idx before using it by @andylolz in #129
  • 91 improve generation of labels for in context learning by @c-j-johnston in #127
  • fix: add claim summaries again for app by @c-j-johnston in #135
  • fix: add missing data file by @c-j-johnston in #137
  • feat: parse output jsons using existing function by @c-j-johnston in #139
  • feat: don't limit number of claims per chunk by @c-j-johnston in #142
  • 140 add format checker by @c-j-johnston in #144
  • refactor: assume we have exactly one candidate by @andylolz in #148
  • feat: only show current user’s video history by @andylolz in #150
  • feat: add summary to LLM output in evaluation by @c-j-johnston in #155
  • fix: check dataframe columns by @c-j-johnston in #158
  • 159 make package self contained by moving prompt data into src folder by @dcorney in #160
  • feat: add notice to home page by @scoltman in #163
  • Ensure example output is valid JSON by @andylolz in #172

Full Changelog: 0.0.1...v1.0.0

0.0.1

23 May 13:30
cbd5c86

Initial Release

  • database create script/module by @ff-dh in #10
  • Rename FOREIGN KEY id -> video_id by @ff-dh in #13
  • 3 enhance training data by @c-j-johnston in #15
  • 4 make evaluation robust by @dcorney in #5
  • 8 update list of search keywords and phrases to get a wider variety of health misinformation by @dearden in #19
  • feat: merge backend to main by @ff-dh in #28
  • feat: merge Frontend to main by @ff-dh in #27
  • feat: ansible playbooks for deployment by @ff-dh in #30
  • Copy changes by @andylolz in #32
  • Remove sqlite database from source control by @andylolz in #46
  • feature: use absolute imports with package modules by @ff-dh in #48
  • build: install node dependencies with npm ci by @andylolz in #47
  • feat: read baseUrl from an environment variable by @andylolz in #43
  • docs: add ‘getting started’ instructions to README by @andylolz in #45
  • refactor: remove tools.db by @andylolz in #52
  • 24 use fact checker annotated data for in context learning by @dcorney in #50
  • 58 add factchecker persona to prompts by @c-j-johnston in #59
  • 56 chunks of youtube transcripts should include their timestamp offsets by @dcorney in #60
  • feat: auto-deploy from GitHub Actions by @andylolz in #55
  • fix: increase timeout from 30 seconds (default) to 30 minutes by @JamesMcMinn in #57
  • refactor: rationalise into a single version by @andylolz in #63
  • fix: unescape HTML entities in titles by @andylolz in #65
  • feat: add MODEL_SHA to environment by @andylolz in #75
  • 38 investigate genai evaluation tools by @dearden in #72
  • 56 (take 2) chunks of youtube transcripts should include their timestamp offsets by @dcorney in #68

Full Changelog: https://github.com/FullFact/health-misinfo-shared/commits/0.0.1