    Repositories list

    • Sous-chef kitchen: Self-service data access for sous-chef.
      Python
      1051 Updated Jul 31, 2025
    • devops tools
      Python
      1000 Updated Jul 31, 2025
    • Intelligently fetch lists of URLs from a large collection of RSS Feeds as part of the Media Cloud Directory.
      Python
      68111 Updated Jul 30, 2025
    • Public client for consuming content from the Media Cloud Online News Archive & Directory.
      Python
      327741 Updated Jul 29, 2025
    • sous-chef

      Public
      Configurable Data Analytics Pipeline
      Python
      0160 Updated Jul 25, 2025
    • The core pipeline used to ingest online news stories in the Media Cloud archive.
      Python
      56362 Updated Jul 23, 2025
    • Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory.
      JavaScript
      1712680 Updated Jul 17, 2025
    • es-tools

      Public
      Elasticsearch tools developed by the Media Cloud project
      Python
      1100 Updated Jul 16, 2025
    • Ansible playbook for Elasticsearch
      Ruby
      853000 Updated Jun 22, 2025
    • A set of Jupyter notebooks demonstrating how to use the Media Cloud API.
      Jupyter Notebook
      143900 Updated Jun 17, 2025
    • Internal library to allow querying multiple media platforms with a consistent API.
      Python
      5262 Updated Jun 16, 2025
    • FastAPI server for Media Cloud Vitals page
      Python
      0020 Updated May 20, 2025
    • news-search-api

      Public archive
      Internal API server that offers search access to the Media Cloud Online News Archive (in Elasticsearch).
      Python
      41100 Updated Mar 12, 2025
    • UNDER CONSTRUCTION - A package containing a library of issue validators in a flexibly deployable wrapper.
      Jupyter Notebook
      2091 Updated Jan 31, 2025
    • How Media Cloud approaches extracting metadata from online news stories
      Python
      61580 Updated Dec 22, 2024
    • An internal client library to access the new Media Cloud news archive search.
      Python
      2031 Updated Oct 10, 2024
    • Find RSS, Atom, XML, and RDF feeds on webpages
      Python
      133041 Updated Oct 10, 2024
    • Simple toolkit for consuming sitemaps
      Python
      1420 Updated Oct 9, 2024
    • mc-manage

      Public
      Python
      0000 Updated Oct 8, 2024
    • Daily performance metrics for the mediacloud application
      Python
      0010 Updated Sep 20, 2024
    • A Python client for the CLIFF geoparsing tool
      Python
      5501 Updated May 21, 2024
    • A client library to access the Wayback Machine news archive search.
      Python
      2410 Updated Dec 15, 2023
    • web-tools

      Public archive
      The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)
      JavaScript
      3065314 Updated Dec 14, 2023
    • backend

      Public archive
      Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online media.
      Python
      9028313125 Updated Nov 20, 2023
    • Dokku app that serves a static HTML catch-all page, displayed for bad domains
      HTML
      0000 Updated Oct 25, 2023
    • A simple homepage for the CLIFF project
      HTML
      1100 Updated May 30, 2023
    • Tag news stories based on models trained on the NYT corpus.
      Python
      134216 Updated Mar 1, 2023
    • Text-to-sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.
      Python
      3224922 Updated Nov 7, 2022
    • glimpse

      Public archive
      Get a glimpse of attention to a topic on social media.
      Python
      2280 Updated Sep 19, 2022
    • Helpful micro-service to return results from word2vec models
      Python
      4200 Updated Jul 29, 2022