New OceanParcels website #113
Conversation
This is the final script I used to extract the article data to port to the new site:
"""Script to scrape articles from old OceanParcels website."""
import requests
from bs4 import BeautifulSoup
import json
import re
import sys
def scrape_articles(url):
try:
# Fetch the webpage
response = requests.get(url)
response.raise_for_status() # Raise an exception for bad status codes
# Parse the HTML content
soup = BeautifulSoup(response.text, "html.parser")
# Find all card elements
cards = soup.find_all("div", class_="card")
# List to store extracted article information
articles = []
# Process each card
for card in cards:
try:
# Extract title from h5 element
title_elem = card.find("h5")
title = title_elem.get_text(strip=True) if title_elem else ""
# Extract authors (text immediately after h5)
authors = ""
if title_elem and title_elem.next_sibling:
authors = (
title_elem.next_sibling.strip()
if isinstance(title_elem.next_sibling, str)
else ""
)
# Extract published info (journal, volume, pages)
published_info = ""
if title_elem:
# Find all text between authors and <br/>
next_elem = title_elem.find_next_sibling()
while next_elem and next_elem.name != "br":
if isinstance(next_elem, str):
published_info += next_elem.strip() + " "
else:
published_info += next_elem.get_text(strip=True) + " "
next_elem = next_elem.next_sibling
published_info = published_info.strip()
# Extract DOI from card-link
# Extract DOI from card-link
doi_link = card.find(
"a", class_="card-link", href=lambda href: href and "doi" in href
)
if doi_link:
doi = doi_link.get("href", "")
# Extract abstract from card-body
card_body = card.find("div", class_="card-body")
abstract = card_body.get_text(strip=True) if card_body else ""
# Clean up abstract by replacing newlines and multiple spaces with single space
authors = authors.rstrip(",")
published_info = re.sub(r"\s*,", ",", published_info)
# Create article dictionary
article = {
"title": title,
"published_info": published_info,
"authors": authors,
"doi": doi,
"abstract": abstract,
}
article = {k: re.sub(r"\n\s*", " ", v) for k, v in article.items()}
articles.append(article)
except Exception as card_error:
print(f"Error processing card: {card_error}")
print("Problematic card HTML:")
print(card.prettify())
sys.exit(1)
# Make articles chronological
articles.reverse()
# Save to JSON file
with open("articles.json", "w", encoding="utf-8") as f:
json.dump(articles, f, indent=2, ensure_ascii=False)
print(f"Successfully scraped {len(articles)} articles.")
return articles
except requests.RequestException as e:
print(f"Error fetching URL: {e}")
sys.exit(1)
# Main execution
if __name__ == "__main__":
url = "https://oceanparcels.org/articles.html"
scrape_articles(url) |
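A quick way to sanity-check the output after a run is to reload the JSON and skim the entries. This is a minimal sketch, assuming `articles.json` is in the working directory:

```python
import json

# Reload the scraped articles and print a one-line summary per entry.
with open("articles.json", encoding="utf-8") as f:
    articles = json.load(f)

for article in articles:
    # Every field is a plain string; "doi" holds the full link from the card.
    print(f"{article['title'][:60]} ({article['doi']})")
```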
@erikvansebille
Let's merge on Monday :)
Should we remove the placeholder?
Good point, I just removed it. Will write it in the coming week(?) But perhaps you(?) can write a short blog post celebrating the new website launch, highlighting that we thank xarray for the design?
done :) |
Rename sponsors to funders

Created the new OceanParcels website. Used https://xarray.dev as a starting point.
Made sure in the migration to bring `example_data` across, as that is how parcels downloads example datasets (see the sketch after this description).

Items still TODO:
Fixes #112
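For context on the `example_data` note above: parcels pulls its tutorial datasets from that directory on the website, so the migration had to keep it reachable. A minimal sketch of that download path, assuming the `download_example_dataset` helper available in recent parcels versions and a dataset name used here for illustration:

```python
from parcels import download_example_dataset

# Fetches the named example dataset from the example_data directory hosted
# on oceanparcels.org and returns the local path it was downloaded to.
data_path = download_example_dataset("MovingEddies_data")
print(data_path)
```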