
Feat: extract pagetype from og:type or ld+json #374


Merged 1 commit on Dec 5, 2023

Conversation

andremacola (Contributor) commented Dec 4, 2023

This pull request addresses issue #373 and includes several improvements:

1 - Refactors extractMetaData to avoid repetition and improve maintainability.
2 - Adds JSON-LD schema extraction for parameters such as description, image, author, published, and type, used as a fallback when extractMetaData does not find them.
3 - Opens up the possibility of extracting further parameters through the JSON-LD schema.
4 - Adds a test for extractLDSchema.
5 - Adds page type extraction from the og:type meta tag, with types based on https://schema.org/docs/full.html (only types related to articles, blogs, and websites are returned).

No external dependencies were added; the changes rely only on article-extractor's existing logic.
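
The fallback described in item 2 can be sketched roughly as follows. This is a minimal illustration, not the code from this pull request: `parseLdJson`, `fillFromLdJson`, and `ALLOWED_TYPES` are hypothetical names, the regex-based script lookup stands in for real DOM parsing, and the list of accepted types is an assumed subset of the schema.org types mentioned in item 5.

```javascript
// Hypothetical sketch: names here are illustrative, not the library's actual API.

// Assumed subset of schema.org types related to articles, blogs, and websites.
const ALLOWED_TYPES = new Set([
  'article', 'newsarticle', 'blogposting', 'blog', 'website', 'webpage',
])

// Collect every JSON-LD object embedded in the HTML.
// Blocks that fail to parse are skipped so one bad script cannot abort extraction.
const parseLdJson = (html) => {
  const re = /<script[^>]*type=["']application\/ld\+json["'][^>]*>([\s\S]*?)<\/script>/gi
  const items = []
  for (const match of html.matchAll(re)) {
    try {
      const data = JSON.parse(match[1])
      items.push(...(Array.isArray(data) ? data : [data]))
    } catch {
      // Malformed JSON-LD is ignored.
    }
  }
  return items
}

// Fill only the fields that meta-tag extraction left empty.
const fillFromLdJson = (meta, html) => {
  const result = { ...meta }
  for (const item of parseLdJson(html)) {
    if (item === null || typeof item !== 'object') continue
    const rawType = item['@type']
    const type = typeof rawType === 'string' ? rawType.toLowerCase() : ''
    const candidates = {
      description: item.description,
      image: typeof item.image === 'string' ? item.image : item.image?.url,
      author: typeof item.author === 'string' ? item.author : item.author?.name,
      published: item.datePublished,
      type: ALLOWED_TYPES.has(type) ? type : undefined,
    }
    for (const [key, value] of Object.entries(candidates)) {
      // Values already found in meta tags take precedence over JSON-LD.
      if (!result[key] && typeof value === 'string' && value) {
        result[key] = value
      }
    }
  }
  return result
}

// Usage: the meta tags supplied a description, so only the empty fields are filled.
const html = `<html><head>
<script type="application/ld+json">
{"@type":"NewsArticle","description":"From schema","datePublished":"2023-12-04","author":{"name":"Jane"}}
</script>
</head></html>`

const meta = fillFromLdJson(
  { description: 'From meta tag', author: '', published: '', type: '' },
  html
)
console.log(meta)
```

Treating JSON-LD strictly as a fallback keeps the existing meta-tag behavior unchanged for pages that already expose those tags, and skipping unparseable blocks avoids failing the whole extraction on a single malformed script.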

ndaidong (Collaborator) commented Dec 5, 2023

@andremacola thank you, I will check and merge this soon.

@ndaidong ndaidong changed the base branch from main to dev December 5, 2023 00:22
@ndaidong ndaidong merged commit f84aec2 into extractus:dev Dec 5, 2023
ndaidong added a commit that referenced this pull request Dec 5, 2023
- Merge pr #374 by @andremacola (issue #373)
- Update dependencies
- Update CI config
- Fix function call in eval.js
@ndaidong ndaidong mentioned this pull request Dec 5, 2023
ndaidong added a commit that referenced this pull request Jan 22, 2024
- Fix error while parsing ldjson
- Update dependencies

Related issues: #378, #374, #373
@ndaidong ndaidong mentioned this pull request Jan 22, 2024