Feat: extract pagetype from og:type or ld+json #374

andremacola · 2023-12-04T22:56:15Z

This pull request addresses the issue #373 and includes several improvements:

1 - Refactoring of extractMetaData to avoid repetition and enhance maintainability.
2 - Adds extraction of JSON+LD Schema for certain parameters such as description, image, author, published and type in case they are not found by extractMetaData. This provides an additional extraction option for these parameters.
3 - Opens up the possibility to enhance and expand the extraction of other parameters through the JSON+LD Schema.
4 - Adds a test for extractLDSchema.
5 - Adds the OG:TYPE meta tag based on https://schema.org/docs/full.html (returns only types related to articles, blogs, and websites).

No external dependencies were used, only the logic from the article-extractor itself.

ndaidong · 2023-12-05T00:22:33Z

@andremacola thank you, I will check and merge this soon.

@andremacola

- Merge pr #374 by @andremacola (issue #373) - Update dependencies - Update CI config - Fix function call in eval.js

- Fix error while parsing ldjson - Update dependencies Related issues: #378, #374, #373

Feat: extract pagetype from og:type or ld+json

2fe4d72

ndaidong changed the base branch from main to dev December 5, 2023 00:22

ndaidong merged commit f84aec2 into extractus:dev Dec 5, 2023

ndaidong added a commit that referenced this pull request Dec 5, 2023

v8.0.8

0fd6c66

- Merge pr #374 by @andremacola (issue #373) - Update dependencies - Update CI config - Fix function call in eval.js

ndaidong mentioned this pull request Dec 5, 2023

v8.0.8 #375

Merged

ndaidong mentioned this pull request Jan 21, 2024

Expected ',' or '}' after property value in JSON at position 543 (line 23 column 7) #378

Closed

ndaidong added a commit that referenced this pull request Jan 22, 2024

v8.0.5

901d1cf

- Fix error while parsing ldjson - Update dependencies Related issues: #378, #374, #373

ndaidong mentioned this pull request Jan 22, 2024

v8.0.5 #379

Merged

This was referenced Jul 26, 2024

[Snyk] Upgrade @extractus/article-extractor from 8.0.4 to 8.0.10 jackkershaw/app-defaults#2

Merged

[Snyk] Upgrade @extractus/article-extractor from 8.0.3 to 8.0.10 jackkershaw/app-defaults#3

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: extract pagetype from og:type or ld+json #374

Feat: extract pagetype from og:type or ld+json #374

andremacola commented Dec 4, 2023 •

edited

Loading

ndaidong commented Dec 5, 2023

Feat: extract pagetype from og:type or ld+json #374

Feat: extract pagetype from og:type or ld+json #374

Conversation

andremacola commented Dec 4, 2023 • edited Loading

ndaidong commented Dec 5, 2023

andremacola commented Dec 4, 2023 •

edited

Loading