Skip to content

Conversation

@FrancescoSaverioZuppichini
Copy link

@FrancescoSaverioZuppichini FrancescoSaverioZuppichini commented Nov 21, 2025

Thanks a lot for the work on this repo guys! Evaluations are most needed in this field :)

Description

This PR adds:

  • scrapegraphai
  • edited README + notebook (created a color map dictionary)
  • rerun firecrawl

Scrapegraph ranks first on the success_rate by a small margin.

newplot

However the benchmark's is not really a benchmark since the ground truth are not always correct, #1 so the other metrics should be taken with a grain of salt. I've rerun firecrawl to prove my point, it lost 2 points on f1 because yeah ground truth have changed.

Not sure what is the plan here but either we come up with a solution to host the websites for the benchmark so they cannot change or it a non scientific way to evaluate scraping services.

Let me know your thoughts!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant