Relative URLs were mishandled, causing potential duplicate page visits #10

Sachin-NK · 2025-02-16T14:20:16Z

The web scraper wasn't handling web addresses correctly. It was getting confused by relative links (like those that just say "go up a level" instead of giving the full address). This meant it could accidentally visit the same page multiple times, wasting time and possibly messing up the data it was collecting. The fix makes sure all web addresses are complete before the scraper uses them, so it knows exactly which page is which.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relative URLs were mishandled, causing potential duplicate page visits #10

Relative URLs were mishandled, causing potential duplicate page visits #10

Sachin-NK commented Feb 16, 2025

Relative URLs were mishandled, causing potential duplicate page visits #10

Relative URLs were mishandled, causing potential duplicate page visits #10

Comments

Sachin-NK commented Feb 16, 2025