Receives a URL and outputs the top 25 most common words in the page. Takes the url as the one and only parameter.
- URL passed to the class is fully formed and valid for JSoup to use.
- Outputting the list to the terminal is fine
- The SiteTextGetter implementation works correctly
- Jsoup does not correctly get all text in all instances. That is noted in its class.
- Counting words is not case sensitive
- Numbers (e.g., 123) are not words, but single letter "words" like 'v' or 'x' are
Run as you would a normal jar.
java -jar SiteTextRanker.jar https://www.your-cool-site.com