Skip to content

Releases: vmandic/tris-web-crawler

Tris - v1.4.0

10 Feb 00:30
Compare
Choose a tag to compare

Here goes out another Tris web crawler release, the v1.4.0 🎉

Go crawl some links: https://tris.fly.dev

  • no functional changes
  • project source is rewritten to TypeScript with NodeJS
  • added some new package.json scripts

This is basically done to get some better devux even tho I built the initial version with pure JavaScript.

Fun fact, the docker build and deployment to fly.io now takes 5x~ more time. 😄

Maybe DHH was really right in his blog?

Full Changelog: v1.3.1...v1.4.0

Tris - v1.3.1

03 Feb 22:22
Compare
Choose a tag to compare

Tris v1.3.1 is released 🎉 🕷️ ... it is such a silly release where I focused mostly on CSS animation of a spider emoji in a heading just for fun so it might take the page visitor's attention a bit: https://tris.fly.dev

image
  • feat: add a "clear" button to clear the input on home page, hopefully will provide a better UX for mobile users
  • typo fixes
  • the nav back button on crawling results is now a link button
  • added a spider emoji to crawling results
  • repository is renamed to tris-web-crawler
  • added more ideas to TODO.md

Full Changelog: v1.3.0...v1.3.1

Tris - v1.3.0

29 Jan 21:23
Compare
Choose a tag to compare

Tris v1.3.0 🎉 finally adds an end-user top feature for the homepage: a text input for the target URL to crawl upon

Check it out: https://tris.fly.dev/

  • adds CSS and a base color scheme, also some animations
  • adds an input box in index.html to allow the end-user to enter a target domain
  • some meta tags updates for socials

Full Changelog: v1.2.1...v1.3.0

Tris - v1.2.1

28 Jan 16:19
Compare
Choose a tag to compare

A patch release of Tris 🎉, added some small usability details like icons, manifest.json & GA tracking.

  • adds Google Analytics (gonna replace it with seogets soon)
  • adds manifest.json and icons
  • some README.md fixes
  • adds a TODO.md which is where I will track my progress of the project
  • adds a hardcoded version number in the index.html (planning to fix that to come out dynamically)
  • renamed env config SORT_OUTPUT to SORT_FILE_OUTPUT

Full Changelog: v1.2.0...v1.2.1

Tris - v1.2.0

27 Jan 22:59
Compare
Choose a tag to compare

Another (third!) 🎉 release of Tris - now with some CSS to it and better naming to it as it is really a web crawler crawling for links.

  • split the code to more static .js and .css files, makes the HTML file smaller and nicer
  • fixed the meta tag content and made consistent across the both of the HTML files
  • fix the rate limiter to release after having a crawler run finished
  • added some more Tailwind stuff like margins and paddings
  • made it look nice on the phone with rem font-size and viewport fixes
  • added an auto-scroll feature to the results page so you don't have to keep scrolling to the bottom of the page

Full Changelog: v1.1.0...v1.2.0

Tris - v1.1.0

25 Jan 23:45
Compare
Choose a tag to compare

The second release 🎉 of Tris web scraper which is now deployed online where you can test it for free: https://tris.fly.dev/

Mostly a cleanup with some safeguards and a better README.md document.

  • code reformatting
  • makes it deployable to Fly.io
  • simplify socket interaction for TLS/SSL
  • adds a poor man's rate limit to run three parallel scraping processes at max

Tris screenshot of results page running the web scraper in Chrome web browser

Full Changelog: v1.0.0...v1.1.0

Tris - v1.0.0

21 Jan 22:01
Compare
Choose a tag to compare

The first of maybe more releases of my hobby web scraper project - Tris.

  • run locally via CLI or web HTTP server to scrape a target domain
  • run under web HTTP server under Docker
  • the web server streams back the results to the HTML results page via an active socket connection
  • a quite naive, but first working draft implementation doing its job, no edge cases covered, neither UT covered
  • finds and parses <a> element href attribute values and recursively visits (already unvisited) links of a domain and its subdomains related HTML pages
  • supports an initial set of .env options to steer the scraper
  • offers saving the links output to a file
  • counting as "for production use" as it can run locally

Full Changelog: https://github.com/vmandic/tris-simple-spider-scraper/commits/v1.0.0