Releases: vmandic/tris-web-crawler
Tris - v1.4.0
Here goes out another Tris web crawler release, the v1.4.0 🎉
Go crawl some links: https://tris.fly.dev
- no functional changes
- project source is rewritten to TypeScript with NodeJS
- added some new package.json scripts
This is basically done to get some better devux even tho I built the initial version with pure JavaScript.
Fun fact, the docker build and deployment to fly.io now takes 5x~ more time. 😄
Maybe DHH was really right in his blog?
Full Changelog: v1.3.1...v1.4.0
Tris - v1.3.1
Tris v1.3.1 is released 🎉 🕷️ ... it is such a silly release where I focused mostly on CSS animation of a spider emoji in a heading just for fun so it might take the page visitor's attention a bit: https://tris.fly.dev
- feat: add a "clear" button to clear the input on home page, hopefully will provide a better UX for mobile users
- typo fixes
- the nav back button on crawling results is now a link button
- added a spider emoji to crawling results
- repository is renamed to tris-web-crawler
- added more ideas to TODO.md
Full Changelog: v1.3.0...v1.3.1
Tris - v1.3.0
Tris v1.3.0 🎉 finally adds an end-user top feature for the homepage: a text input for the target URL to crawl upon
Check it out: https://tris.fly.dev/
- adds CSS and a base color scheme, also some animations
- adds an input box in
index.html
to allow the end-user to enter a target domain - some meta tags updates for socials
Full Changelog: v1.2.1...v1.3.0
Tris - v1.2.1
A patch release of Tris 🎉, added some small usability details like icons, manifest.json & GA tracking.
- adds Google Analytics (gonna replace it with seogets soon)
- adds manifest.json and icons
- some README.md fixes
- adds a TODO.md which is where I will track my progress of the project
- adds a hardcoded version number in the index.html (planning to fix that to come out dynamically)
- renamed env config SORT_OUTPUT to SORT_FILE_OUTPUT
Full Changelog: v1.2.0...v1.2.1
Tris - v1.2.0
Another (third!) 🎉 release of Tris - now with some CSS to it and better naming to it as it is really a web crawler crawling for links.
- split the code to more static .js and .css files, makes the HTML file smaller and nicer
- fixed the meta tag content and made consistent across the both of the HTML files
- fix the rate limiter to release after having a crawler run finished
- added some more Tailwind stuff like margins and paddings
- made it look nice on the phone with rem font-size and viewport fixes
- added an auto-scroll feature to the results page so you don't have to keep scrolling to the bottom of the page
Full Changelog: v1.1.0...v1.2.0
Tris - v1.1.0
The second release 🎉 of Tris web scraper which is now deployed online where you can test it for free: https://tris.fly.dev/
Mostly a cleanup with some safeguards and a better README.md document.
- code reformatting
- makes it deployable to Fly.io
- simplify socket interaction for TLS/SSL
- adds a poor man's rate limit to run three parallel scraping processes at max
Full Changelog: v1.0.0...v1.1.0
Tris - v1.0.0
The first of maybe more releases of my hobby web scraper project - Tris.
- run locally via CLI or web HTTP server to scrape a target domain
- run under web HTTP server under Docker
- the web server streams back the results to the HTML results page via an active socket connection
- a quite naive, but first working draft implementation doing its job, no edge cases covered, neither UT covered
- finds and parses
<a>
element href attribute values and recursively visits (already unvisited) links of a domain and its subdomains related HTML pages - supports an initial set of .env options to steer the scraper
- offers saving the links output to a file
- counting as "for production use" as it can run locally
Full Changelog: https://github.com/vmandic/tris-simple-spider-scraper/commits/v1.0.0