twikiget

About

twikiget is a tool that downloads TWiki pages and archives them in WARC (.warc) format. It uses wget underneath, so all of wget's downloading features are available.

Features

  • download and archive a specific TWiki page together with all its attachments
  • create WARC files for long-term preservation
  • keep a local cache for faster, periodic reprocessing
  • (planned) extract specific metadata from TWiki document markup according to configurable templates
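
Because twikiget relies on wget, the kind of command involved can be sketched in a few lines of Python. This is only an illustration, not twikiget's actual implementation or command-line interface: the helper names `build_wget_command` and `archive_page`, the example URL, and the `cache` directory are all hypothetical. The wget options used (`--page-requisites`, `--warc-file`, `--directory-prefix`) are standard wget flags.

```python
import subprocess


def build_wget_command(url, warc_name, cache_dir):
    """Build a wget argument list that saves a page and writes a WARC file.

    Hypothetical helper for illustration; twikiget may call wget differently.
    """
    return [
        "wget",
        "--page-requisites",                # also fetch images, CSS, etc. needed by the page
        f"--warc-file={warc_name}",         # write the traffic to <warc_name>.warc.gz
        f"--directory-prefix={cache_dir}",  # keep downloaded files in a local directory
        url,
    ]


def archive_page(url, warc_name, cache_dir="cache"):
    """Run wget for a single page; requires wget to be installed."""
    command = build_wget_command(url, warc_name, cache_dir)
    return subprocess.run(command, check=False).returncode
```

Calling `archive_page("https://twiki.example.org/bin/view/Main/WebHome", "webhome")` would, under these assumptions, leave a `webhome.warc.gz` file in the working directory and the fetched files under `cache/`. How twikiget locates and fetches TWiki attachments is not covered by this sketch.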

Useful links