Hypertext archival systems

Heritrix3

@ https://github.com/internetarchive/heritrix3

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Webrecorder

@ https://webrecorder.net/

A suite of open source tools and packages to capture interactive websites and replay them at a later time as accurately as possible.

HTTrack

@ https://www.httrack.com/

Allows you to download a World Wide Web site from the Internet to a local directory.

wget

@ https://www.gnu.org/software/wget/

Retrieves files using HTTP, follows links, maps links to resources in filesystem, useful for archiving and mirroring web resources.