It also works with JavaScript-heavy websites like Mastodon and Youtube, which the standard “Save Page” feature implemented in all browsers usually fails to save, though some features like collapsibles are missing.

  • renzev@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    17 minutes ago

    Also recommending ArchiveBox – it takes the shotgun approach to archiving websites, making snapshots with a bunch of different tools, singlefile being among them. You can use it as a one-shot command, or run it as a web server which provides a UI for archiving web pages. Linkwarden is similar.

  • /home/pineapplelover@lemmy.dbzer0.com
    link
    fedilink
    arrow-up
    48
    arrow-down
    1
    ·
    edit-2
    21 days ago

    Am I missing something? How is this different from CTRL+S

    Edit: clicked on link and saw demo video. Pretty cool, saves all images and assets without having that big folder of stuff

    • e0qdk@reddthat.com
      link
      fedilink
      arrow-up
      11
      ·
      edit-2
      21 days ago

      This takes a snapshot of the HTML elements from when they were loaded in your browser. If the page loads content dynamically, HTTrack won’t save it but this can. (i.e. this works better on crappy modern sites that need JS to even just load the article text…)

    • 3abas@lemmy.world
      link
      fedilink
      arrow-up
      8
      ·
      21 days ago

      A property secured site won’t serve its css and js when requested from a non configured domain (such as localhost), so the html is the only part that will work when you save the page as.

      • Atomic@sh.itjust.works
        link
        fedilink
        arrow-up
        2
        arrow-down
        5
        ·
        edit-2
        21 days ago

        Selecting “Save as” is not the same as “view while offline”.

        “Save as” is just a legacy function that has been passed down to Save the HTML into a file. It’s not used to “save” pages or sites to view offline.

        The ability to properly Save a page to view offline is already included in most browsers. Even Edge has that functionality built into it.

        • 3abas@lemmy.world
          link
          fedilink
          arrow-up
          8
          ·
          20 days ago

          The ability to properly Save a page to view offline is already included in most browsers. Even Edge has that functionality built into it.

          Go ahead and show the classroom what you’re talking about…

    • e0qdk@reddthat.com
      link
      fedilink
      arrow-up
      19
      ·
      21 days ago

      It stores the actual HTML structure and assets, so you can still view the page as it was more-or-less intended instead of it getting split up across print pages.

  • Strawberry@lemmy.blahaj.zone
    link
    fedilink
    arrow-up
    3
    ·
    20 days ago

    I use this to save the job listing of every job I apply to, for reference when I get a call back and the listing has been taken down. You can highlight a section of the page and it smartly saves just the page content needed to contain the highlighted portion. Really keeps the size down.

  • goodboyjojo@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    20 days ago

    that’s cool. also you can use stuff like htrack to backup web pages and make them into zim files for software like kiwix