• PKscope@lemmy.world
    link
    fedilink
    English
    arrow-up
    292
    arrow-down
    2
    ·
    7 天前

    Tackling the problems that really matter. Good job, FBI.

    Fucking clowns.

  • Balldowern@lemmy.zip
    link
    fedilink
    English
    arrow-up
    142
    ·
    7 天前

    Why isn’t the FBI doing anything about Epstein island list ? That’s more important than some archive website.

  • rekabis@lemmy.ca
    link
    fedilink
    English
    arrow-up
    111
    arrow-down
    1
    ·
    7 天前

    The FBI is probably going nuts here because someone inadvertently archived the Epstein files and everyone at HQ is panicking. They need to purge it for the Internet before someone discovers that archived content, and so they’re using CP as an excuse.

    • conorab@lemmy.conorab.com
      link
      fedilink
      English
      arrow-up
      12
      ·
      6 天前

      It occasionally catches things that archive.org misses too. Also really nice to have an alternative.

      It’d be nice to have a way of doing decentralised archiving while still keeping the trust. If you’re trying to prove that a site really said something at a certain date to another person, pointing to your own archive is kinda useless.

        • conorab@lemmy.conorab.com
          link
          fedilink
          English
          arrow-up
          2
          ·
          2 天前

          It would help! It would establish that an archive was made no later than the date it was recorded on a blockchain (assuming the archiver isn’t also the one the made the original content in which case they can upload it after making the “archive”). You would still need to prove the trustworthiness of the archived data and at the moment the only thing we have for that is just trusting the archiver.

          You could do something like have multiple archivers archive the same site in s stripped down for like plain text (so that differences caused by time or day, ads, etc don’t change the hash) and that way you can say that X amount of archivers agree that the site looked like that at that time.

      • Optional@lemmy.world
        link
        fedilink
        English
        arrow-up
        13
        arrow-down
        3
        ·
        7 天前

        So basically you need to spam me. Because a donation plea every so often . . .doesn’t get enough addresses to sell?

        I’m saying it’s a flawed implementation is all.

        • NotSteve_@piefed.ca
          link
          fedilink
          English
          arrow-up
          26
          ·
          7 天前

          Purely anecdotal but they’re the only news site that I’ve ever given my email to and I actually enjoy seeing their emails. They send entire (interesting) articles that can be read with no CSS/tracking images enabled and their monetisation is a small text ad that breaks a single couple of paragraphs.

          I’ve never gotten an email from them that was begging for money or anything like that, just basically an RSS feed of interesting articles

        • Prove_your_argument@piefed.social
          link
          fedilink
          English
          arrow-up
          11
          arrow-down
          2
          ·
          7 天前

          The idea that forcing a signup (building a web of information about a user through the use of cookies and other browser metadata) to protect against AI (that is gonna use tooling, mirrors, proxies and any number of fully working methodologies) is ludicrous.

          They just want to track who you are, what you do, and then sell that data which should never have been gathered in the first place as part of their advertising revenue.

          • DesertCreosote@piefed.blahaj.zone
            link
            fedilink
            English
            arrow-up
            9
            ·
            7 天前

            Normally I would agree with you, but given how much they care about privacy (as indicated by what they write about and talk about on their podcast), I don’t think tracking is what they’re after in this specific case.

            And they know that the signup won’t completely block AI, but it does help.

    • brbposting@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      13
      ·
      7 天前

      Softest paywall ever - they do such good work, they can have an anonymous email of mine no problem

      Magic link’s so annoying though, just wanna password (they’re journalists not techies though is the long and short of it)

  • Knock_Knock_Lemmy_In@lemmy.world
    link
    fedilink
    English
    arrow-up
    71
    ·
    7 天前

    The archive runs Apache Hadoop and Apache Accumulo. All data is stored on HDFS, textual content is duplicated 3 times among servers in 2 datacenters and images are duplicated 2 times. Both datacenters are in Europe, with OVH hosting at least one of them.

    To avoid detection, archive.today runs via a botnet that cycles through countless IP addresses, making it quite difficult for grumpy webmasters to stop their sites getting scraped. Access to paywalled sites is through logins secured via unclear means, which need to be replenished constantly: here’s the creator asking for Instagram credentials. Finally, the serving of the website is also subject to a perpetual game of cat and mouse: “I can only predict that there will be approximately one trouble with domains per year and each fifth trouble will result in domain loss.” As of today, archive.today still works, but users are redirected to archive.md.

  • snoons@lemmy.ca
    link
    fedilink
    English
    arrow-up
    82
    arrow-down
    2
    ·
    7 天前

    Friends of tech Bros Incorporated.

    Regulatory capture is complete in the states.

  • dan1101@lemmy.world
    link
    fedilink
    English
    arrow-up
    27
    ·
    6 天前

    The news sites are trying to have it both ways. Serving the news articles to visitors and then covering them up with a paywall with browser tricks.

      • ITGuyLevi@programming.dev
        link
        fedilink
        English
        arrow-up
        13
        ·
        6 天前

        I would put that more on the ad networks, if the ads were related to the article, it may generate a few more clicks. The ads are completely random and built off a profile they assume would contain relevant info about me… but it doesn’t really seem to be accurate (this is kind of by my own choosing though).

        Instead articles about rebuilding cars should have ads related to perhaps rebuilding cars and not some fucking nutritional supplement or some other unrelated thing.

        • silence7@slrpnk.netOP
          link
          fedilink
          English
          arrow-up
          5
          ·
          edit-2
          6 天前

          Better ad targeting does make ads more valuable…but because only Google and Facebook have the visibility and ML to do it effectively, they wound up with all the ad revenue. Everybody else ended up with a few pennies

          • willington@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            1
            ·
            edit-2
            5 天前

            Owners win.

            If I have a plot of land with some cherry trees on it, I can get a landless person to pick most of them for free.

            I make an offer “for every 100 pounds/kilos of cherries you pick for me, I’ll let you keep 1”. If the person who receives such an offer has no land of their own, they have to agree to avoid starvation.

            That’s why our system needs a huge class of the landless, resourceless, and assetless people. Then for the priveledge of touching a privatized resource you have accept the privateer’s conditions.

            Fencing off resources and protecting the fence by the threat of death is how this scam works.

            And it is impossible to fix this societal problem by simply trading more and better as an individual. The ruleset of the game is tuned for mass free energy extraction from the assetless class at the macro level.

  • girlthing@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    31
    ·
    edit-2
    6 天前

    The owner should release the source code / configuration, in whatever state it’s in, before things escalate further. It’d suck for all their work to go down the drain. I’m sure there’d be people willing to adopt the project and host instances.

    If you agree and you have Tumblr, would you consider asking them anonymously?

    https://blog.archive.today/ask

      • eah@programming.dev
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        6 天前

        The administration didn’t threaten to take down the IA or investigate it or anything like that, so it’s not similar at all.

        It’s conspiratorial to think the FBI is doing this to censor or hide something. archive.is is primarily used to get around paywalls. The most likely explanation is news sites complained to the FBI that their copyrights are being violated (which is true), so the FBI is investigating. They’ve had a problem with falling revenue for a decade or more at this point as everything went online and people expected to get instant access for free in contrast to print media.

      • deathbird@mander.xyz
        link
        fedilink
        English
        arrow-up
        1
        ·
        6 天前

        I suspect they’re going after .is because they are more resistant to taking things down. But that’s speculation on my part. And even if I’m right, what is it that they actually are trying to remove?

        • Pup Biru@aussie.zone
          link
          fedilink
          English
          arrow-up
          3
          ·
          edit-2
          5 天前

          voyager automatically opens links in reader mode for me and it works about 80% of the time

          (but this article it doesn’t work for)

          • Cricket [he/him]@lemmy.zip
            link
            fedilink
            English
            arrow-up
            2
            ·
            5 天前

            Interesting, my experience with reader mode to get around paywalls is just about the opposite - it works may 20% of the time. Probably different sites that we’re visiting.

    • punkibas@lemmy.zip
      link
      fedilink
      English
      arrow-up
      8
      ·
      6 天前

      I have JavaScript disabled by default on all pages, I only activate it if I need to, as per the privacyguides recommendations, but on this site at least, it still won’t load the article. If I want to read it I’d have to either register or use the archive.

  • Broadfern@lemmy.world
    link
    fedilink
    English
    arrow-up
    30
    ·
    7 天前

    That would explain why adguard’s public DNS started blocking it (labeled vaguely as “legal request”).

    • Daemon Silverstein@calckey.world
      link
      fedilink
      arrow-up
      9
      ·
      7 天前

      @a_person@piefed.social @silence7@slrpnk.net @technology@lemmy.world

      Same when I tried to access the archived version of the linked article of this thread. I was faced by a TLS error I never saw before (SSL_ERROR_INTERNAL_ERROR_ALERT), so I thought the Archive Today was facing server-side issues, until I decided to try accessing through the smartphone, and no error happened there.

      I only managed to access Archive Today through my computer after disabling several security things, which seems quite suspicious, as if the Archive Today were being hijacked by a MitM (possibly the FBI themselves? They’re famous for setting up honeypots) who were trying to push malicious code/tracking to whomever access it.

      I would be further worried if I were USian or a citizen from Global North (as I’m Brazilian and from Global South, I can tell the FBI to go pound sand, lol).

      To USians, my suggestion is caution accessing Archive Today (at least the current IP address being pointed at by mainstream DNS resolvers) for a while, as the server, while seemingly Archive Today, may be actually some kind of FBI honeypot in disguise. It goes without saying how ICANN and IANA are US entities, prone to interference from three-lettered US agencies. There are alternatives to Archive Today, such as Ghost Archive and 12ft.