mesa@piefed.social to Technology@lemmy.worldEnglish · 2 months agoTesla said it didn’t have key data in a fatal crash. Then a hacker found it.www.washingtonpost.comexternal-linkmessage-square25fedilinkarrow-up1573arrow-down11file-textcross-posted to: fuck_ai@lemmy.worldtechnology@beehaw.org
arrow-up1572arrow-down1external-linkTesla said it didn’t have key data in a fatal crash. Then a hacker found it.www.washingtonpost.commesa@piefed.social to Technology@lemmy.worldEnglish · 2 months agomessage-square25fedilinkfile-textcross-posted to: fuck_ai@lemmy.worldtechnology@beehaw.org
minus-squareMonkderVierte@lemmy.ziplinkfedilinkEnglisharrow-up11·edit-22 months agoHow does archive get the unpaywalled version? I don’t think they pay the subscription for every single tabloid out there? Asking for a friend.
minus-squarestoly@lemmy.worldlinkfedilinkEnglisharrow-up7·2 months agoThe paywall is JavaScript but the content is still in plaintext below. The crawlers don’t read the JavaScript.
minus-squareMonkderVierte@lemmy.ziplinkfedilinkEnglisharrow-up8·2 months agoDisabling 3rd-party js has no paywall, but only the first paragraph too. Crawlers get full access?
minus-squareAnarchistArtificer@slrpnk.netlinkfedilinkEnglisharrow-up6·2 months agoI think they use the same thing that web crawlers use. If Google’s crawler couldn’t access the content of the page (or could only access a limited amount of content), it would likely rank far lower in search results
minus-squareMonkderVierte@lemmy.ziplinkfedilinkEnglisharrow-up3·edit-22 months agoBtw, how come there is no search engine where you can sort and filter how you want instead of how they want? (except self-hosted i mean) Pornhub has better searchability than, uh, all search sites i know.
How does archive get the unpaywalled version? I don’t think they pay the subscription for every single tabloid out there?
Asking for a friend.
The paywall is JavaScript but the content is still in plaintext below. The crawlers don’t read the JavaScript.
Disabling 3rd-party js has no paywall, but only the first paragraph too. Crawlers get full access?
I think they use the same thing that web crawlers use. If Google’s crawler couldn’t access the content of the page (or could only access a limited amount of content), it would likely rank far lower in search results
Btw, how come there is no search engine where you can sort and filter how you want instead of how they want? (except self-hosted i mean)
Pornhub has better searchability than, uh, all search sites i know.