#CloudFlare is now hitting the archive.org wayback machine with the same #CAPTCHA as #Tor users, thus censoring history too.
@nipos @resist1984 they only save websites that allow crawlers. So disabling crawlers for website means it won't be saved.
wayback machine respects robots.
https://blog.reputationx.com/block-wayback-machine
And claiming that it is "one more reason to use cloudflare" is kinda wierd.
And yes, you can get your site removed from wayback machine.
@fortune
@nipos @nedelne_rano @resist1984 @frommMoritz
You can also request do not be added anymore
@nipos
@nedelne_rano @resist1984 @frommMoritz
Google cache
@nipos @nedelne_rano @resist1984 @frommMoritz
Also one of my hobbies is archiving
@nipos @nedelne_rano @resist1984 @frommMoritz and yet history is important. There is a balance to be found here somewhere.
If you don't want your information public, don't make it public. Facebook already disallows crawling, so does Twitter, by the way. So your point is mostly moot anyway.
CloudFlare unilaterally deciding to screw over one of the main projects keeping Internet history is not that balance.
@nedelne_rano @resist1984 @frommMoritz There's a big difference in making content searchable or cloning it completely forever.If it's in the search and the author decides to delete it,search links will return Error 404 after clicking it.Yes,there may be some other cache but I'm talking about pure search results.This isn't problematic.If you delete it and there's an exact copy of the page which isnt removed,this is a problem in some cases.