Google says public data is fair game for training its AIs
30 comment bubble on white
we're just being honest, says web giant
Google has updated its privacy policy to confirm it scrapes public data from the internet to train its AI models and services – including its chatbot Bard and its cloud-hosted pro
the fine print
"Google uses information to improve our services and to develop new products, features and technologies that benefit our users and the public"
https://www.theregister.com/2023/07/06/google_ai_models_internet_scraping/
imo if things are on the open web then it's fair game
if folk like content providers don't want their stuff being used there are ways to stop it being on the open web then AI wouldn't be able to use it
they can use paywall services like patreion, substcsk, rumble, medium etc.,,,,, to name a few all subscrition based platforms can't be used for AI learning like open platforms can cause they have good terms & conditions about that
open web stuff is a free for all
@ecksmc When I worked in IT, we just blocked scrapers and left normal traffic alone, and did nothing to impact our real users. There are/were many tools available to throttle abuse of a system that don't involve bothering users.