This confirms my suspicions that the Left is behind some of the major AI efforts. There is no way that a professional in Google's AI dev teams would deliberately choose a site well-known for radical left commenters to scrape for knowledge. We know from the very start this will contaminate the AI bot database for training. The cost of data curation (the cleaning and selection of data for training the AI) will be enormous; I have been seeing job posts for data cleaners lately paying $26 /hr. Google will have to use offshore labor in some of this, but the offshore people will not understand American politics culture so biased data can and will slip in.
Further, we know for a fact that Reddit is flooded with bots pushing left ideology, so obviously its comment text data will be biased heavily towards the bot owner views.
This is all a setup. I am guessing that Altman is behind this, and using it as a way to contaminate a rival's data this way.
This confirms my suspicions that the Left is behind some of the major AI efforts. There is no way that a professional in Google's AI dev teams would deliberately choose a site well-known for radical left commenters to scrape for knowledge. We know from the very start this will contaminate the AI bot database for training. The cost of data curation (the cleaning and selection of data for training the AI) will be enormous; I have been seeing job posts for data cleaners lately paying $26 /hr. Google will have to use offshore labor in some of this, but the offshore people will not understand American politics culture so biased data can and will slip in.
Further, we know for a fact that Reddit is flooded with bots pushing left ideology, so obviously its comment text data will be biased heavily towards the bot owner views.
This is all a setup. I am guessing that Altman is behind this, and using it as a way to contaminate a rival's data this way.