Reddit posts are effectively worse than useless for training a model lmao. It's a good way to make a model that's dumb as a rock and responds like a typical Redditor, and not much else.
Quality of training data is insanely important, even more important than quantity, and ALL Plebbit posts are low quality data.
Reddit is already bots. Gemini will be an AI learning from AI. Feeding the snake's tail into its own mouth. A data ouroboros. A prion made of information.
What does an Internet with Wasting Disease look like?
This confirms my suspicions that the Left is behind some of the major AI efforts. There is no way that a professional on Google's AI dev teams would deliberately choose a site well known for radical-left commenters to scrape for knowledge. We know from the very start that this will contaminate the AI's training data. The cost of data curation (cleaning and selecting the data used to train the AI) will be enormous; I have been seeing job posts for data cleaners lately paying $26/hr. Google will have to use offshore labor for some of this, but the offshore workers will not understand American political culture, so biased data can and will slip in.
Further, we know for a fact that Reddit is flooded with bots pushing left ideology, so obviously its comment text will be heavily biased towards the bot owners' views.
This is all a setup. I am guessing that Altman is behind this and is using it to contaminate a rival's data.
Reddit already is AI bot posts
AI training AI
Reminds me of that movie Multiplicity with Michael Keaton, where they make a clone from a clone and he comes out half-witted.
Lul
https://www.youtube.com/watch?v=tQlujXWP-5c
Sam Altman is the third-largest Reddit shareholder at 8.7%, behind Tencent at 11% and the Newhouse family (Condé Nast) at 30%.
Considering Gemini is already trash in a lot of ways, it will just get shittier lol
Garbage in, Garbage out.