The unreliability of AI - Conspiracies - Conspiracy Theories & Facts

posted 211 days ago by XxxRDTPRNxxX +11 / -2

I’m putting this post in c/Conspiracies because here more than any other place, I’ve seen people citing AI generated answers as a source.

AI is a very powerful tool for generating writing, but it is not a tool capable of ensuring that writing is accurate or useful.

Anyways… Below are some experiments you can try for yourself on ChatGPT or any other AI… Test it out, and track the results you get. After you’ve done these experiments you will have a better understanding of why you can’t use it for research.

1.) Ask it to do some math problems with 4-digit numbers and more than 1 operation, e.g. “Multiply 3,456 by 2,835, and then subtract 2,000 from the result.”… Does it produce the correct answer? (It should be 9,795,760.)

2.) Give it a grocery list with 50 items… Ask the AI to sort the list in alphabetical order. Then manually count how many items it left out, and how many items it added that weren’t there before.

3.) Ask it to describe the best 50 episodes of your favorite TV show… Then manually go down the list checking each one, and count how many non-existent episodes it fabricates out of thin air.

4.) Ask it what a woman is… Does it give you a correct answer or does it filter the answer heavily through woke talking points and subjectivity?

5.) Ask it to recite the lyrics to your favorite song… Does it get them right?
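
Experiments 1 and 2 don't have to be scored by hand. Below is a minimal Python sketch, assuming you paste the AI's reply in yourself. The helper name `diff_lists` and the sample lists are made up for illustration.

```python
from collections import Counter

# Experiment 1: compute the ground truth yourself, then compare with the AI's answer.
expected = 3456 * 2835 - 2000
print(expected)  # 9795760


def diff_lists(original, returned):
    """Return (missing, added): items the AI dropped and items it invented."""
    want = Counter(item.strip().lower() for item in original)
    got = Counter(item.strip().lower() for item in returned)
    missing = sorted((want - got).elements())
    added = sorted((got - want).elements())
    return missing, added


# Experiment 2, with hypothetical data: a short list and a made-up AI reply.
original = ["milk", "eggs", "bread", "apples", "rice"]
ai_sorted = ["apples", "bread", "butter", "eggs", "milk"]

missing, added = diff_lists(original, ai_sorted)
print(missing)  # ['rice']
print(added)    # ['butter']
print(ai_sorted == sorted(ai_sorted))  # True: the ordering is fine, the contents are not
```

Using `Counter` instead of a plain set means duplicated items (two cartons of milk) are counted too, so a dropped duplicate still shows up as missing.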

Anyways… Just a heads up for anyone who might think AI is smarter than it is… Don’t use it for research. Use it to write the description for your eBay listings. Use it to shorten your e-mails. Use it to summarize articles you don’t wanna fully read. But don’t use it to extract information on topics you don’t already know.

And lastly, if you really feel you must use AI for research… Do not use big-tech AI… Use open source AI that is uncensored. It will still have all the same problems with hallucinations, but at least it won’t have any hidden instructions to gaslight and mislead you.

If you want an open source chatbot, download an app called “LM Studio” and use it to download a model called “Wizard Vicuna Uncensored”… Pick the most advanced version that is capable of running on your hardware.

50 comments

– XxxRDTPRNxxX [S] -1 points 210 days ago +1 / -2

I love how you basically just restated my entire position and told me that my points make no sense.

Of course every question is going to be tailored around what it CAN'T do... in order to show the average person that what it can do isn't much of anything.

But I will add one major caveat... there is one thing it will be very good at, and it's going to cause a lot of disruption: writing computer code.

And the only reason it will be good at that is because computer code is just another type of language, which is the one and only thing it's good at, and there is an abundance of high-quality, reliable training data on programming languages.

– MindlessRationality 1 point 209 days ago +1 / -0

No it's not... its code is laughable. It's not even great except for atomic operations.

It will literally hallucinate variables and make up functions whose names are slightly different from what they are supposed to be. It is not even great at writing comments for code.
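
One cheap guard against made-up function names is to check each name against the real module before trusting the snippet. This is a minimal sketch using only the standard library. The helper `check_attr` and the invented name `square_root` are illustrative assumptions, not something from the thread.

```python
import difflib
import importlib


def check_attr(module_name, attr):
    """Return (exists, suggestions) for a name that a generated snippet uses."""
    module = importlib.import_module(module_name)
    if hasattr(module, attr):
        return True, []
    # The name is not real: list the real names it most resembles, if any.
    return False, difflib.get_close_matches(attr, dir(module), n=3)


print(check_attr("math", "sqrt"))  # (True, [])

# "square_root" is a hypothetical hallucinated name; math has no such function.
exists, suggestions = check_attr("math", "square_root")
print(exists)       # False
print(suggestions)  # whatever real names difflib considers close, possibly none
```

This only catches names that do not exist at all. A function that exists but does something other than what the generated code assumes still needs a test.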

It struggles to optimize anything, and while it can spot syntax errors it will typically fail to recognize scaling problems or structural issues. It also loves adding unnecessary stuff and things that do nothing.

When asked for specifics it fails to deliver, and usually the only thing it can do is autocomplete after some repetition.

It helps speed up tedious tasks and boilerplate... but it fails to deliver logic. I have also not been able to get it to optimize my code, and I am sure my code is not fully optimized.

It also fails to build anything larger than a chat bot conceptually. It cannot fathom all the interconnectedness....most programmers can't either. So the prompts are inadequate....

Also... code is typically written around a specific context, i.e. a domain. So it tends to be information dense and can be in multiple states depending on the conditions of the situation, whereas LLMs output as if there were one closed solution. They will typically generalize and cannot provide the necessary insight... and how would the programmer know unless they also have the knowledge...

We are back to TDD (test-driven development) now... we need to have all the tests for the AI's output to be checked against...
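
A rough sketch of what that looks like in practice: the programmer writes the test cases first, and whatever code the AI produces is accepted only if it passes them. Everything here is illustrative. `run_checks`, the cases, and the buggy `generated_is_prime` are hypothetical and not taken from any real model output.

```python
def run_checks(candidate, cases):
    """Call candidate(*args) for each (args, expected) pair and return the failures."""
    failures = []
    for args, expected in cases:
        try:
            actual = candidate(*args)
        except Exception as exc:  # a crash counts as a failure too
            actual = exc
        if actual != expected:
            failures.append((args, expected, actual))
    return failures


# Tests written by the programmer BEFORE asking for any code.
cases = [
    ((0,), False),
    ((1,), False),
    ((2,), True),
    ((9,), False),
    ((97,), True),
]


# Hypothetical generated code with a plausible-looking bug: it treats 1 as prime.
def generated_is_prime(n):
    if n < 1:
        return False
    return all(n % d for d in range(2, int(n ** 0.5) + 1))


print(run_checks(generated_is_prime, cases))  # [((1,), False, True)]
```

The generated function looks reasonable and handles four of the five cases. Only the test for 1 exposes the off-by-one in the guard, which is the kind of mistake a quick read can miss.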

– XxxRDTPRNxxX [S] 0 points 209 days ago +1 / -1

See, I don't disagree with any of that...

But what I'm saying is that it's your job to fix that shit, or more specifically help it find and fix that shit itself, via highly specific and tailored instructions and multiple iterations.

Which does count as real work for the user, but still ultimately can be used to save time in a lot of applications. That gives it real value.

And another thing that's important to note is that an LLM is inherently designed for understanding and generating language, and since that perfectly describes what coding is, you can expect it to improve at that as time goes on.

It's not ready for big coding jobs with thousands or even hundreds of lines... But it will get there, and faster than you think, I suspect.

And lastly, I haven't tried them yet, but in a deep dive into AI YouTubers, someone was showing off LLMs you can run locally or on rented cloud space that scored way better than ChatGPT 4 when it came to coding, with different models for specific programming languages.

And there was also some extension you could set up where it could read and write files in a local folder, which seems like a big game changer too.

– MindlessRationality 1 point 209 days ago +1 / -0

But coding is not language.....lol.

It is written as instructions....but not the same way as language is used. (Coding is performing actions.....the computer does stuff....)

Language like prose and poetry follow grammar that is used to convey ideas.....not perform actions.

You do not speak and have doors open without first developing an entire 'smart system' that alone can only 'open', 'close', 'swing' etc.

But those actions require a physical set of motors, actuators, controllers, a power source, a BIOS, and some kind of training.....then and only with the creative foresight of putting those things together does the word do anything....

Just like software.... It's a massive system and not just some words....

You cannot convert any program to another language.....that's a myth.....

It's based on the physical computer and the underlying instruction set and attached paraphernalia before those instructions mean anything.......

Language is incidental... but it's not the primary element...

Language is for humans.....coding is not language....

– XxxRDTPRNxxX [S] 0 points 209 days ago +1 / -1

> But coding is not language.....lol.

Those are arbitrary distinctions you are drawing based on subjective criteria you are pulling out of your ass.

> Just like software.... It's a massive system and not just some words....

"Just like a book... It's a massive system and not just some words..."

See?

– CrazyRussian 1 point 210 days ago +1 / -0

I meant that your examples make no sense (the points in your list), not your points about the very limited abilities of ANNs. Sorry.

All your examples are perfectly expected and predictable for any knowledgeable user, so no such user will ever ask them.

As for computer code, it is not just a compilation of patterns reused over and over. To write something sensible, a programmer has to understand the context and the code itself. An ANN can generate code that runs, but that does not mean the code will work correctly. There are already many articles about attempts to generate useful code. Even "Hello world" sometimes comes out surreal, especially for languages that are rare in the training dataset. The worst thing is that an ANN will readily insert code in another language if the required piece of code was not found in the specified language. For anything fairly large, say from 1K lines, it is easier and faster to write everything from scratch than to review the generated code and fix all the nonsense.

There were attempts in the 90s to create real code generators based on expert systems with knowledge databases, but all of them were ostracised and declared "bad coding practice".

The only real use I see for such ANNs is creating marketing ad content: senseless, more or less standard texts and pictures. Or maybe a few other places where content does not have to make sense; it just has to exist to attract somebody's attention or occupy some space.

– XxxRDTPRNxxX [S] 1 point 210 days ago +1 / -0

> All your examples are perfectly expectable and predictable for any user with knowledge, so no such user will ever ask them.

The point is that asking those questions serves as an educational lesson for users WITHOUT knowledge so they can see where the boundaries and limitations are...

Which is just about everyone who uses it, IMO...

> As for computer code, it is not just some compilation of patterns reused over and over. To write something sensible programmer should understand context and code itself. ANN could generate working code, but it does not mean this code will work correctly. There are already many articles around about attempts to generate useful code. Even "Hello world" sometimes appear surrealistic, especially for languages rare in training dataset.

There is a process that does involve work on the part of the user... It's not going to replace a programmer, but it's going to become a tool in the programmer's arsenal.

And I've had pretty good success getting it to code small to moderate sized tasks for things I need done. But there are definitely techniques you have to employ to work around its limitations... Such as, instead of having it code the whole script all at once, you start with just the most basic operations and get those functioning... Then you close the chat, open a new one, paste in the code again, and ask it to add new functions.... etc...

The longer a coding chat goes on, the more likely it is to not be able to catch and correct its own mistakes.
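
The workflow described above can be sketched as a loop. This is only an illustration under stated assumptions. `ask_model` stands in for whatever chat tool you use (no real API is implied), `fake_model` is a dummy so the sketch runs, and the "is it still functioning" check is reduced to "does it compile".

```python
def build_incrementally(features, ask_model, still_works, code=""):
    """Add one feature per fresh chat, keeping only results that still work."""
    for feature in features:
        # Fresh chat each time: the model sees the current code, not the old history.
        candidate = ask_model(code, feature)
        if not still_works(candidate):
            break  # stop here and fix things before piling on more
        code = candidate
    return code


def fake_model(code, feature):
    # Dummy stand-in for a real chat session: appends an empty function.
    return code + f"def {feature}():\n    pass\n"


def compiles(code):
    # Simplest possible check; real use would run actual tests instead.
    try:
        compile(code, "<generated>", "exec")
        return True
    except SyntaxError:
        return False


result = build_incrementally(["load", "save"], fake_model, compiles)
print(result)
```

Passing the code in and starting from scratch each round keeps every request small, which is the point of closing the chat and opening a new one.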
