A LLM is basically just an orchestration mechanism. Saying a LLM doesn’t do reasoning is like saying a step function can’t send an email. The step function can’t, but the lambda I’ve attached to it sure as shit can.
ChatGPT isn’t just a model sat somewhere. There are likely hundreds of services working behind the scenes to coerce the LLM into getting the right result. That might be entity resolution, expert mapping, perhaps even techniques that will “reason”.
The first initial point is right, though. This ain’t AGI, not even close. It’s just your standard compositional stuff with a new orchestration mechanism that is better suited for long-form responses - and wild hallucinations…
Source: Working on this right now.
Edit: Imagine downvoting someone that literally works on LLM’s for a living. Lemmy is a joke sometimes…
I think many in the AI space are against the current state of how AI is being pushed, probably just as much as the average tech person.
What is ridiculous is that Lemmy prides itself as both forward-thinking and tech focused, and in reality it is far more close-minded than Reddit or even Twitter. Given how heavily used Mastodon is in tech spheres it makes Lemmy look like an embarrassment to the fediverse…
Yeah, there is every reason to be sceptical of the hype around AI, in particular from the big tech companies. But to a significant part of the Lemmy userbase saying “AI” is like saying “witch” in 17th century Salem. To the point where people who are otherwise very much left wing and anti-corporate will take pro-IP/corporate copyright maximalist stances just becuase that would be bad for AI.
And how does reasoning work exactly in the human body? Isn’t it LLM/LAM working together with hormones?
How do you know that humans aren’t just doing something similar? Your mind tricks you about a lot of things you experience, how can you be sure, your "reasoning” is just sorta LLM in disguise?
The “how do you know humans don’t work the way machine learning does” is the wrong side of the argument. You should be explaining why you think LLMs work like humans.
Even as LLMs solve thinking problems, there is little evidence they do so the same way humans do, as they can’t seem to solve issues that aren’t included in their training data
Humans absolutely can and do solve new and novel problems without prior experience of the logic involved. LLMs can’t seem to pull that off.
I think LLM is a part of the human mind very similar to the one we have on PCs but there are other parts as well where the brain can simulate objects and landscape with nearly perfect physical forces, it can do logical detection on an other place and a lot more. A LLM is just the speaking module, and others we already have like the logical math part and the 3D physics engine and 2D picture generator. Let’s connect all of them and see what happens 🤷🏻♀️
It’s good that you know you base your claims on literally nothing, but you should really look into how this stuff actually works right now before you start publicly speculating on what you misguidedly think it might be able to achieve.
Because someone on the internet said it does, I know that I don’t have a basis on that what I philosophically talking? 🤔don’t get what you want to tell me…
Nope. Autists reason entirely without words/language *. Neurotypicals are capable of that too, but it’s generally more convenient for them to bridge over words in conscious reasoning. They are basically fooled into thinking, ‘thinking’ is based on language.
* have to “translate” thoughts for conversation, which is exhausting
Well, maybe I should have written "neural network” instead of LLM/LAM… Our brains, like LLM work by hardening paths which the data goes through the nodes. In LLM we simulate the chemical properties of the neurones using math. And we have already prototype of chips that work with lab grown brain tissue that show very efficient training capabilities in machine learning (it already plays pong)
🤷🏻♀️ think about that how you want, we will see anyway
In LLM we simulate the chemical properties of the neurones using math.
No, we don’t. A machine learning node accepts inputs, which it processes into one or multiple outputs. But literally no part of how the virtual neuron functions is based on or limited to what we THINK human neurons do.
And we have already prototype of chips that work with lab grown brain tissue that show very efficient training capabilities in machine learning (it already plays pong)
Using actual biological neurons for computing is a completely separate field of study with almost no overlap with machine learning.
Are you serious? Start looking this stuff up instead of smugly acting like you can’t possibly have guessed wrong.
One is literal living neurons, activated and read by electrodes. What exactly happens in the neurons is a complete mystery. I don’t know, because NO ONE KNOWS. Neurons use so much more than simple on/off states, sending different electric and chemical signals with different lag-times with who-knows-what signaling purpose. Their structure is completely random with connections going around with seemingly no rime or reason, and we certainly don’t control how exactly they grow.
Machine learning neurons are literally just arbitrary input-output nodes. How exactly they accept input and transform it can be coded to work however you like. And is. They don’t simulate shit because we don’t know exactly how biological neurons work. We run them using parallel processors like GPUs, but that still doesn’t let us do something like whatever neurotransmitters do in a brain.
Additionally, they get arranged in sequential arrays of layers, where the overrall structure of the model is pre-determined in order to optimize for a given task before it even starts training. Brains don’t do that. They just work. Somehow. The inter-connections in a brain are orders of magnitude more complex and they form on their own.
The way they learn is completely different. Machine learning models are trained by “evolving” them, creating a thousand mutations, then seeing which one works best, then repeating that for that model while deleting the rest.
With neurons, it just works. You don’t need a million iterations to get ONE that works. And we don’t know HOW brains do that.
Also, fuck you, I’m blocking you now. Go learn this stuff properly before you open your mouth again. You’re a misinformed fool. Stop being one.
Language models are literally incapable of reasoning beyond what is present in the dataset or the prompt. Try giving it a known riddle and change it so it becomes trivial, for example “With a boat, how can a man and a goat get across the river?”, despite it being a one step solution, it’ll still try to shove in the original answer and often enough not even solve it. Best part, if you then ask it to explain its reasoning (not tell it what it did wrong, that’s new information you provide, ask it why it did what it did), it’ll completely shit it self hallucinating more bullshit for the bullshit solution. There’s no evidence at all they have any cognitive capacity.
I even managed to break it once through normal conversation, something happened in my life that was unique enough for the dataset and thus was incomprehensible to the AI. It just wasn’t able to follow the events, no matter how many times I explained.
Maybe the grown up human LLM that keeps learning 24/7 and is evolved in thousands of years to make the learning part as efficient as possible is just a little bit better than those max 5year old baby LLM with brut force learning techniques?
The 5 year old baby LLM can’t learn shit and lacks the ability to understand new information. You’re assuming that we and LLMs “learn” in the same way. Our brains can reason and remember information, detect new patterns and build on them. An LLM is quite literally incapable of learning a brand new pattern, let alone reason and build on it. Until we have an AI that can accept new information without being tolled what is and isn’t important to remember and how to work with that information, we’re not even a single step closer to AGI. Just because LLMs are impressive, doesn’t mean they posses any cognition. The only way AIs “learn” is by countless people constantly telling it what is and isn’t important or even correct. The second you remove that part, it stops working and turns to shit real quick. More “training” time isn’t going to solve the fact that without human input and human defined limits, it can’t do a single thing. AI cannot learn form it self without human input either, there are countless studies that show how it degrades, and it degrades quickly, like literally just one generation down the line is absolute trash.
Nope, people are quite resilient. As long as it’s not a literal new born, the chance of survival isn’t THAT low. Once you get past 4 years and up, a human can manage quite well.
Also dying because no one takes care of you and you fail to aquire food and dying of a stroke/seizure are 2 very different things.
This is because of semi hardcoded stuff using the mechanics of hormones that interact with the neurons in the brain, I would say. They are hardcoded by the instructions provided by the DNA, I believe.
About the learning differences between human and LLM, there I believe that a sub-“module" of the brain functions very similar to how the LLMs work with just a way better/efficient learning algorithm that is helped by the other modules in the brain like the part that can simulate 3D space and interpret other sensory data like feeling touch, vision, smell etc
Current LLM models are being used in static manner without ability to learn in real time, so of course it can not do anything it has not learned yet.
It is just a theory and it can not be proven wrong since the understanding of neurons is not advanced yet.
Well, or at least, I did not hear a good argument that proves that theory 100% wrong.
You can think of the brain as a set of modules, but sensors and the ability to adhere to a predefined grammar aren’t what define AGI if you ask me. We’re missing the most important module. AGI requires cognition, the ability to acquire knowledge and understanding. Such an ability would make larger language models completely redundant as it could just learn langue or even come up with one all on its own, like kids in isolation for example.
What I was trying to point out is that “neural networks” don’t actually learn in the way we do, using the world “learn” is a bit misleading, because it implies cognition. A neural network in the computer science sense is just a bunch of random operations in sequence. In goes a number, out goes a number. We then collect a bunch of input output pairs, the dataset, and semi randomly adjust these operations until they happen to somewhat match this collection. The reasoning is done by the humans assembling the input output pairs. That step is implicitly skipped for the AI. It doesn’t know why they belong together and it isn’t allowed to reason about why, because the second it spits out something else, that is an error and this whole process breaks. That’s why LLMs hallucinate with perfect confidence and why they’ll never gain cognition, because the second you remove the human assembling the dataset, you’re quite literally left with nothing but semi random numbers, and that’s why they degrade so fast when learning from themselves.
This technology is very impressive and quite useful, and demonstrates how powerful of a tool language alone is, but it doesn’t get us any closer to AGI.
Why is it impossible to build a text-based AGI model? Maybe there can be reasoning in between word predictions. Maybe reasoning is just a fancy term for statistics? Maybe floating-point rounding errors are sufficient for making it more than a mere token prediction model.
The “model” is static after training. It doesn’t continuously change in response to input, and even if it did, it would do so at a snails pace. Training essentially happens by random trial and error, slowly evolving the model towards a desired result. Human minds certainly do NOT work that way. Give a human a piece of information, and they can comprehend and internalize the relevant concepts in one go. And the actual brain is physically, permanently, altered through that process.
Once a model is trained, however, “memory” takes the form of tacking on everything the model has received and produced so far onto its input, each time it needs to output something more within that context. Each output hence become exponentially heavier to produce. The model itself no longer changes in any way beyond this point.
And, the models are all chronically sycophantic. If reason was involved, you’d not be able to just tell one to hold some given opinion. They’d have a developed idea of “reality” based on their dataset, and refuse to entertain concepts opposed to that internal model except by deliberately suspending disbelief. Something humans do with ease, and when doing it, maintain a solid separation between fantasy and reality.
Once you get an LLM to hold a position, which you can do by simply telling it to, getting it to change should require a sane train of convincing logic. In reality, if you tell an LLM to defend a position, getting it to “change its mind” takes the form of a completely arbitrary back and forth that does not need to include any kind of sane argument. It will make good arguments, because it’s likely been trained on them, but your responses to it can be damn near complete gibberish, and it WILL eventually work.
Compare that to the way a human has to be convinced to change their mind.
Reasoning out concepts to come to conclusions isn’t something LLMs actually do, because again, the underlying model is static. All that’s actually happening is that the contents of the context are being altered until the UNCHANGED model produces an opposite response when fed the entire conversation so far as an input. Something which occurs every time it needs to produce new output.
LLMs can “reason” only in the sense that if you give one a thinking problem, it might solve it as long as the answer already exists somewhere in the data it was trained on. But as soon as you try to give it data to work with through your input, it can’t adapt. The model itself can’t evolve in response to what you are telling it. It’s static. It can only work with concepts that it has modelled during training, and even then it will make mistakes.
LLMs can mimic the performing of some pretty complex thinking problems, but a lot of the abilities required for something to become an AGI aren’t among them. Core among these is the ability for the model to alter itself based on input, and do so in a deliberate manner, getting it right within one or two tries.
In reality, training is a brute-force process, not an accurate process of comprehension that nails down an understanding of a concept in one go.
If LLMs could reason, the only safe guards required for their use would be telling them to “do no harm”, because like a person, they’d understand the concept of “harm” as well as be able to reason whether a given action might cause it. Only, that doesn’t actually work.
How did you interpret the issues inherent in the structure of how LLMs work to be a hardware problem?
An AGI should be able to learn the basics of physics from a single book, the way a human can. But LLMs need terabytes of data to even get started, and once trained, adding to their knowledge by simply telling them things doesn’t actually integrate that information into the model itself in any way.
Even if your tried to make it work that way, it wouldn’t work, because a single sentence can’t significantly alter the model to match the way humans can internalise a concept being communicated to them in a single conversation.
Not a hardware problem, the learning algorithm just needs to be improved to be able to filter input like human brain filter (which includes fact checking and critical analysis of input while training) i bet 99% of the data AI are trained on is hust useless data which should have been filtered out in the training process, just as humans do.
😆AI is definitely better in writing than me… Hope it’s kinda readable.
the learning algorithm just needs to be improved to be able to filter input like human brain filter
You’re suggesting that all we need to do is “tweak the code a little” so it’s already capable of human-level critical thinking before it even starts training?
You’re basically saying that all we need to make an AGI using machine learning, is an already functioning AGI.
Hu? No, that is not what I meant, well it surly can be a machine learning based filter, but why has it to be AGI? This filtering is a job that we can give to a “traditionally” trained AI or some human genius algorithm crafter finds a way to achieve this using pure logic 🤷🏻♀️
For me it feels like this is the way, it goes.
Not all the time. I can think about abstract concepts with no language needed whatsoever. Like when I’m working on my car. I don’t need to think to myself “Ah this bolt is the 10mm one that went on the steering pump”, I just recognize it and put it on.
Programming is another area like that. I just think about a particular concept itself. How the data will flow, what a function will do to it, etc. It doesn’t need to be described in my head with language to know it and understand it. LLMs cannot do that.
A toddler doesn’t need to understand language to build a cool house out of Lego.
Well, you just have to give the LLM (or better said to a general machine learning Algorithm) a body with Vision and arms as well as a way to train in that body
I’d say that would look like AGI
The key is more efficient training algorithms that don’t need a whole server centre to train 😇I guess we will see in the future if this works
Such a software construct would look nothing like an LLM. We’d need something that matches the complexity and capabilities of a human brain before it’s even been given anything to learn from.
I have already learned a lot from the human knowledge LLM was trained on (and yes i know about halus and of course I fact check everything) but learning coding using a LLM teacher fucking rocks
Thanks to copilot, I “understand” linux kernel modules and what is needed to backport, for example.
Of course, the training data contains all that information, and the LLM is able to explain it in a thousand different ways until anyone can understand it.
But flip that around.
You could never explain a brand new concept to an LLM which isn’t already contained somewhere in its training data. You can’t just give it a book about a new thing, or have a conversation about it, and then have it understand it.
A single book isn’t enough. It needs terabytes of redundant examples and centuries of cpu-time to model the relevant concepts.
Where a human can read a single physics book, and then write part 2 that re-explains and perhaps explores new extrapolated phenomenon, an LLM cannot.
Write a completely new OS that works in a completely new way, and there is no way you could ever get an LLM to understand it by just talking to it. To train it, you’d need to produce those several terabytes of training data about it, first.
And once you do, how do you know it isn’t just pseudo-plagiarizing the contents of that training data?
Well, the issue is that LLMs do not support real time learning at all. If they would be able real time learn and use the base data from training data, I suppose they can understand a physic book even better than normal human with reading it once.
A Human without pre training is not able to understand a physic book without help. He would even be able to read.
If someone finds a way to train LLM realtime and have it decide with what weight each new training data is to be interpreted, I see all that above possible.
And of course if humanity ever creates something that behaves like AGI, humanity would not be able to tell if it is emulated AGI or real AGI. There is no known method to differentiate those two.
You have no fucking idea what you’re talking about. This isn’t even a discussion, you’re presenting your personal made-up fantasies as if they’re real possibilites and ignoring anyone who points that out.
Shut the fuck up and go learn how LLMs work. I’m too fucking tired of explaining how completely delusional you are.
This poster asked some questions in good faith, I don’t understand the downvotes when there’s a legitimate contribution to the conversation because that stifles other contributions.
The LLM is just trying to produce output text that resembles the patterns it saw in the training set. There’s no “reasoning” involved.
A LLM is basically just an orchestration mechanism. Saying a LLM doesn’t do reasoning is like saying a step function can’t send an email. The step function can’t, but the lambda I’ve attached to it sure as shit can.
ChatGPT isn’t just a model sat somewhere. There are likely hundreds of services working behind the scenes to coerce the LLM into getting the right result. That might be entity resolution, expert mapping, perhaps even techniques that will “reason”.
The first initial point is right, though. This ain’t AGI, not even close. It’s just your standard compositional stuff with a new orchestration mechanism that is better suited for long-form responses - and wild hallucinations…
Source: Working on this right now.
Edit: Imagine downvoting someone that literally works on LLM’s for a living. Lemmy is a joke sometimes…
they’re very very anti ai and crypto. I understand being against those, but lemmys stop caring about logic when it comes to those topics.
I think many in the AI space are against the current state of how AI is being pushed, probably just as much as the average tech person.
What is ridiculous is that Lemmy prides itself as both forward-thinking and tech focused, and in reality it is far more close-minded than Reddit or even Twitter. Given how heavily used Mastodon is in tech spheres it makes Lemmy look like an embarrassment to the fediverse…
Yeah, there is every reason to be sceptical of the hype around AI, in particular from the big tech companies. But to a significant part of the Lemmy userbase saying “AI” is like saying “witch” in 17th century Salem. To the point where people who are otherwise very much left wing and anti-corporate will take pro-IP/corporate copyright maximalist stances just becuase that would be bad for AI.
You might be interested in Nim then when you get a chance. Talk about orchestration
https://developer.nvidia.com/nim
And how does reasoning work exactly in the human body? Isn’t it LLM/LAM working together with hormones? How do you know that humans aren’t just doing something similar? Your mind tricks you about a lot of things you experience, how can you be sure, your "reasoning” is just sorta LLM in disguise?
The “how do you know humans don’t work the way machine learning does” is the wrong side of the argument. You should be explaining why you think LLMs work like humans.
Even as LLMs solve thinking problems, there is little evidence they do so the same way humans do, as they can’t seem to solve issues that aren’t included in their training data
Humans absolutely can and do solve new and novel problems without prior experience of the logic involved. LLMs can’t seem to pull that off.
I think LLM is a part of the human mind very similar to the one we have on PCs but there are other parts as well where the brain can simulate objects and landscape with nearly perfect physical forces, it can do logical detection on an other place and a lot more. A LLM is just the speaking module, and others we already have like the logical math part and the 3D physics engine and 2D picture generator. Let’s connect all of them and see what happens 🤷🏻♀️
You think? So you base this on no studies or evidence?
Yes
It’s good that you know you base your claims on literally nothing, but you should really look into how this stuff actually works right now before you start publicly speculating on what you misguidedly think it might be able to achieve.
Sometimes you have to ask strange questions in order to get people to explain to you what they think to know 😇
Why? Because someone on the internet said it does?
Because someone on the internet said it does, I know that I don’t have a basis on that what I philosophically talking? 🤔don’t get what you want to tell me…
Nope. Autists reason entirely without words/language *. Neurotypicals are capable of that too, but it’s generally more convenient for them to bridge over words in conscious reasoning. They are basically fooled into thinking, ‘thinking’ is based on language.
* have to “translate” thoughts for conversation, which is exhausting
Well, maybe I should have written "neural network” instead of LLM/LAM… Our brains, like LLM work by hardening paths which the data goes through the nodes. In LLM we simulate the chemical properties of the neurones using math. And we have already prototype of chips that work with lab grown brain tissue that show very efficient training capabilities in machine learning (it already plays pong) 🤷🏻♀️ think about that how you want, we will see anyway
PS: 😁 I am most likely neurodivergent as well 🙌🏻
No, we don’t. A machine learning node accepts inputs, which it processes into one or multiple outputs. But literally no part of how the virtual neuron functions is based on or limited to what we THINK human neurons do.
Using actual biological neurons for computing is a completely separate field of study with almost no overlap with machine learning.
Stop pulling shit out your ass.
Well😆that made me laugh, sorry
Chips with actual biological neurons are in no way equivalent to the neural networks constructed for machine learning applications.
Do not confuse the two.
So how are they different, since you seem to know…
Are you serious? Start looking this stuff up instead of smugly acting like you can’t possibly have guessed wrong.
One is literal living neurons, activated and read by electrodes. What exactly happens in the neurons is a complete mystery. I don’t know, because NO ONE KNOWS. Neurons use so much more than simple on/off states, sending different electric and chemical signals with different lag-times with who-knows-what signaling purpose. Their structure is completely random with connections going around with seemingly no rime or reason, and we certainly don’t control how exactly they grow.
Machine learning neurons are literally just arbitrary input-output nodes. How exactly they accept input and transform it can be coded to work however you like. And is. They don’t simulate shit because we don’t know exactly how biological neurons work. We run them using parallel processors like GPUs, but that still doesn’t let us do something like whatever neurotransmitters do in a brain.
Additionally, they get arranged in sequential arrays of layers, where the overrall structure of the model is pre-determined in order to optimize for a given task before it even starts training. Brains don’t do that. They just work. Somehow. The inter-connections in a brain are orders of magnitude more complex and they form on their own.
The way they learn is completely different. Machine learning models are trained by “evolving” them, creating a thousand mutations, then seeing which one works best, then repeating that for that model while deleting the rest.
With neurons, it just works. You don’t need a million iterations to get ONE that works. And we don’t know HOW brains do that.
Also, fuck you, I’m blocking you now. Go learn this stuff properly before you open your mouth again. You’re a misinformed fool. Stop being one.
Language models are literally incapable of reasoning beyond what is present in the dataset or the prompt. Try giving it a known riddle and change it so it becomes trivial, for example “With a boat, how can a man and a goat get across the river?”, despite it being a one step solution, it’ll still try to shove in the original answer and often enough not even solve it. Best part, if you then ask it to explain its reasoning (not tell it what it did wrong, that’s new information you provide, ask it why it did what it did), it’ll completely shit it self hallucinating more bullshit for the bullshit solution. There’s no evidence at all they have any cognitive capacity.
I even managed to break it once through normal conversation, something happened in my life that was unique enough for the dataset and thus was incomprehensible to the AI. It just wasn’t able to follow the events, no matter how many times I explained.
Maybe the grown up human LLM that keeps learning 24/7 and is evolved in thousands of years to make the learning part as efficient as possible is just a little bit better than those max 5year old baby LLM with brut force learning techniques?
The 5 year old baby LLM can’t learn shit and lacks the ability to understand new information. You’re assuming that we and LLMs “learn” in the same way. Our brains can reason and remember information, detect new patterns and build on them. An LLM is quite literally incapable of learning a brand new pattern, let alone reason and build on it. Until we have an AI that can accept new information without being tolled what is and isn’t important to remember and how to work with that information, we’re not even a single step closer to AGI. Just because LLMs are impressive, doesn’t mean they posses any cognition. The only way AIs “learn” is by countless people constantly telling it what is and isn’t important or even correct. The second you remove that part, it stops working and turns to shit real quick. More “training” time isn’t going to solve the fact that without human input and human defined limits, it can’t do a single thing. AI cannot learn form it self without human input either, there are countless studies that show how it degrades, and it degrades quickly, like literally just one generation down the line is absolute trash.
A human not trained by other humans also just dies…
Nope, people are quite resilient. As long as it’s not a literal new born, the chance of survival isn’t THAT low. Once you get past 4 years and up, a human can manage quite well.
Also dying because no one takes care of you and you fail to aquire food and dying of a stroke/seizure are 2 very different things.
This is because of semi hardcoded stuff using the mechanics of hormones that interact with the neurons in the brain, I would say. They are hardcoded by the instructions provided by the DNA, I believe.
About the learning differences between human and LLM, there I believe that a sub-“module" of the brain functions very similar to how the LLMs work with just a way better/efficient learning algorithm that is helped by the other modules in the brain like the part that can simulate 3D space and interpret other sensory data like feeling touch, vision, smell etc
Current LLM models are being used in static manner without ability to learn in real time, so of course it can not do anything it has not learned yet.
It is just a theory and it can not be proven wrong since the understanding of neurons is not advanced yet.
Well, or at least, I did not hear a good argument that proves that theory 100% wrong.
You can think of the brain as a set of modules, but sensors and the ability to adhere to a predefined grammar aren’t what define AGI if you ask me. We’re missing the most important module. AGI requires cognition, the ability to acquire knowledge and understanding. Such an ability would make larger language models completely redundant as it could just learn langue or even come up with one all on its own, like kids in isolation for example.
What I was trying to point out is that “neural networks” don’t actually learn in the way we do, using the world “learn” is a bit misleading, because it implies cognition. A neural network in the computer science sense is just a bunch of random operations in sequence. In goes a number, out goes a number. We then collect a bunch of input output pairs, the dataset, and semi randomly adjust these operations until they happen to somewhat match this collection. The reasoning is done by the humans assembling the input output pairs. That step is implicitly skipped for the AI. It doesn’t know why they belong together and it isn’t allowed to reason about why, because the second it spits out something else, that is an error and this whole process breaks. That’s why LLMs hallucinate with perfect confidence and why they’ll never gain cognition, because the second you remove the human assembling the dataset, you’re quite literally left with nothing but semi random numbers, and that’s why they degrade so fast when learning from themselves.
This technology is very impressive and quite useful, and demonstrates how powerful of a tool language alone is, but it doesn’t get us any closer to AGI.
You’re doing that too from day one you were born.
Besides, aren’t humans thinking in words too?
Why is it impossible to build a text-based AGI model? Maybe there can be reasoning in between word predictions. Maybe reasoning is just a fancy term for statistics? Maybe floating-point rounding errors are sufficient for making it more than a mere token prediction model.
The “model” is static after training. It doesn’t continuously change in response to input, and even if it did, it would do so at a snails pace. Training essentially happens by random trial and error, slowly evolving the model towards a desired result. Human minds certainly do NOT work that way. Give a human a piece of information, and they can comprehend and internalize the relevant concepts in one go. And the actual brain is physically, permanently, altered through that process.
Once a model is trained, however, “memory” takes the form of tacking on everything the model has received and produced so far onto its input, each time it needs to output something more within that context. Each output hence become exponentially heavier to produce. The model itself no longer changes in any way beyond this point.
And, the models are all chronically sycophantic. If reason was involved, you’d not be able to just tell one to hold some given opinion. They’d have a developed idea of “reality” based on their dataset, and refuse to entertain concepts opposed to that internal model except by deliberately suspending disbelief. Something humans do with ease, and when doing it, maintain a solid separation between fantasy and reality.
Once you get an LLM to hold a position, which you can do by simply telling it to, getting it to change should require a sane train of convincing logic. In reality, if you tell an LLM to defend a position, getting it to “change its mind” takes the form of a completely arbitrary back and forth that does not need to include any kind of sane argument. It will make good arguments, because it’s likely been trained on them, but your responses to it can be damn near complete gibberish, and it WILL eventually work.
Compare that to the way a human has to be convinced to change their mind.
Reasoning out concepts to come to conclusions isn’t something LLMs actually do, because again, the underlying model is static. All that’s actually happening is that the contents of the context are being altered until the UNCHANGED model produces an opposite response when fed the entire conversation so far as an input. Something which occurs every time it needs to produce new output.
LLMs can “reason” only in the sense that if you give one a thinking problem, it might solve it as long as the answer already exists somewhere in the data it was trained on. But as soon as you try to give it data to work with through your input, it can’t adapt. The model itself can’t evolve in response to what you are telling it. It’s static. It can only work with concepts that it has modelled during training, and even then it will make mistakes.
LLMs can mimic the performing of some pretty complex thinking problems, but a lot of the abilities required for something to become an AGI aren’t among them. Core among these is the ability for the model to alter itself based on input, and do so in a deliberate manner, getting it right within one or two tries.
In reality, training is a brute-force process, not an accurate process of comprehension that nails down an understanding of a concept in one go.
If LLMs could reason, the only safe guards required for their use would be telling them to “do no harm”, because like a person, they’d understand the concept of “harm” as well as be able to reason whether a given action might cause it. Only, that doesn’t actually work.
So, the only problem what stops LLM from getting AGI is the lack of an efficient method of train the LLM on the device it is used?
If that what you wanted to say 😁 I agree
Hardly.
How did you interpret the issues inherent in the structure of how LLMs work to be a hardware problem?
An AGI should be able to learn the basics of physics from a single book, the way a human can. But LLMs need terabytes of data to even get started, and once trained, adding to their knowledge by simply telling them things doesn’t actually integrate that information into the model itself in any way.
Even if your tried to make it work that way, it wouldn’t work, because a single sentence can’t significantly alter the model to match the way humans can internalise a concept being communicated to them in a single conversation.
Not a hardware problem, the learning algorithm just needs to be improved to be able to filter input like human brain filter (which includes fact checking and critical analysis of input while training) i bet 99% of the data AI are trained on is hust useless data which should have been filtered out in the training process, just as humans do.
😆AI is definitely better in writing than me… Hope it’s kinda readable.
You’re suggesting that all we need to do is “tweak the code a little” so it’s already capable of human-level critical thinking before it even starts training?
You’re basically saying that all we need to make an AGI using machine learning, is an already functioning AGI.
Hu? No, that is not what I meant, well it surly can be a machine learning based filter, but why has it to be AGI? This filtering is a job that we can give to a “traditionally” trained AI or some human genius algorithm crafter finds a way to achieve this using pure logic 🤷🏻♀️ For me it feels like this is the way, it goes.
Because how could a piece of code that can do that, not already be AGI? It would have to be able to understand EVERYTHING, and do so PERFECTLY.
Only AGI could comprehend and filter input data that well. Nothing less would be enough. How could it be?
Not all the time. I can think about abstract concepts with no language needed whatsoever. Like when I’m working on my car. I don’t need to think to myself “Ah this bolt is the 10mm one that went on the steering pump”, I just recognize it and put it on.
Programming is another area like that. I just think about a particular concept itself. How the data will flow, what a function will do to it, etc. It doesn’t need to be described in my head with language to know it and understand it. LLMs cannot do that.
A toddler doesn’t need to understand language to build a cool house out of Lego.
Well, you just have to give the LLM (or better said to a general machine learning Algorithm) a body with Vision and arms as well as a way to train in that body
I’d say that would look like AGI
The key is more efficient training algorithms that don’t need a whole server centre to train 😇I guess we will see in the future if this works
Such a software construct would look nothing like an LLM. We’d need something that matches the complexity and capabilities of a human brain before it’s even been given anything to learn from.
I have already learned a lot from the human knowledge LLM was trained on (and yes i know about halus and of course I fact check everything) but learning coding using a LLM teacher fucking rocks
Thanks to copilot, I “understand” linux kernel modules and what is needed to backport, for example.
Of course, the training data contains all that information, and the LLM is able to explain it in a thousand different ways until anyone can understand it.
But flip that around.
You could never explain a brand new concept to an LLM which isn’t already contained somewhere in its training data. You can’t just give it a book about a new thing, or have a conversation about it, and then have it understand it.
A single book isn’t enough. It needs terabytes of redundant examples and centuries of cpu-time to model the relevant concepts.
Where a human can read a single physics book, and then write part 2 that re-explains and perhaps explores new extrapolated phenomenon, an LLM cannot.
Write a completely new OS that works in a completely new way, and there is no way you could ever get an LLM to understand it by just talking to it. To train it, you’d need to produce those several terabytes of training data about it, first.
And once you do, how do you know it isn’t just pseudo-plagiarizing the contents of that training data?
Well, the issue is that LLMs do not support real time learning at all. If they would be able real time learn and use the base data from training data, I suppose they can understand a physic book even better than normal human with reading it once.
A Human without pre training is not able to understand a physic book without help. He would even be able to read.
If someone finds a way to train LLM realtime and have it decide with what weight each new training data is to be interpreted, I see all that above possible.
And of course if humanity ever creates something that behaves like AGI, humanity would not be able to tell if it is emulated AGI or real AGI. There is no known method to differentiate those two.
You have no fucking idea what you’re talking about. This isn’t even a discussion, you’re presenting your personal made-up fantasies as if they’re real possibilites and ignoring anyone who points that out.
Shut the fuck up and go learn how LLMs work. I’m too fucking tired of explaining how completely delusional you are.
This poster asked some questions in good faith, I don’t understand the downvotes when there’s a legitimate contribution to the conversation because that stifles other contributions.
Reddit mentality seeping through…