LLMs are acing the MCAT, the bar exam, the SAT, etc. like they're nothing. At this point their performance is superhuman. However, they'll often trip on super simple common-sense questions, and they struggle with creative thinking.
Is this proof that standardized tests are not a good measure of intelligence?
Apologies if this comes off as rude, but as an engineer working in reinforcement learning, I find it upsetting when people make claims like this based on conjecture and hand-wavy understandings of ML. Someday there will be goal-driven agents that can interact with the world, and those agents will be harmed by exactly these kinds of incorrect understandings of machine learning.
LLMs' thinking also feeds into their word choice. Where else would the words be coming from, thin air? No, they come from billions of neurons doing what neurons do: thinking.
What is a "support structure", in your mind? That's not a defined term in neuroscience, cognitive science, or ML, so it sounds like hand-waving to me.
LLMs can and do generalize beyond their training data; that's literally the whole point. Otherwise, they'd be useless.
During training, the weights shaped by previous examples are revisited and adjusted in light of each new example; because every example updates the same shared parameters, the model ends up encoding patterns rather than memorizing instances. That shared structure is what leads to generalization (see the toy sketch below).
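To make that concrete, here's a toy sketch (plain NumPy, obviously not an LLM, and all the names and numbers are my own illustrative assumptions): one shared weight vector is nudged by every training example in turn, and afterwards the model is scored on inputs it never saw, including values outside the training range.

```python
import numpy as np

rng = np.random.default_rng(0)

# Training data drawn from y = 2x + 1 plus noise, x limited to [-1, 1].
x_train = rng.uniform(-1, 1, size=100)
y_train = 2 * x_train + 1 + 0.05 * rng.normal(size=100)

# Held-out inputs the model never trains on, deliberately outside [-1, 1].
x_test = np.array([1.5, -2.0, 3.0])
y_test = 2 * x_test + 1

w, b = 0.0, 0.0   # the shared weights that every example touches
lr = 0.1          # learning rate

for epoch in range(200):
    for x, y in zip(x_train, y_train):
        pred = w * x + b
        err = pred - y
        # Each new example nudges the *same* weights that earlier examples shaped.
        w -= lr * err * x
        b -= lr * err
```

After training, `w` and `b` land near 2 and 1, so `w * x_test + b` tracks `y_test` even though those inputs were never seen; the model learned the underlying relationship rather than a lookup table of its training points. Gradient descent on shared parameters, scaled up enormously, is the same mechanism at work in an LLM.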
The Chinese Room is not a valid argument, because the same logic can be applied to any human other than yourself: from the outside, another person's brain is also just neurons shuffling signals around, yet we don't conclude that they lack understanding.