Symbol grounding problem


In cognitive science and semantics, the symbol grounding problem concerns how it is that words get their meanings, and hence is closely related to the problem of what meaning itself really is. The problem of meaning is in turn related to the problem of how it is that mental states are meaningful, hence to the problem of consciousness: what is the connection between certain physical systems and the contents of subjective experiences.

Background

Referents

distinguished a referent and the word's meaning. This is most clearly illustrated using the proper names of concrete individuals, but it is also true of names of kinds of things and of abstract properties: "Tony Blair", "the prime minister of the UK during the year 2004", and "Cherie Blair's husband" all have the same referent, but not the same meaning.
Some have suggested that the meaning of a word is the rule or features that one must use in order to successfully pick out its referent. In that respect, and come closer to wearing their meanings on their sleeves, because they are explicitly stating a rule for picking out their referents: "Find whoever was prime minister of the UK during the year 2004", or "find whoever is Cherie's current husband". But that does not settle the matter, because there's still the problem of the meaning of the components of that rule, and how to pick them out.
The phrase "Tony Blair" does not have this recursive component problem, because it points straight to its referent, but how? If the meaning is the rule for picking out the referent, what is that rule, when we come down to non-decomposable components like proper names of individuals ?

Referential process

Humans are able to pick out the intended referents of words, such as "Tony Blair" or "bachelor," but this process need not be explicit. It is probably an unreasonable expectation to know the explicit rule for picking out the intended referents.
So if we take a word's meaning to be the means of picking out its referent, then meanings are in our brains. That is meaning in the narrow sense. If we use "meaning" in a wider sense, then we may want to say that meanings include both the referents themselves and the means of picking them out. So if a word is located inside an entity that can use the word and pick out its referent, then the word's wide meaning consists of both the means that that entity uses to pick out its referent, and the referent itself: a wide causal nexus between a head, a word inside it, an object outside it, and whatever "processing" is required in order to successfully connect the inner word to the outer object.
But what if the "entity" in which a word is located is not a head but a piece of paper ? What is its meaning then? Surely all the words on this screen, for example, have meanings, just as they have referents.
In the 19th century, the semiotician Charles Sanders Peirce suggested what some think is a similar model: according to his triadic sign model, meaning requires an interpreter, a sign or representamen, an object, and is the virtual product of an endless regress and progress called Semiosis. Some have interpreted Peirce as addressing the problem of grounding, feelings, and intentionality for the understanding of semiotic processes. In recent years, Peirce's theory of signs has been rediscovered by an increasing number of artificial intelligence researchers in the context of symbol grounding problem.

Grounding process

There would be no connection at all between written symbols and any intended referents if there were no minds mediating those intentions, via their own internal means of picking out those intended referents.
So the meaning of a word on a page is "ungrounded." Nor would looking it up in a dictionary help: If one tried to look up the meaning of a word one did not understand in a dictionary of a language one did not already understand, one would just cycle endlessly from one meaningless definition to another. One's search for meaning would be ungrounded.
In contrast, the meaning of the words in one's head—those words one does understand—are "grounded". That mental grounding of the meanings of words mediates between the words on any external page one reads and the external objects to which those words refer.

Requirements for symbol grounding

Another symbol system is natural language. On paper or in a computer, language, too, is just a formal symbol system, manipulable by rules based on the arbitrary shapes of words. But in the brain, meaningless strings of squiggles become meaningful thoughts. Harnad has suggested two properties that might be required to make this difference:
One property that static paper or, usually, even a dynamic computer lack that the brain possesses is the capacity to pick out symbols' referents. This is what we were discussing earlier, and it is what the hitherto undefined term "grounding" refers to. A symbol system alone, whether static or dynamic, cannot have this capacity, because picking out referents is not just a computational property; it is a dynamical property.
To be grounded, the symbol system would have to be augmented with nonsymbolic, sensorimotor capacities—the capacity to interact autonomously with that world of objects, events, actions, properties and states that its symbols are systematically interpretable as referring to. It would have to be able to pick out the referents of its symbols, and its sensorimotor interactions with the world would have to fit coherently with the symbols' interpretations.
The symbols, in other words, need to be connected directly to their referents; the connection must not be dependent only on the connections made by the brains of external interpreters like us. Just the symbol system alone, without this capacity for direct grounding, is not a viable candidate for being whatever it is that is really going on in our brains when we think meaningful thoughts.
Meaning as the ability to recognize instances or perform actions is specifically treated in the paradigm called "Procedural Semantics", described in a number of papers including "Procedural Semantics" by Philip N. Johnson-Laird and expanded by William A. Woods in "Meaning and Links". A brief summary in Woods' paper reads: "The idea of procedural semantics is that the semantics of natural language sentences can be characterized in a formalism whose meanings are defined by abstract procedures that a computer can either execute or reason about. In this theory the meaning of a noun is a procedure for recognizing or generating instances, the meaning of a proposition is a procedure for determining if it is true or false, and the meaning of an action is the ability to do the action or to tell if it has been done."

Consciousness

The necessity of groundedness, in other words, takes us from the level of the pen-pal Turing test, which is purely symbolic, to the robotic Turing test, which is hybrid symbolic/sensorimotor. Meaning is grounded in the robotic capacity to detect, categorize, identify, and act upon the things that words and sentences refer to. On the other hand, if the symbols refer to the very bits of '0' and '1', directly connected to their electronic implementations, which a computer system can readily manipulate, then even non-robotic computer systems could be said to be "sensorimotor" and hence able to "ground" symbols in this narrow domain.
To categorize is to do the right thing with the right kind of thing. The categorizer must be able to detect the sensorimotor features of the members of the category that reliably distinguish them from the nonmembers. These feature-detectors must either be inborn or learned. The learning can be based on trial and error induction, guided by feedback from the consequences of correct and incorrect categorization; or, in our own linguistic species, the learning can also be based on verbal descriptions or definitions. The description or definition of a new category, however, can only convey the category and ground its name if the words in the definition are themselves already grounded category names. So ultimately grounding has to be sensorimotor, to avoid infinite regress.
But if groundedness is a necessary condition for meaning, is it a sufficient one? Not necessarily, for it is possible that even a robot that could pass the Turing test, "living" amongst the rest of us indistinguishably for a lifetime, would fail to have in its head what Searle has in his: It could be a p-zombie, with no one home, feeling feelings, meaning meanings. However, it is possible that different interpreters would have different mechanisms for producing meaning in their systems, thus one cannot require that a system different from a human "experiences" meaning in the same way that a human does, and vice-versa.
Harnad thus points at consciousness as a second property. The problem of discovering the causal mechanism for successfully picking out the referent of a category name can in principle be solved by cognitive science. But the problem of explaining how consciousness could play an "independent" role in doing so is probably insoluble, except on pain of telekinetic dualism. Perhaps symbol grounding is enough to ensure that conscious meaning is present, but then again, perhaps not. In either case, there is no way we can hope to be any the wiser—and that is Turing's methodological point.

Formulation

To answer this question we have to formulate the symbol grounding problem itself :

Functionalism

There is a school of thought according to which the computer is more like the brain—or rather, the brain is more like the computer: According to this view, the future theory explaining how the brain picks out its referents, will be a purely computational one. A computational theory is a theory at the software level. It is essentially a computer algorithm: a set of rules for manipulating symbols. And the algorithm is "implementation-independent." That means that whatever it is that an algorithm is doing, it will do the same thing no matter what hardware it is executed on. The physical details of the dynamical system implementing the computation are irrelevant to the computation itself, which is purely formal; any hardware that can run the computation will do, and all physical implementations of that particular computer algorithm are equivalent, computationally.
A computer can execute any computation. Hence once computationalism finds a proper computer algorithm, one that our brain could be running when there is meaning transpiring in our heads, meaning will be transpiring in that computer too, when it implements that algorithm.
How would we know that we have a proper computer algorithm? It would have to be able to pass the Turing test. That means it would have to be capable of corresponding with any human being as a pen-pal, for a lifetime, without ever being in any way distinguishable from a real human pen-pal.

Searle's Chinese room argument

formulated the "Chinese room argument" in order to disprove computationalism. The Chinese room argument is based on a thought experiment: in it, Searle stated that if the Turing test were conducted in Chinese, then he himself, Searle, could execute a program that implements the same algorithm that the computer was using without knowing what any of the words he was manipulating meant.
At first glance, it would seem that if there's no meaning going on inside Searle's head when he is implementing that program, then there's no meaning going on inside the computer when it is the one implementing the algorithm either, computation being implementation-independent. But on a closer look, for a person to execute the same program that a computer would, at very least it would have to have access to a similar bank of memory that the computer has. This means that the new computational system that executes the same algorithm is no longer just Searle's original head, but that plus the memory bank.
In particular, this additional memory could store a digital representation of the intended referent of different words, that the algorithm would use as a model of, and to derive features associated with, the intended referent. The "meaning" then is not to be searched in just Searle's original brain, but in the overall system needed to process the algorithm..
Thus, Searle's not perceiving any meaning in his head alone when simulating the work of a computer, does not imply lack of meaning in the overall system, and thus in the actual computer system passing an advanced Turing test.

Implications

How does Searle know that there is no meaning going on in his head when he is executing such a Turing-test-passing program? Exactly the same way he knows whether there is or is not meaning going on inside his head under any other conditions: He understands the words of English, whereas the Chinese symbols that he is manipulating according to the algorithm's rules mean nothing whatsoever to him. However, the complete system that is manipulating those Chinese symbols – which is not just Searle's brain, as explained in the previous section – may have the ability to extract meaning from those symbols, in the sense of being able to use internal models of the intended referents, pick out the intended referents of those symbols, and generally identifying and using their features appropriately.
Note that in pointing out that the Chinese words would be meaningless to him under those conditions, Searle has appealed to consciousness. Otherwise one could argue that there would be meaning going on in Searle's head under those conditions, but that Searle himself would simply not be conscious of it. That is called the to Searle's Chinese Room Argument, and Searle the Systems Reply as being merely a reiteration, in the face of negative evidence, of the very thesis that is on trial in his thought-experiment: "Are words in a running computation like the ungrounded words on a page, meaningless without the mediation of brains, or are they like the grounded words in brains?"
In this either/or question, the word "ungrounded" has implicitly relied on the difference between inert words on a page and consciously meaningful words in our heads. And Searle is asserting that under these conditions, the words in his head would not be consciously meaningful, hence they would still be as ungrounded as the inert words on a page.
So if Searle is right, that both the words on a page and those in any running computer program are meaningless in and of themselves, and hence that whatever it is that the brain is doing to generate meaning can't be just implementation-independent computation, then what is the brain doing to generate meaning ?

Brentano's notion of intentionality

"Intentionality" has been called the "mark of the mental" because of some observations by the philosopher Brentano to the effect that mental states always have an inherent, intended object or content toward which they are "directed": One sees something, wants something, believes something, desires something, understands something, means something etc., and that object is always something that one has in mind. Having a mental object is part of having anything in mind. Hence it is the mark of the mental. There are no "free-floating" mental states that do not also have a mental object. Even hallucinations and imaginings have an object, and even feeling depressed feels like something. Nor is the object the "external" physical object, when there is one. One may see a real chair, but the "intentional" object of one's "intentional state" is the mental chair one has in mind.
If this all sounds like skating over the surface of a problem rather than a real break-through, then the foregoing description has had its intended effect: No, the problem of intentionality is not the symbol grounding problem; nor is grounding symbols the solution to the problem of intentionality. The symbols inside an autonomous dynamical symbol system that is able to pass the robotic Turing test are grounded, in that, unlike in the case of an ungrounded symbol system, they do not depend on the mediation of the mind of an external interpreter to connect them to the external objects that they are interpretable as being "about"; the connection is autonomous, direct, and unmediated. But grounding is not meaning. Grounding is an input/output performance function. Grounding connects the sensory inputs from external objects to internal symbols and states occurring within an autonomous sensorimotor system, guiding the system's resulting processing and output.
Meaning, in contrast, is something mental. But to try to put a halt to the name-game of proliferating nonexplanatory synonyms for the mind/body problem without solving it, let us cite just one more thing that requires no further explication: feeling. The only thing that distinguishes an internal state that merely has grounding from one that has meaning is that it feels like something to be in the meaning state, whereas it does not feel like anything to be in the merely grounded functional state. Grounding is a functional matter; feeling is a felt matter. And that is the real source of Brentano's vexed peekaboo relation between "intentionality" and its internal "intentional object": All mental states, in addition to being the functional states of an autonomous dynamical system, are also feeling states: Feelings are not merely "functed," as all other physical states are; feelings are also felt.
Hence feeling is the real mark of the mental. But the symbol grounding problem is not the same as the mind/body problem, let alone a solution to it. The mind/body problem is actually the feeling/function problem: Symbol-grounding touches only its functional component. This does not detract from the importance of the symbol grounding problem, but just reflects that it is a keystone piece to the bigger puzzle called the mind.
The neuroscientist Antonio Damasio investigates this marking function of feelings and emotions in his Somatic marker hypothesis. Damasio adds the notion of biologic homeostasis to this discussion, presenting it as an automated bodily regulation process providing intentionality to a mind via emotions. Homeostasis is the mechanism that keeps all bodily processes in healthy balance. All of our actions and perceptions will be automatically "evaluated" by our body hardware according to their contribution to homeostasis. This gives us an implicit orientation on how to survive. Such bodily or somatic evaluations can come to our mind in the form of conscious and non-conscious feelings and lead our decision-making process. The meaning of a word can be roughly conceptualized as the sum of its associations and their expected contribution to homeostasis, where associations are reconstructions of sensorimotor perceptions that appeared in contiguity with the word. Yet, the Somatic marker hypothesis is still hotly debated and critics claim that it has failed to clearly demonstrate how these processes interact at a psychological and evolutionary level. The recurrent question that the Somatic marker hypothesis does not address remains: how and why does homeostasis become felt homeostasis?