I continue to obsess about how language knowledge should be represented in ACT-R.
ACT-R makes some strong, pervasive claims about how knowledge is stored. The main claim is that knowledge is represented in declarative and procedural memories. Procedural knowledge can only be
used - it is know-how, stored as a (large!) collection of rules. Rules watch for their condition-of-application to occur, and when it does, they fire - they 'do their thing.' And that's
all you can do with a rule. They can't be consciously recalled, described, recognized, modified, or even willfully created. The
only thing we can do with a rule is apply it, when applicable.
Declarative knowledge is factual, propositional, or representational - concepts and their relationships, sounds, visual 'images' (whose structure and properties are poorly understood), and perhaps 'action schemas' that are perhaps a kind of abstract
vignette. The units of declarative memory are called
chunks.
We can call declarative chunks into conscious awareness and operate on them (using rules). Common operations are recognition, comparison, combination, analysis (finding parts) and association (finding similar things). At a higher level, we can
describe declarative knowledge if we have applicable language skills. Failing at description, we should at minimum be able to recognize and compare chunks - "Have you seen
this before?" "Is
this like
that?"
I personally have not seen anything that suggests to me that a chunk can be invisible - either inaccessible to conscious awareness, or indescribable/ineffable. First, it is almost a given that a chunk that can affect cognition can be brought into awareness. In ACT-R, it is a given.
Yes, a chunk's normal use could be so habitual and fast that we are not normally aware of using it, but we can still direct our awareness to it and find it.
Yes, the sensation of 'seeing red', the so-called qualia, feels ineffable. I take this to be a peculiar sensation associated with contemplating an inherently subjective concept, arising from our knowledge that inherently subjective concepts cannot be communicated! Philosophers writing about 'seeing red' have no trouble finding words, would that they had more.
Let's turn our attention to language-specific knowledge.
It's uncontroversial to say that grammar is not declarative knowledge - native speakers do not know the grammar of their language. Indeed, strikingly, we don't seem to have declarative representations of even the most basic grammatical categories - nouns, verbs, prepositions, adjectives, adverbs, noun phrases, clauses: Do you experience any of these as natural mental categories? Children who are fluent in their native language struggle to grasp the concept of 'noun' or 'verb'. You don't see students going "OH! THOSE things! Those are called nouns?" - quite the contrary, they scratch their heads and have dozens of questions. Is "running" a noun or a verb? Is "myself" a noun? Is "play" a noun, a verb, or an adjective?
The abstract non-leaf categories are just that much more so.
On the other hand, people know by the end of the last word whether a sentence is 'acceptable' or not, and (at least for educated adults?) there seems to be a clear distinction between
ungrammatical and
nonsense. Viz Jabberwocky.
In ACT-R, this implies that grammar is implicit knowledge, and it must primarily be stored as procedural knowledge, as rules. So... we imagine a bunch of parsing rules, that look at sequences of words and somehow group them and assign meaning. And for speaking, a bunch of rules that take intention, context and concept and create sequences of words.
Don't they have to be two separate sets of rules? How does that work? How do they stay in sync?
This does fit with the observation that children and 2nd language learners can understand a much richer language than they can generate, including grammatical forms. And we can understand idiosyncratic speech and dialect that we ourselves cannot fluently produce. Does anybody not understand Yoda in Star Wars? And how many people can fluently imitate Yoda's dialect?
On the other hand, we can hear and understand (even vaguely) a new word, and then use it immediately. Kids pick up slang instantly. Adults pick up jargon in their areas of interest - computer geeks, chemists, biologists. lawyers, doctors, woodworkers, farmers...
But here we are talking about content words, or sometimes - confusingly - multiword idioms, multiword verbs and separable verbs: "old hand", "early bird", "screw up", "get over", "take to".
Try this: What does "farm" mean? Now compare: What does "myself" mean? Or "there"? Or "to"?
Some ConclusionsThe stuff we can learn and change quickly is declarative. Content words are stored as declarative bundles, but not just individual words, any idiom gets the same treatment - idioms, multiword verbs, separable verbs. The word that appears to be a preposition in a separable verb is not necessarily functioning as a preposition, such as the "over" in "get over". Seems to me that "get over" is a separable transitive verb. Individual words, and 'conventional' combinations of words are stored as declarative chunks, forming the
lexicon.
Grammar - patterning beyond what comes from the lexicon - is handled entirely procedurally.
Syntax is never represented in declarative form - unless we study grammar or linguistics!
There are no declarative chunks for grammar categories or grammatical rules.
Parsing does not involve activation, construction, or modification of representations of abstract syntactic entities such as VP, Head-Phrase, Determiner, etc. Similarly these things are not used in language generation.
During parsing and generation, the intermediate form must be a mixture of words, lexemes (abstract word concepts), and concept/chunks. There are abstract concepts that are expected and constructed, but these are
explicable concepts not grammatical categories. I'm proposing that, for example, the word 'while' is followed by a description of a process or condition - not by an SC (subject-complement?)
And here's a thing that causes no end of confusion: Because we use rules to parse and generate language, those rules become tuned with use, and we compress sequences of steps into single steps. In ACT-R terms, we compile new rules to accelerate and shorten common mental procedures. Over time, our declarative linguistic knowledge becomes baked into rules derived from declarative knowledge and reinforced by practice, blurring the distinction between declarative and procedural and creating a vast landscape of 'fuzzy syntax' - infrequent constructions that are primarily declarative, high-frequency constructions that are purely procedural and automatic, and
every possible intermediate blend. Linguistics treats this as the hinterlands, populated by scruffy lawless barbarians - but maybe this is where the native speakers live.
JustificationOnly declarative memory can consistently change quickly (fractions of a second for new chunks) and declarative memory is what our language understanding rules and language production rules have in common. Language knowledge that can quickly go from understanding to production (or the other way 'round?) is declarative.
The divergence of behavior from factual knowledge - when what we say diverges from what we do - is a hallmark of knowledge being stored in both declarative and procedural form: Procedural knowledge ultimately determines what we do; What we explain comes from declarative memory. That we can use function words without being able to define them indicates that our use of them is procedural.
For example, "while" is followed by some kind of description of a process or a condition. I'm sure you and your parser expect and look for a kind of thing following the word 'while'. You might say "while
something was going on" or "while
something was true". Such an abstract construction - "something was going on" - is as close as we get to a syntactic non-terminal.
SpeculationMaybe there are some language skills (rules) that can work both comprehension and production. For example, if we partially parse what we are planning to say, then the comprehension system could modify, interfere with, or even abort something before or during production. To what extent this happens pre-verbally - before we hear ourselves saying it - I don't know.
Predictions- Reading, understanding and generating language that uses common words and constructions will require minimal access to declarative memory. To the extent that ACT-R allows it as cognitively plausible, comprehending and generating common constructions will be handled entirely by rules. When you hear "Good morning" you don't need to access memory to determine that "good" is an adjective, or that "morning" is a noun, or to recognize or construct a representation of a "noun phrase", or look up this idiom and discover that it is marked "conventional greeting". If procedural rules can translate directly from two words to an intentional signal ('speaker is greeting me in a customary way') then I'm sure they do, with minimal memory access.
- Generating and comprehending high-speed speech (compressed, or e.g. in competitive debate) will require a much higher percentage of formulaic, familiar words, idioms and constructions.
- Bet there's an inverse correlation between flexibility of idioms and frequency of use. Not for every idiom, but as a general trend.