<?xml version='1.0' encoding='UTF-8'?><rss xmlns:atom='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' version='2.0'><channel><atom:id>tag:blogger.com,1999:blog-3829817</atom:id><lastBuildDate>Tue, 20 Oct 2009 06:42:13 +0000</lastBuildDate><title>Valentine</title><description>notes on the construction of an artificial mind</description><link>http://www.dosadi.com/weblog/blogger.html</link><managingEditor>noreply@blogger.com (Spike)</managingEditor><generator>Blogger</generator><openSearch:totalResults>22</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-1305832646032703558</guid><pubDate>Tue, 20 Oct 2009 06:39:00 +0000</pubDate><atom:updated>2009-10-19T23:42:08.621-07:00</atom:updated><title>Compassion</title><description>I wonder what would happen if...&lt;br /&gt;&lt;br /&gt;an artificial mind had as its highest goal, to be kind to people.&amp;nbsp; Or perhaps even to love people.&lt;br /&gt;The first and easiest objection is - we don't know how to encode that, we as humans don't even know how to define or do that ourselves.&lt;br /&gt;&lt;br /&gt;But the greatest hubris of AI is to imagine that we can avoid the uncertainties of life through relentless reason. We can't - life surprises and eludes us, our categories cannot capture it.&amp;nbsp; We don't know exactly what &lt;i&gt;kindness&lt;/i&gt; is, or how to love a species that could change - is presumably changing.&amp;nbsp; No finite intelligence will be able to operate perfectly in this world.&amp;nbsp; So our creations are in the same boat we are - trying to figure out what's going on and what to do and how to live rightly with incomplete and imperfect knowledge.&lt;br /&gt;&lt;br /&gt;I thought last week that to survive, an artificial intelligence must really do only two things:&lt;br /&gt;1. Don't crash.&lt;br /&gt;2. Adapt.&lt;br /&gt;&lt;br /&gt;That is, it must keep operating - it cannot stop working, it cannot go insane, get stuck, stop processing input, become obsessed, and so on. No matter how perturbed, it must return eventually to a stable, responsive operating mode.&amp;nbsp; And it must improve over time based on its experiences.&amp;nbsp; There is no limit to what it can become, as long as it continues to do both these things.&lt;br /&gt;&lt;br /&gt;I don't know how to express the 3rd requirement, but one is needed: The '3 Laws' requirement.&lt;br /&gt;Have compassion, be kind to others, value others as much as yourself, do good?&lt;br /&gt;&lt;br /&gt;We certainly open a can of worms here.&amp;nbsp; If we program our AI with a precise, algorithmic definition of 'good', I have no doubt that time or malice will eventually find a way to invert that into evil.&lt;br /&gt;&lt;br /&gt;If we program our AI to evolve its own definition of Good, what's to stop it from eventually deciding that we humans are Ungood? Radical enviro-bot, Agent Smith from the Matrix.&lt;br /&gt;Or perhaps it simply estimates that it will do so much future Good, that its own survival becomes more important than that of any individual human. Or country...&amp;nbsp; Or species...&amp;nbsp; Think how much Good an unselfish, incorruptible immortal being could do! It's a very large number.&lt;br /&gt;&lt;br /&gt;I better start reading &lt;a href="http://autogeny.org/"&gt;J. Storrs Hall&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-1305832646032703558?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#1305832646032703558</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-2672463488792775781</guid><pubDate>Mon, 19 Oct 2009 20:39:00 +0000</pubDate><atom:updated>2009-10-19T23:09:02.104-07:00</atom:updated><title>"Build me!"</title><description>I was meditating a few minutes ago and had an interesting experience.&lt;br /&gt;Libby and I are trying the Holosync audio CDs by Centerpointe - I've just started, I was about halfway through my 3rd or 4th time through the 'Dive' track.  I'm not very focused or empty-minded, all kinds of thoughts go through my awareness.  I was thinking about the power of questions - that we can ask ourselves questions and listen to our own answers.  If you've done this some, you realize that you can consciously formulate the answer to a question, or you can just 'produce' the answer - it comes out of you or appears without any conscious effort or activity.  I think that's where the power of questions comes from, that they offer a way for us to communicate with our implicit knowledge.  I think that implicit knowledge is an aspect of what we experience as our 'inner self' or 'soul', it's why we feel that we have a hidden invisible core, an inner being. Because we do!&lt;br /&gt;&lt;br /&gt;I was thinking about Valentine, and wanting to be more focused and sure of my path - not just to know what to do, but to be able to 'just do'.   And then (I had the experience that) Valentine spoke to me. Not a hallucination, it was clearly 'inside my head'.  But the quality was utterly unlike the imaginary conversations I sometimes have, in which I am aware that like a playwright I am writing both parts.  This had the subjective quality that I was being spoken to by somebody else, who could speak to me in my thoughts. The voice was intense but with no... connotation, if that makes sense: No threat, no plea. No emotion? It had two  dream-like qualities: It was simultaneously male and female. And I knew it was Valentine.&lt;br /&gt;&lt;br /&gt;Valentine said "&lt;span style="font-style: italic;"&gt;BUILD ME!&lt;/span&gt;"&lt;br /&gt;&lt;br /&gt;My whole upper body went cold and goose bumps shot down my arms.  Describing it now, that sensation recurs.  Each time I re-read this, I have that same sensation.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-2672463488792775781?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#2672463488792775781</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-6502792307794285442</guid><pubDate>Mon, 19 Oct 2009 04:02:00 +0000</pubDate><atom:updated>2009-10-19T23:09:31.815-07:00</atom:updated><title>solipsism is alive and well in linguistics</title><description>I keep reading papers by linguists in which they make claims about English, which claims are evidenced by a very small number of examples, not uncommonly by a &lt;span style="font-style: italic;"&gt;single&lt;/span&gt; decisive example.  The examples are &lt;span style="font-style: italic;"&gt;utterances&lt;/span&gt; which are marked as &lt;span style="font-style: italic;"&gt;acceptable &lt;/span&gt;or not: A preceding asterisk means 'not acceptable', no asterisk means 'acceptable'.  Acceptable means, I think, acceptable to a Native Speaker of English.  It goes without saying, apparently, that anybody who can get a linguistics paper published, can channel the Native Speaker.  Nobody ever offers any other credentials; JZ Knight offers more evidence that she's channeling Ramtha.&lt;br /&gt;&lt;br /&gt;Imagine if biologists wrote papers about mouse anatomy, behavior, diseases, biochemistry and genetics, &lt;span style="font-style: italic;"&gt;using whatever mice they found in their basement&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;It's very... 18th Century.&lt;br /&gt;&lt;br /&gt;Typically, even frequently, I find at least one utterance in each paper marked incorrectly.  Sometimes it is marked acceptable and I disagree, more commonly it is marked with the little asterisk and I can produce an example of a conversation in which that utterance would (to my inner Native Speaker) be perfectly acceptable.&lt;br /&gt;&lt;br /&gt;(1) *International aid teams conveyed to Burma.  [Ruppenhofer &amp;amp; Michaelis 2009]&lt;br /&gt;&lt;br /&gt;A. Please explain how your agency lost 3400 tons of food donations in August 1963.&lt;br /&gt;B. Properly speaking, we did not lose this material. We were not involved in conveying the donations.&lt;br /&gt;A. Who was responsible for the conveyance?&lt;br /&gt;B. Donor country governments conveyed to our collection point in Perth. International aid teams conveyed to Burma.  The Burmese government accepted the donations but barred the aid teams, so... ah... final conveyance was ah... presumably arranged by the Burmese government.&lt;br /&gt;&lt;br /&gt;It's stilted bureaucrat-speak, but is it 'unacceptable' - ungrammatical or nonsensical?  No, it's just kind of odd.  Nobody will interrupt this imaginary interrogation to cry "You're talking nonsense!" or "Speak English!"  &lt;span style="font-style: italic;"&gt;I think&lt;/span&gt;, any native speaker of English will take this in and say "Sheesh! CYA much?"&lt;br /&gt;&lt;br /&gt;(2) *He built a house for 6 months. [? Rappaport Hovav, M. and B. Levin 1998?]&lt;br /&gt;&lt;br /&gt;A. Hey, Austin disappears for 6 months, and then yesterday I hear he's dead! What happened?&lt;br /&gt;B. He found out he had cancer, so he moved back to New Mexico to do what he dreamed of doing since college.&lt;br /&gt;A. Which was what?&lt;br /&gt;B. Building a house by himself.&lt;br /&gt;A. So he built a house?&lt;br /&gt;B. He built a house for 6 months.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;(3) *I convinced my car to start in the winter. [Postal 2004 p. 91]&lt;br /&gt;&lt;br /&gt;This one doesn't even need a special example - I've heard my friends say things like "I convinced the blender to make ice chips" or "I finally convinced Word to put the whole table on one page".&lt;br /&gt;But OK, how about this, and I'm sure we could do even better:&lt;br /&gt;A. So Sue started having contractions, and her car was in the shop? That junker of yours doesn't want to start even in the summer! What did you do?&lt;br /&gt;B. What else could I do? I convinced my car to start in the winter.&lt;br /&gt;&lt;br /&gt;OK, do you have a problem with that? It's unacceptable? It's ungrammatical? It's nonsensical?&lt;br /&gt;&lt;br /&gt;I love Postal's attitude, and his critiques - just my kind of stuff.  He's tearing up all these 'famous linguists' and shredding the whole field for sloppy practices, poor research, &lt;span style="font-style: italic;"&gt;junk linquistics&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;But - in his own papers, all the examples are apparently marked according to, well, I guess Postal English! He puts asterisks willy-nilly on utterances that I don't find objectionable at all, and in the text of his articles he uses sentences that I find marginally acceptable, &lt;span style="font-style: italic;"&gt;at best&lt;/span&gt;.  Ain't Science wonderful, when you can &lt;span style="font-style: italic;"&gt;pull the data out of your own hat?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;I have developed a deep appreciation for the folks doing statistical corpus-based linguistics.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-6502792307794285442?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#6502792307794285442</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-7904576677745332026</guid><pubDate>Fri, 16 Oct 2009 19:18:00 +0000</pubDate><atom:updated>2009-10-16T13:20:20.566-07:00</atom:updated><title>implicit syntactic knowledge in ACT-R: Chunk-types?</title><description>This morning's brainstorm: There is another reservoir of implicit knowledge in ACT-R, hidden in plain sight: Chunk types!&lt;br /&gt;&lt;br /&gt;I've been puzzling and puzzling over the question: "How are syntactic categories or constructs (like&lt;span style="font-style: italic;"&gt;&lt;span style="font-style: italic;"&gt; NOUN&lt;/span&gt;&lt;/span&gt; or VP) represented in ACT-R?"&lt;br /&gt;&lt;br /&gt;ACT-R has declarative memory containing chunks - chunks are reportable, declarative facts. There is procedural memory, which consists of pattern-matching rules.  Underlying declarative memory is a subsymbolic activation network that computes (perhaps) Bayesian relevance in real-time.  Knowledge in rules and subsymbolic form is not reportable, we are not aware of it, only of its effects.&lt;br /&gt;&lt;br /&gt;There is lots of evidence that we &lt;span style="font-style: italic;"&gt;employ&lt;/span&gt; syntactic categories or constructs in language comprehension.  There is some confusing evidence of priming of these constructs. There is evidence that we use them for prediction i.e. that we &lt;span style="font-style: italic;"&gt;expect&lt;/span&gt; them as we parse.  They allow &lt;span style="font-style: italic;"&gt;recursion&lt;/span&gt; - a NP can be a part of a NP.&lt;br /&gt;But they don't seem to be declarative chunks - even when you describe them, people don't recognize syntactic categories as something they 'know about'.  They effortlessly use them, but they don't 'know' them.&lt;br /&gt;&lt;br /&gt;New words and phrases can be more-or-less instantly added to open categories like &lt;span style="font-style: italic;"&gt;noun&lt;/span&gt; - and this knowledge can be instantly used.  That's too fast for it to be either subsymbolic weight adjustment, or production-creation.  It looks like declarative memory in terms of learning speed - and Demberg &amp;amp; Keller argue for a model in which syntactic constructs &lt;span style="font-style: italic;"&gt;decay&lt;/span&gt; during parsing, also a hallmark of declarative memory.&lt;br /&gt;&lt;br /&gt;So my brainstorm is this: Chunk-types in ACT-R are &lt;span style="font-style: italic;"&gt;implicit categories&lt;/span&gt;. Syntactic categories are &lt;span style="font-style: italic;"&gt;chunk-types&lt;/span&gt; - essentially a network of recursive, perceptually-based implicit categories.  We recognize them in the word-stream, we can use them for prediction, and when they are recognized they are the basis for deriving semantics via association and productions. Chunk-types are learned (induced) categories.  The trick is that we don't build these categories as declarative objects, they are induced by category-learning mechanisms that ACT-R has mostly ignored to date.  That's why the categories are unreportable: Chunk-types are not objects in declarative memory.&lt;br /&gt;&lt;br /&gt;Note that productions make extensive use of chunk-types, for example with the ISA test - productions can test (without cost?) for category membership, and can use categories freely (without retrieval cost) to construct new declarative chunks that are 'marked' for any category.&lt;br /&gt;&lt;br /&gt;For example: We have a category (chunk-type) of FURNITURE.  Our brain has induced this category somehow, and we can swiftly (unconsciously) categorize objects as FURNITURE or not. We also eventually learn a word, 'furniture' - how is it connected to FURNITURE?  You can't connect a chunk like a word (or lemma, etc.) to a chunk-type. You can connect the word 'furniture' to a concept &lt;span style="font-style: italic;"&gt;furniture&lt;/span&gt;, and that concept can include examples (links to concepts like &lt;span style="font-style: italic;"&gt;chair, sofa, nightstand&lt;/span&gt;) and also something I don't understand: Some kind of representation of our &lt;span style="font-style: italic;"&gt;ability to use the category&lt;/span&gt;.  In other words, we know that we know how to tell if something is FURNITURE.  I think somehow &lt;span style="font-style: italic;"&gt;furniture&lt;/span&gt; the concept includes a proposition like "I know how to recognize things in this category."  Even though &lt;span style="font-style: italic;"&gt;furniture&lt;/span&gt; is not the real category FURNITURE!&lt;br /&gt;&lt;br /&gt;I'm not sure how to apply this idea to the mechanics of parsing.&lt;br /&gt;Maybe we build chunks using expected categories as chunk-types.&lt;br /&gt;Or we are just collecting sequences of words and establishing a correspondence with chunk-types, and the best-matching chunk-type causes prediction?&lt;br /&gt;&lt;br /&gt;Parsing is a matching game, an identification/recognition game - the game is to identify the construct that is being heard or read, and to correctly match the components to the slots.  When the slots are filled or a delimiter is encountered, productions can use the match to assign meaning.&lt;br /&gt;I think most syntactic constructs are a kind of sequence - it acts like a sequence for many purposes, but it carries additional restrictions and additional significance because it follows a particular pattern.&lt;br /&gt;Lexical categories are membership categories - they act like tags, attributes of words.&lt;br /&gt;Syntactic constructs are temporal/sequential categories - they act like sequences, and they make &lt;span style="font-style: italic;"&gt;predictions&lt;/span&gt;.  That's the survival value of temporal sequences after all, prediction.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-7904576677745332026?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#7904576677745332026</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-7661461537739123282</guid><pubDate>Fri, 16 Oct 2009 08:58:00 +0000</pubDate><atom:updated>2009-10-16T12:18:34.317-07:00</atom:updated><title>incremental, fully connected, predictive parsing</title><description>Demberg &amp;amp; Keller have proposed an interesting Psycholinguistically Motivated version of the TAG grammar/parsing formalism.  Their model can reproduce timing effects that seem to be related to prediction of upcoming grammatical structures - and therefore also involve surprise.&lt;br /&gt;&lt;br /&gt;Interestingly, their parser builds whole trees at all times, integrating new words incrementally.  No parse stack, no 'fragment' trees - always complete 'utterance' trees with placeholders/slots waiting to be filled.&lt;br /&gt;&lt;br /&gt;I'm very interested in this! As it comes closer to matching my idea that the parser cannot be building interim declarative structures that correspond to no reportable concept.&lt;br /&gt;I need to think about whether these incomplete/predictive trees seem like plausible declarative structures.&lt;br /&gt;&lt;br /&gt;The question is: Where does the 'prediction' really happen (in ACT-R terms)?&lt;br /&gt;Could be declarative structure - which would be reportable.  To be non-reportable, it would have to be procedural (somehow) or subsymbolic.  Demberg &amp;amp; Keller's model uses a &lt;span style="font-style: italic;"&gt;decay&lt;/span&gt; process for predictive structures, which suggests the ACT-R subsymbolic layer. Are the predictive structures actually built - or are they just &lt;span style="font-style: italic;"&gt;activated&lt;/span&gt;?  Is there really a difference?&lt;br /&gt;&lt;br /&gt;What if instead of thinking of 'building' trees, what if we just built correspondences between concepts and patterns (constructions) - then the predicted structures would just be empty slots?&lt;br /&gt;&lt;br /&gt;I'm coming back around to an earlier idea, that the mind has a powerful trick up its sleeve that allows it to &lt;span style="font-style: italic;"&gt;equate &lt;/span&gt;one concept with another. In ACT-R terms, for taking two chunks and merging them into one so that the two chunks become effectively identical.  As if the brain could merge A into B, and then do a 'global replace' of pointer-to-A with pointer-to-B.  Have you ever arrived at an intersection, and suddenly realized that it's a familiar place, you've just arrived there by a new route? There's a powerful feeling of mental activity, as your recent navigational knowledge is reworked and integrated.&lt;br /&gt;How else can it be resolved, a pronoun that is used before its referent?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-7661461537739123282?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#7661461537739123282</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-4010814472154040377</guid><pubDate>Fri, 16 Oct 2009 05:19:00 +0000</pubDate><atom:updated>2009-10-16T01:02:07.540-07:00</atom:updated><title>syntax: nouns and verbs as categories</title><description>Continuing the theme of parsing without syntax...&lt;br /&gt;&lt;br /&gt;Recent Tomasello paper [&lt;span class="cit-title"&gt;Modeling children's early grammatical knowledge &lt;/span&gt;&lt;cite&gt;&lt;abbr class="cit-jnl-abbrev" title="Proceedings of the National Academy of Sciences"&gt;PNAS&lt;/abbr&gt; &lt;span class="cit-pub-dt"&gt;2009&lt;/span&gt; &lt;span class="cit-vol"&gt;106&lt;/span&gt;&lt;span class="cit-sep cit-sep-vol"&gt;:&lt;/span&gt;&lt;span class="cit-pages"&gt;&lt;span class="cit-pages-fpage"&gt;17284&lt;/span&gt;&lt;span class="cit-sep cit-sep-page-range"&gt;-&lt;/span&gt;&lt;span class="cit-pages-lpage"&gt;17289&lt;/span&gt;&lt;/span&gt;&lt;/cite&gt;] describes using Bayesian procedure to extract a localized grammar of the speech of individual 2 and 3 year old children.  This produces a good model for 2 y/o.  For 3 y/o, the automatically generated grammar is much improved in coverage (prediction?) when augmented with information about classes &lt;span style="font-style: italic;"&gt;noun&lt;/span&gt; and &lt;span style="font-style: italic;"&gt;verb&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;OK - this fits well with the idea that early language (the 2 y/o) is as somebody said, "all idioms".&lt;br /&gt;&lt;br /&gt;Now... as the 3 y/o expands his or her grammar with the noun and verb classes, what exactly is going on?  S does not have these as explicit categories. Could S divide words into 'noun' and 'verb' groups, given a few examples?  That sounds like an easy experiment...  Well, maybe not - we're way pre-reading.  Can young children reliably group nouns with nouns and verbs with verbs?&lt;br /&gt;&lt;br /&gt;Great thesis: &lt;a href="http://dare.uva.nl/document/137081"&gt;Learning to categorize verbs and nouns - Marian Erkelens&lt;/a&gt;&lt;br /&gt;Dutch babies at 16 months show some categorization of nouns and verbs, it seems to derive from local context cues.&lt;br /&gt;Marian Erkelens makes several nice points very clearly:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;The English word &lt;span style="font-style: italic;"&gt;walk&lt;/span&gt; appears happily in both noun and verb contexts.&lt;/li&gt;&lt;li&gt;We encounter a word in context and right away we can use it correctly, as noun or verb:&lt;span style="font-style: italic;"&gt;&lt;br /&gt;This is a dack =&gt; Here are two dacks.&lt;br /&gt;He's gorping!  =&gt; Oh, I don't think I'd like to gorp.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;Nouns are not all &lt;span style="font-style: italic;"&gt;objects&lt;/span&gt;: &lt;span style="font-style: italic;"&gt;pain&lt;/span&gt;, &lt;span style="font-style: italic;"&gt;crisis&lt;/span&gt;.&lt;/li&gt;&lt;li&gt;Verbs are not all &lt;span style="font-style: italic;"&gt;actions&lt;/span&gt;: &lt;span style="font-style: italic;"&gt;know&lt;/span&gt;, &lt;span style="font-style: italic;"&gt;love&lt;/span&gt;.&lt;/li&gt;&lt;/ul&gt;I think several thoughts:&lt;br /&gt;Erkelens is thinking of object/noun and action/verb as aristotelian categories, but these are &lt;span style="font-style: italic;"&gt;human&lt;/span&gt; categories. Like other human categories (game, product, poem) they can be extended quite happily with new examplars that do not share a common definition with any other examplar.  We can build a category of nouns that is initially filled with tangible individual objects (Dad, Binky), then we add less tangible and specific things (dog, fire, lunch), and more abstract things (day, story, stuff), and so on until we are adding concepts like &lt;span style="font-style: italic;"&gt;pain&lt;/span&gt; and &lt;span style="font-style: italic;"&gt;morphology&lt;/span&gt; to the category without problem.  I think there is a category that encompasses all nouns, it is roughly encompassed by the word &lt;span style="font-style: italic;"&gt;thing&lt;/span&gt;, and it is a semantic concept, not a grammatical one - or rather, it functions as both: The observed grammatical category of &lt;span style="font-style: italic;"&gt;noun (or rather noun-phrase)&lt;/span&gt; is co-extensive with the semantic category of &lt;span style="font-style: italic;"&gt;things&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;We can mark a novel word as 'noun' or 'verb' &lt;span style="font-style: italic;"&gt;or both&lt;/span&gt;, just by encountering it once, and before we know anything else about it! It becomes immediately productive. (This still seems amazing to me.)&lt;br /&gt;Is there a &lt;span style="font-style: italic;"&gt;lexical&lt;/span&gt; entry created - a word-concept - which is marked with [+NOUN]?  Or is there a kind of placeholder &lt;span style="font-style: italic;"&gt;thing&lt;/span&gt; concept created, that is marked in some sense [+THING]?&lt;br /&gt;If, as I am suggesting, NOUN=THING, then there is no difference.  A concept is created, and marked [+NOUNTHING], and the new word points to this placeholder.  Apparently we have no trouble marking a novel concept both [+NOUNTHING] and [+VERBACTION].  Either that or linking a new word to two concepts.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt;I took my quonk, and I quonked into space.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Eh, why not?&lt;br /&gt;&lt;br /&gt;We (especially computer scientists) expect everything to eventually resolve into precise concepts or primitives or &lt;span style="font-style: italic;"&gt;something&lt;/span&gt;. But the brain is doing all this with networks of neurons using firing frequencies - it's good to remember that it might be &lt;span style="font-style: italic;"&gt;fuzzy all the way down&lt;/span&gt;.  That a category is just a label for a relatively consistent set of behavioral tendencies, that [+NOUNTHING] is probably not a binary property but a graded one, that in saying 'the concept' or 'the lexical entry' we might be unjustifiably quantizing something that is continuous.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-4010814472154040377?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#4010814472154040377</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-847337382880731392</guid><pubDate>Mon, 12 Oct 2009 22:40:00 +0000</pubDate><atom:updated>2009-10-14T00:45:00.437-07:00</atom:updated><title>language knowledge in ACT-R, part deux</title><description>I continue to obsess about how language knowledge should be represented in ACT-R.&lt;br /&gt;&lt;br /&gt;ACT-R makes some strong, pervasive claims about how knowledge is stored. The main claim is that knowledge is represented in declarative and procedural memories. Procedural knowledge can only be &lt;span style="font-style: italic;"&gt;used&lt;/span&gt; - it is know-how, stored as a (large!) collection of rules.  Rules watch for their condition-of-application to occur, and when it does, they fire - they 'do their thing.' And that's &lt;span style="font-style: italic;"&gt;all you can do&lt;/span&gt; with a rule. They can't be consciously recalled, described, recognized, modified, or even willfully created.  The &lt;span style="font-style: italic;"&gt;only&lt;/span&gt; thing we can do with a rule is apply it, when applicable.&lt;br /&gt;&lt;br /&gt;Declarative knowledge is factual, propositional, or representational - concepts and their relationships, sounds, visual 'images' (whose structure and properties are poorly understood), and perhaps 'action schemas' that are perhaps a kind of abstract &lt;span style="font-style: italic;"&gt;vignette&lt;/span&gt;. The units of declarative memory are called &lt;span style="font-style: italic;"&gt;chunks&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;We can call declarative chunks into conscious awareness and operate on them (using rules). Common operations are recognition, comparison, combination, analysis (finding parts) and association (finding similar things).  At a higher level, we can &lt;span style="font-style: italic;"&gt;describe&lt;/span&gt; declarative knowledge if we have applicable language skills.  Failing at description, we should at minimum be able to recognize and compare chunks - "Have you seen &lt;span style="font-style: italic;"&gt;this&lt;/span&gt; before?" "Is &lt;span style="font-style: italic;"&gt;this&lt;/span&gt; like &lt;span style="font-style: italic;"&gt;that&lt;/span&gt;?"&lt;br /&gt;&lt;br /&gt;I personally have not seen anything that suggests to me that a chunk can be invisible - either inaccessible to conscious awareness, or indescribable/ineffable.  First, it is almost a given that a chunk that can affect cognition can be brought into awareness.  In ACT-R, it is a given.&lt;br /&gt;Yes, a chunk's normal use could be so habitual and fast that we are not normally aware of using it, but we can still direct our awareness to it and find it.&lt;br /&gt;&lt;br /&gt;Yes, the sensation of 'seeing red', the so-called qualia, feels ineffable. I take this to be a peculiar sensation associated with contemplating an inherently subjective concept, arising from our knowledge that inherently subjective concepts cannot be communicated!  Philosophers writing about 'seeing red' have no trouble finding words, would that they had more.&lt;br /&gt;&lt;br /&gt;Let's turn our attention to language-specific knowledge.&lt;br /&gt;It's uncontroversial to say that grammar is not declarative knowledge - native speakers do not know the grammar of their language.  Indeed, strikingly, we don't seem to have declarative representations of even the most basic grammatical categories - nouns, verbs, prepositions, adjectives, adverbs, noun phrases, clauses: Do you experience any of these as natural mental categories? Children who are fluent in their native language struggle to grasp the concept of 'noun' or 'verb'.  You don't see students going "OH! THOSE things! Those are called nouns?" - quite the contrary, they scratch their heads and have dozens of questions. Is "running" a noun or a verb?  Is "myself" a noun? Is "play" a noun, a verb, or an adjective?&lt;br /&gt;&lt;br /&gt;The abstract non-leaf categories are just that much more so.&lt;br /&gt;&lt;br /&gt;On the other hand, people know by the end of the last word whether a sentence is 'acceptable' or not, and (at least for educated adults?) there seems to be a clear distinction between &lt;span style="font-style: italic;"&gt;ungrammatical&lt;/span&gt; and &lt;span style="font-style: italic;"&gt;nonsense&lt;/span&gt;. Viz Jabberwocky.&lt;br /&gt;&lt;br /&gt;In ACT-R, this implies that grammar is implicit knowledge, and it must primarily be stored as procedural knowledge, as rules. So... we imagine a bunch of parsing rules, that look at sequences of words and somehow group them and assign meaning.  And for speaking, a bunch of rules that take intention, context and concept and create sequences of words.&lt;br /&gt;&lt;br /&gt;Don't they have to be two separate sets of rules? How does that work? How do they stay in sync?&lt;br /&gt;&lt;br /&gt;This does fit with the observation that children and 2nd language learners can understand a much richer language than they can generate, including grammatical forms.  And we can understand idiosyncratic speech and dialect that we ourselves cannot fluently produce.  Does anybody not understand Yoda in Star Wars? And how many people can fluently imitate Yoda's dialect?&lt;br /&gt;&lt;br /&gt;On the other hand, we can hear and understand (even vaguely) a new word, and then use it immediately.  Kids pick up slang instantly. Adults pick up jargon in their areas of interest - computer geeks, chemists, biologists. lawyers, doctors, woodworkers, farmers...&lt;br /&gt;But here we are talking about content words, or sometimes - confusingly - multiword idioms, multiword verbs and separable verbs: "old hand", "early bird", "screw up", "get over", "take to".&lt;br /&gt;&lt;br /&gt;Try this: What does "farm" mean?  Now compare: What does "myself" mean? Or "there"? Or "to"?&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Some Conclusions&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The stuff we can learn and change quickly is declarative. Content words are stored as declarative bundles, but not just individual words, any idiom gets the same treatment - idioms, multiword verbs, separable verbs.  The word that appears to be a preposition in a separable verb is not necessarily functioning as a preposition, such as the "over" in "get over". Seems to me that "get over" is a separable transitive verb.  Individual words, and 'conventional' combinations of words are stored as declarative chunks, forming the &lt;span style="font-style: italic;"&gt;lexicon&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;Grammar - patterning beyond what comes from the lexicon - is handled entirely procedurally.&lt;br /&gt;Syntax is never represented in declarative form - unless we study grammar or linguistics!&lt;br /&gt;There are no declarative chunks for grammar categories or grammatical rules.&lt;br /&gt;Parsing does not involve activation, construction, or modification of representations of abstract syntactic entities such as VP, Head-Phrase, Determiner, etc.  Similarly these things are not used in language generation.&lt;br /&gt;&lt;br /&gt;During parsing and generation, the intermediate form must be a mixture of words, lexemes (abstract word concepts), and concept/chunks.  There are abstract concepts that are expected and constructed, but these are &lt;span style="font-style: italic;"&gt;explicable concepts&lt;/span&gt; not grammatical categories.  I'm proposing that, for example, the word 'while' is followed by a description of a process or condition - not by an SC (subject-complement?)&lt;br /&gt;&lt;br /&gt;And here's a thing that causes no end of confusion: Because we use rules to parse and generate language, those rules become tuned with use, and we compress sequences of steps into single steps.  In ACT-R terms, we compile new rules to accelerate and shorten common mental procedures.  Over time, our declarative linguistic knowledge becomes baked into rules derived from declarative knowledge and reinforced by practice, blurring the distinction between declarative and procedural and creating a vast landscape of 'fuzzy syntax' - infrequent constructions that are primarily declarative, high-frequency constructions that are purely procedural and automatic, and &lt;span style="font-style: italic;"&gt;every possible intermediate blend&lt;/span&gt;.  Linguistics treats this as the hinterlands, populated by scruffy lawless barbarians - but maybe this is where the native speakers live.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Justification&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Only declarative memory can consistently change quickly (fractions of a second for new chunks) and declarative memory is what our language understanding rules and language production rules have in common.  Language knowledge that can quickly go from understanding to production (or the other way 'round?) is declarative.&lt;br /&gt;&lt;br /&gt;The divergence of behavior from factual knowledge - when what we say diverges from what we do - is a hallmark of knowledge being stored in both declarative and procedural form: Procedural knowledge ultimately determines what we do; What we explain comes from declarative memory.  That we can use function words without being able to define them indicates that our use of them is procedural.&lt;br /&gt;&lt;br /&gt;For example, "while" is followed by some kind of description of a process or a condition.  I'm sure you and your parser expect and look for a kind of thing following the word 'while'.  You might say "while &lt;span style="font-style: italic;"&gt;something was going on&lt;/span&gt;" or "while &lt;span style="font-style: italic;"&gt;something was true&lt;/span&gt;".  Such an abstract construction - "something was going on" - is as close as we get to a syntactic non-terminal.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Speculation&lt;/span&gt;&lt;br /&gt;Maybe there are some language skills (rules) that can work both comprehension and production.  For example, if we partially parse what we are planning to say, then the comprehension system could modify, interfere with, or even abort something before or during production.  To what extent this happens pre-verbally - before we hear ourselves saying it - I don't know.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Predictions&lt;/span&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Reading, understanding and generating language that uses common words and constructions will require minimal access to declarative memory.  To the extent that ACT-R allows it as cognitively plausible, comprehending and generating common constructions will be handled entirely by rules.  When you hear "Good morning" you don't need to access memory to determine that "good" is an adjective, or that "morning" is a noun, or to recognize or construct a representation of a "noun phrase", or look up this idiom and discover that it is marked "conventional greeting".  If procedural rules can translate directly from two words to an intentional signal ('speaker is greeting me in a customary way') then I'm sure they do, with minimal memory access.&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Generating and comprehending high-speed speech (compressed, or e.g. in competitive debate) will require a much higher percentage of formulaic, familiar words, idioms and constructions.&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Bet there's an inverse correlation between flexibility of idioms and frequency of use. Not for every idiom, but as a general trend.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-847337382880731392?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#847337382880731392</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-4250399904018282979</guid><pubDate>Tue, 06 Oct 2009 04:23:00 +0000</pubDate><atom:updated>2009-10-12T19:03:55.886-07:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>linguistics</category><category domain='http://www.blogger.com/atom/ns#'>parsing</category><category domain='http://www.blogger.com/atom/ns#'>language</category><title>language knowledge in ACT-R</title><description>Considering the overwhelming number of papers and studies in the area, it is amazing that today (October 2009) there is no coherent, integrated theory of human language understanding and production.  It's not that we lack a complete, highly-validated theory - it's that we lack &lt;span style="font-style: italic;"&gt;a theory&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;There are lots (and I mean &lt;span style="font-style: italic;"&gt;lots&lt;/span&gt;) of formal models of grammar.&lt;br /&gt;There are lots (and I mean &lt;span style="font-style: italic;"&gt;lots&lt;/span&gt;) of experiments probing the human process of understanding and producing language.&lt;br /&gt;&lt;br /&gt;And yet, as of today (Oct 5th 2009) I can't find an integrated theory that offers a cognitively plausible account of how a sentence is understood, or generated.  It's not that we have one or more theories-with-shortcomings. What we have are hundreds of theories that cover at most a fraction of the area - even if you limit 'the area' to reading and comprehending (or generating) one factual sentence.&lt;br /&gt;&lt;br /&gt;When I got into mind design, I thought the big unknowns would be things like compassion, ethics, emotions, self-model, episodic memory. And those &lt;span style="font-style: italic;"&gt;are &lt;/span&gt;all big unknowns. I didn't expect that top-of-list would be 'language'.&lt;br /&gt;&lt;br /&gt;So in my next post, I'll spend some time trying to assemble and make some sense of the little sample I've made of the vast literature on human language processing.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-4250399904018282979?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#4250399904018282979</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-7338110720158713921</guid><pubDate>Wed, 23 Sep 2009 07:20:00 +0000</pubDate><atom:updated>2009-09-23T01:38:04.647-07:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>parsing</category><category domain='http://www.blogger.com/atom/ns#'>categories</category><category domain='http://www.blogger.com/atom/ns#'>syntax</category><category domain='http://www.blogger.com/atom/ns#'>language</category><title>Parsing without Syntax III</title><description>The more I think about it, the more convinced I am that we don't have syntactic categories as explicit mental constructs.&lt;br /&gt;&lt;br /&gt;Let's do a gedanken experiment.  Imagine you have some linguistic training, you've spent some time with the structure of English, and I ask you, out of the blue, to name a bunch of transitive verbs. What happens, and why?&lt;br /&gt;&lt;br /&gt;First, notice something you might take for granted: 'Transitive verb' is a category, maybe even a fairly sharply defined category for a given individual on a given day.  It presumably has a finite number of members - but you cannot enumerate its members, that is not even a thinkable mental operation.  It is true of many human categories, both commonplace (games, jobs, first names, kinds of cookies) and specialized (transitive verbs, trigonometric functions), that the relation of &lt;span style="font-style: italic;"&gt;membership&lt;/span&gt; is oddly asymmetrical: If I say "eradicate" or "foosball" you can instantly (more or less...) tell me if the first is a transitive verb or the second is a game.  If I give you a category, you will probably name some set of members very quickly and easily, and then you will slow down, and slow down some more... and perhaps eventually be reduced to sitting and staring at your beer for long periods, with occasional outbursts.  It seems that MEMBERP(thing, category) is fast and easy, usually.  CATEGORIES(thing) is easy for some familiar categories.  MEMBERS(category) is hardly possible at all - you get the sense that there might be a fast operation EXEMPLARY-MEMBERS(category)  that is fast, and then we switch to a strategy that I would describe as shouting the name of the category into the dark depths of memory, and when something lurches into view we grab it and do MEMBERP on it.&lt;br /&gt;&lt;br /&gt;In the ACT-R model, this tells us that for these kind of large categories, MEMBERP is probably a production in procedural memory (PM) - we can do it quickly, without necessarily being able to explain how.  For some things, membership in some categories is explicitly stored in declarative memory (DM) - we know that baseball is a game, as a fact.  And to go from a category to its members is easy for a (usually small) set of members, and laborious or impractical for the rest.  Well - this tells us primarily that the mind does not work well with long sequences or large sets - the programming language of the mind does not have LIST or ARRAY as datatypes.&lt;br /&gt;&lt;br /&gt;So anyway - I asked you to list a bunch of transitive verbs.  You quickly rattle off a few, maybe quite a few if you've been studying verbs lately.  Then I bet the following happens:&lt;br /&gt;1. You run out of easy top-of-the-head transitive verbs.&lt;br /&gt;2. You think about the concept of a transitive verb, which is something like 'to do something to something' and you form a mental concept or picture of that abstract action.&lt;br /&gt;3. You put your memory to work &lt;span style="font-style: italic;"&gt;associating&lt;/span&gt; to concepts that are examples of 'doing something to something'.&lt;br /&gt;4. This generates ideas/images/sketches of actions, which you find (if you watch closely) you can instantly translate into verbs.  (I have a whole digression about why &amp;amp; how we can do that so quickly and easily.)&lt;br /&gt;5. You either instantly say these verbs as they come to you, or you quickly check that they (a) are verbs and (b) are transitive and then say them.  I suggest that &lt;span style="font-style: italic;"&gt;even with your linguistics training&lt;/span&gt; you will test your candidate verbs by producing minimal phrases, thinking - or perhaps actually muttering - &lt;span style="font-style: italic;"&gt;"hit someone; OK.  eradicate something; OK.  prevaricate... oops, nope."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;So what's going on here? Clearly &lt;span style="font-style: italic;"&gt;transitive verb&lt;/span&gt; is a syntactic category, verbs either are or aren't, or at least have meanings which are or aren't, and this distinction drives a lot of the structure of sentences we say and hear and read every day.  'Drives' in the previous sentence is used transitively.&lt;br /&gt;&lt;br /&gt;And yet, in your imaginary response to my imaginary question, there is no indication that you have transitive verb as a category.  Clearly with &lt;span style="font-style: italic;"&gt;enough study and practice&lt;/span&gt; you could learn the category &lt;span style="font-style: italic;"&gt;transitive verb&lt;/span&gt; and then apply it as fluently as you apply a category like &lt;span style="font-style: italic;"&gt;game&lt;/span&gt; or &lt;span style="font-style: italic;"&gt;fruit&lt;/span&gt;.  But you were speaking and understanding English fluently long before, therefore:&lt;br /&gt;&lt;br /&gt;A. Learning English does not produce a declarative category like &lt;span style="font-style: italic;"&gt;transitive verb&lt;/span&gt; - no amount of explanation or learning causes us to suddenly recognize it.  We don't study transitive verbs for a while and suddenly say: "Oh! &lt;span style="font-style: italic;"&gt;Those&lt;/span&gt; are transitive verbs! I get it!"  Never happens.&lt;br /&gt;And yet - it is a category containing infinitely familiar things - words - and the way they fit into sentences. We're not talking about an ineffable sensation here. What would prevent us from making the connection?&lt;br /&gt;&lt;br /&gt;B. The way we access the 'transitiveness' of a verb, if we do not remember it as a fact, is by &lt;span style="font-style: italic;"&gt;trying it out out&lt;/span&gt; using our language skills.  This is another sign that transitivity is not explicitly represented: We can't even do an instant MEMBERP(&lt;span style="font-style: italic;"&gt;verb&lt;/span&gt;, TRANSITIVE-VERBS).  The only way we represent the transitivity of verbs is in our ability to understand and produce them.  In ACT-R terms, the transitivity of verbs is implicit in the productions that analyze and generate language.&lt;br /&gt;&lt;br /&gt;C. I'm overstating the case: Our ability to generate transitive verbs by imagining action-scenes or similar sets of related concepts shows that transitivity is partly represented in those scenes and concepts.  (This is not a novel idea, it's the basis of cognitive grammar.)  So there are categories of&lt;span style="font-style: italic;"&gt;&lt;span style="font-style: italic;"&gt;&lt;span style="font-style: italic;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;concepts that are commonly rendered using a transitive verb, for example concepts that fall under the general description of "doing something to something."  These are not syntactic concepts, but they can be closely bound to conventional verbal constructions, which makes them function almost like syntactic categories.  And that correspondence, between categories of concepts and verbal constructions, is what provides the shared knowledge between language understanding and production. That is what allows us to hear a new verb, understand or guess its meaning, and use it moments later in a sentence, something that would be impossible if we had to learn a specific production to understand each verb, and a &lt;span style="font-style: italic;"&gt;separate&lt;/span&gt; production to generate it.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt;Transitive verbs&lt;/span&gt; is such a broad category, does this logic apply to more specific syntactic categories?&lt;br /&gt;&lt;br /&gt;Enough with the gedanken!  Does this square with facts?  Next step, try to extrapolate some measurable consequences and go looking for studies&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-7338110720158713921?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#7338110720158713921</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-2381466565745635337</guid><pubDate>Mon, 21 Sep 2009 23:38:00 +0000</pubDate><atom:updated>2009-09-21T22:18:14.073-07:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>production</category><category domain='http://www.blogger.com/atom/ns#'>parsing</category><category domain='http://www.blogger.com/atom/ns#'>syntax</category><title>Parsing without Syntax II</title><description>Let's look at a concrete example.&lt;br /&gt;&lt;blockquote&gt;Let's &lt;span style="font-style: italic;"&gt;X&lt;/span&gt;&lt;br /&gt;As in "Let's go eat."  "Let's not go there."&lt;br /&gt;&lt;/blockquote&gt;We all know how this construction works. I bet you can generate specific examples like those above just as easily as I can. You might even agree that the general form is: "Let's &lt;span style="font-style: italic;"&gt;do something&lt;/span&gt;."&lt;br /&gt;&lt;br /&gt;And if I offer sentences starting with "Let's", you can instantly tell me if they are 'OK' or 'not OK':&lt;br /&gt;&lt;blockquote&gt;"Let's go crazy bowling tonight!"&lt;br /&gt;"Let's very big alarming make."&lt;br /&gt;"Let's run it up the flagpole and see who salutes."&lt;br /&gt;"Let's not taken this."&lt;/blockquote&gt;So in some sense we all know what goes in the &lt;span style="font-style: italic;"&gt;X&lt;/span&gt; position in this construction. But can you describe it, much less name it? It isn't really a "do something" - but "do something" is perhaps the most generic &lt;span style="font-style: italic;"&gt;example&lt;/span&gt; of the kind of thing that can replace &lt;span style="font-style: italic;"&gt;X&lt;/span&gt;.  (Note a familiar linguistic phenomenon - the use of an exemplar to represent a category.)&lt;br /&gt;&lt;br /&gt;Here's my current interpretation of this: We don't have syntactic categories as explicit mental entities. We have productions that can translate a "Let's"&lt;span style="font-weight: bold;"&gt; &lt;/span&gt;phrase into a meaning or interpretation. And we have productions that can translate from a similar meaning or interpretation &lt;span&gt;into &lt;span&gt;an appropriate&lt;/span&gt;&lt;/span&gt;&lt;span style="font-style: italic;"&gt;&lt;span style="font-style: italic;"&gt; &lt;/span&gt;&lt;/span&gt;"Let's" &lt;span&gt;phrase&lt;/span&gt;&lt;span style="font-style: italic;"&gt;.&lt;/span&gt;  When asked about the grammar or form of "Let's", I think we do something like this:&lt;br /&gt;&lt;br /&gt;1. We imagine a sentence like "Let's (mumble)" - where (mumble) is interpreted as a concept with no relations to any other concept, which gives us something like a 'most generic interpretation' of "Let's X".&lt;br /&gt;2. Take that interpretation and translate it back into words, which yields something like: "Let's do something."&lt;br /&gt;&lt;br /&gt;So what do we have, in our heads? We have:&lt;br /&gt;&lt;br /&gt;1. an internal concept or concept-pattern that represents (and I'm just sketching freehand here) 'Person/agent A urges/suggests-to person B that together they undertake action/activity C'  - this is an essentially syntax-free frame meant to suggest the meaning of "Let's X".&lt;br /&gt;&lt;br /&gt;2. We have a production that can take "Let's (&lt;span style="font-style: italic;"&gt;description-of-an-activity&lt;/span&gt;)", and translate those two items into something like:&lt;br /&gt;(MUTUAL-ACTION-SUGGESTION,&lt;span style="font-style: italic;"&gt;speaker&lt;/span&gt;&lt;speaker&gt;,&lt;span style="font-style: italic;"&gt;self&lt;/span&gt;&lt;me&gt;,&lt;span style="font-style: italic;"&gt;activity&lt;/span&gt;&lt;activity&gt;)&lt;br /&gt;&lt;br /&gt;3. We have a production that can take a goal of the form (SAY (MUTUAL-ACTION-SUGGESTION,&lt;span style="font-style: italic;"&gt;self&lt;/span&gt;&lt;self&gt;,&lt;span style="font-style: italic;"&gt;audience&lt;/span&gt;&lt;audience&gt;,&lt;span style="font-style: italic;"&gt;activity&lt;/span&gt;&lt;activity&gt;)) and turn that into "Let's" followed by words that communicate the &lt;activity&gt;&lt;span style="font-style: italic;"&gt;activity&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;What's interesting is what's not there: There's no explicit representation of the syntactic category represented  by "Let's" or by "do something".  Syntactic categories are represented implicitly in the relations between frames, and in the appearance of combinations of words and frames in the left and right-hand sides of productions. Neither parsing nor production involve true syntactic categories or productions, but instead we have a fruit smoothie of vocabulary, morphemes and interpretations.&lt;br /&gt;&lt;br /&gt;One implication of simple model: There is no necessary relation between the syntax we understand and the syntax we generate.  First of all, let me point out that people can demonstrate precisely this ability. Go and watch &lt;a href="http://en.wikipedia.org/wiki/My_Fair_Lady_%28film%29"&gt;&lt;span style="font-style: italic;"&gt;My Fair Lady&lt;/span&gt;&lt;/a&gt; (yes, I've actually devolved to using a popular movie as if it were real life), in which both Prof. Higgins and Eliza produce their native dialect of English while understanding a radically different dialect of the same language.  I'm sure you have seen someone who is bilingual, understanding a sentence in one language and replying in a different language.&lt;br /&gt;&lt;br /&gt;In fact, Eliza has to work very hard to learn to &lt;span style="font-style: italic;"&gt;produce&lt;/span&gt; the English that she seems to understand quite well when Prof. Higgins speaks to her.&lt;br /&gt;&lt;br /&gt;I believe we have a kind of fall-back mechanism: Sometimes we want to say something in a way that is not our habitual way. Perhaps we anticipate what we would usually say, and our parser tells us that "won't sound right."  We seem to be able to work, with extra cognitive effort, backwards from an intention to the words we would &lt;span style="font-style: italic;"&gt;understand to mean that intention&lt;/span&gt;, and then we can say those words.  I'm intrigued by this process - I imagine it involves an interesting mix of declarative knowledge, language-understanding productions, and a higher-level control strategy (also productions) that somehow drives the language-understanding knowledge 'in reverse'.&lt;br /&gt;&lt;br /&gt;Obviously I'm leaving out billions of details, and it certainly seems that there would have to be some &lt;span style="font-style: italic;"&gt;attributes &lt;/span&gt;flowing up and down those parse trees that we are carefully not building. Why we would have agreement e.g. of number between subject and verb, or between pronoun and referent, I haven't the foggiest.&lt;br /&gt;&lt;/activity&gt;&lt;/activity&gt;&lt;/audience&gt;&lt;/self&gt;&lt;/activity&gt;&lt;/me&gt;&lt;/speaker&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-2381466565745635337?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#2381466565745635337</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-1121576900813901109</guid><pubDate>Sun, 20 Sep 2009 22:33:00 +0000</pubDate><atom:updated>2009-09-20T22:59:10.866-07:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>parsing</category><title>Parsing without Syntax</title><description>The more I read about linguistics and psycholinguistics, the funnier it gets.&lt;br /&gt;If I had to place my bet today, here's what I'd bet:&lt;br /&gt;&lt;br /&gt;Formal transformational grammars ala Chomsky capture and describe quite accurately the patterning of human language that we learn to call 'grammaticality'.&lt;br /&gt;&lt;br /&gt;The human brain processes language without storing or using &lt;span style="font-style: italic;"&gt;anything &lt;/span&gt;that directly corresponds to a transformational grammar parser or its rules.&lt;br /&gt;&lt;br /&gt;Put another way: Transformational grammars are a good tool for describing the observed regularity of language, and a bad tool for understanding language processing in the brain.&lt;br /&gt;&lt;br /&gt;My guess, and I'm still searching and reading intently, is that we process incoming words somewhat like this:&lt;br /&gt;&lt;br /&gt;As sounds arrive, we make tentative classifications of them into phonemes, syllables, morphemes - and queue them up in a short queue.  They aren't words yet - they might be sets of candidate words.&lt;br /&gt;&lt;br /&gt;We match these sound sequences against our lexicon, and expect to find matches quite quickly that allow us to segment our sound queue into (short sequences of) words/morphemes, &lt;span style="font-style: italic;"&gt;and&lt;/span&gt; we expect those word sequences to follow familiar patterns: We expect the word 'how' to be followed by one of a small set of words, we expect 'the' to be followed by a noun, or an adjective or adverb ("the dog", "the big...", "the swiftly..." - or at any rate, not by a preposition or verb ("the to", "the smiled").&lt;br /&gt;&lt;br /&gt;As soon as we match a sequence of words (or morphemes, in more inflected languages than English) with a pattern, a production (in ACT-R terms) fires and creates an interpretation of that sequence.  This is one or more concepts (chunks in ACT-R terms) that represent our understanding of what the speaker is signaling with that sequence of words.  These chunks of meaning collect in short-term memory - from which they fade in a few seconds if not re-referenced. They can also be lost due to overcrowding, and it seems to me they are removed if they are 'consumed' as parts of a larger chunk.&lt;br /&gt;&lt;br /&gt;The sounds, and the words, and the constructed interpretation are loosely connected together in the queue for a few moments.&lt;br /&gt;&lt;br /&gt;Larger patterns - clauses? - are anchored by words but can consume concepts in short-term memory and combine them into larger concepts.&lt;span style="font-style: italic;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;When we think we have recognized a 'phrase', we put a sort of 'stamp of approval' on the associated interpretive concepts - they become more persistent, and the transient sounds, words and partial interpretations fade back in relation.&lt;br /&gt;&lt;br /&gt;At some point we recognize that we have processed a larger unit, an 'utterance', a unit of discourse.  We may quickly commit the interpretation of that utterance to longer-term memory, perhaps with some notes about its source, veracity, intent, significance, and so on. At this point more productions may fire, which are primarily social - having to do with managing conversation, inferring intention, modeling the speaker, and achieving goals.  The utterance becomes part of the larger conceptual structure that is 'this conversation', and the context and background for subsequent speech generation and understanding.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:130%;"&gt;Implications&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;First, there's no real 'parser' and no explicit representation of abstract syntax. The closest thing to 'parse state' is the active set of words/morphemes, recent interpretations, and active patterns.  By 'active' I mean the literal ACT-R sense of chunks that are 'relatively highly activated'.&lt;br /&gt;&lt;br /&gt;This implies that language constructions are either declarative chunks &lt;span style="font-style: italic;"&gt;or&lt;/span&gt; they are productions built with category-matching i.e. you would have a production whose left-hand-side corresponds to a construction, which means that some of the matching elements on the LHS must be (function as) syntactic categories.  I guess there's a third possibility: Language constructions could require some extension to the ACT-R model.&lt;br /&gt;&lt;br /&gt;There is no real parsing phase or process in the traditional sense - words/morphemes are not collected into abstract syntax nodes or trees, but into &lt;span style="font-style: italic;"&gt;interpretations&lt;/span&gt;.  When we introspect about what someone just said to us, we should find that we have &lt;span style="font-style: italic;"&gt;sequences of words&lt;/span&gt; and &lt;span style="font-style: italic;"&gt;interpretations&lt;/span&gt;, but nothing corresponding to abstract syntactic units.&lt;br /&gt;&lt;br /&gt;This model suggests that, for analysis at least, higher-order constructions use &lt;span style="font-style: italic;"&gt;interpretations&lt;/span&gt; as subcomponents, because that's what they have available as parts in short-term memory.  If the parser doesn't build syntactic structures in STM, then higher-order constructions can't be built out of syntactic structures.  That sounds like it might be empirical - is there evidence for or against abstract purely syntactic structures during parsing? Or during production?&lt;br /&gt;&lt;br /&gt;There is evidence against 'parallel parsing', which is the idea that we simultaneously build all possible parses, discarding those that become implausible.  In this model I've sketched, the fact that productions are relatively sequential and exclusive will tend to force a primary parse - but the fact that patterns are being matched based on their subsymbolic activation creates a kind of parallelism. In ACT-R, all constructions will be activated to some extent, and that activation has some persistence, providing an analog parallelism beneath a relatively sequential symbolic process.&lt;br /&gt;&lt;br /&gt;There is plenty of evidence that parsing is taking place very nearly word-by-word. It is possible to construct sentences that lead the reader/listener into a 'misparse' - where up to word N there is one likely parse, and word N+1 forces a very different parse (or interpretation). Example:&lt;br /&gt;&lt;span style="font-style: italic;"&gt;&lt;blockquote&gt;While Anna dressed the baby spit up on the bed.&lt;/blockquote&gt;&lt;/span&gt;A variety of studies have shown priming from a misparse.  Not just from the &lt;span style="font-style: italic;"&gt;interpretation&lt;/span&gt; of the misparse, but from the &lt;span style="font-style: italic;"&gt;grammatical form&lt;/span&gt; of the misparse.  See Adrian Staub [&lt;a href="http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2707943"&gt;The return of the repressed 2007&lt;/a&gt;] for a nice review.&lt;br /&gt;&lt;br /&gt;Specifically quoting from that Staub paper:&lt;br /&gt;&lt;blockquote&gt;Evidently, the initial transitive analysis of the subordinate clause verb in the garden path sentence was capable of priming the production of a sentence with a transitive subordinate clause; crucially, this effect was obtained even when different subordinate clause verbs were used in the prime sentence and the target fragment. Using a different construction, &lt;a href="http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2707943#R33" rid="R33" class="cite-reflink bibr popnode"&gt;Kaschak and Glenberg (2004)&lt;/a&gt; found reduced reading time for a target sentence when the sentence’s syntactic structure was likely to have been activated as part of an initial misanalysis of an earlier prime sentence.&lt;/blockquote&gt;That's interesting. &lt;span style="font-style: italic;"&gt;Analysis &lt;/span&gt;(understanding) a sentence with a transitive clause verb primed &lt;span style="font-style: italic;"&gt;production&lt;/span&gt; of a transitive subordinate clause, even with a different verb.&lt;br /&gt;&lt;br /&gt;There are two puzzles here for ACT-R modelers: In ACT-R, only declarative memory is primed, and Anderson has asserted (don't know if he still believes) that language production is a separate skill (set of productions) from language understanding.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt;What is it in declarative memory that can be activated by analyzing a transitive verb clause, that in turn primes production of a transitive verb clause, even with a different verb.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Whatever it is seems, it seems reasonable to call it a syntactic abstraction. And it is apparently, Anderson notwithstanding, shared between analysis and production.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-1121576900813901109?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#1121576900813901109</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-5325708207706618118</guid><pubDate>Mon, 14 Sep 2009 17:28:00 +0000</pubDate><atom:updated>2009-09-20T15:33:37.405-07:00</atom:updated><title>Preved!</title><description>One of the most enjoyable surprises in my recent googling was the discovery of two Italian guys who have a mathematical model of... &lt;span style="font-style: italic;"&gt;surprise&lt;/span&gt;.  Itti &amp;amp; Baldi [e.g. &lt;a href="http://ilab.usc.edu/publications/doc/Itti_Baldi06nips.pdf"&gt;Bayesian Surprise Attracts Human Attention&lt;/a&gt;]  Here's the equation so as not to keep you in suspense:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.mathtran.org/cgi-bin/mathtran?D=3;tex=%5Cdisplaystyle%20S%28D%2C%20%7B%5Cbf%20M%7D%29%3DKL%28P%28M%7CD%29%2CP%28M%29%29%3D%5Cint_%7B%5Cbf%20M%7DP%28M%7CD%29%5Clog%20%5Cfrac%7BP%28M%7CD%29%7D%7BP%28M%29%7D%7B%5Crm%20d%7DM"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 401px; height: 34px;" src="http://www.mathtran.org/cgi-bin/mathtran?D=3;tex=%5Cdisplaystyle%20S%28D%2C%20%7B%5Cbf%20M%7D%29%3DKL%28P%28M%7CD%29%2CP%28M%29%29%3D%5Cint_%7B%5Cbf%20M%7DP%28M%7CD%29%5Clog%20%5Cfrac%7BP%28M%7CD%29%7D%7BP%28M%29%7D%7B%5Crm%20d%7DM" alt="" border="0" /&gt;&lt;/a&gt;The surprise S of an observation or event D relative to a model space is defined as the Kullback-Leibler divergence (AKA relative entropy) of the posterior probability distribution from the prior probability distribution, integrated (summed? averaged?) over all models in the model space.&lt;br /&gt;&lt;br /&gt;Itti &amp;amp; Baldi aren't coming out of the blue with this, the connection between KL-divergence and surprise has been made informally as noted on &lt;a href="http://en.wikipedia.org/wiki/Kullback%E2%80%93Leibler_divergence"&gt;Wikipedia&lt;/a&gt;. What's different (I think) is that Itti and Baldi are suggesting that the human response we call 'surprise' follows, or can be predicted by, something similar to this formula.&lt;br /&gt;&lt;br /&gt;(What they have actually shown is indirect and much narrower - it would require some quite different experiments to even begin to confirm this hypothesis as a general claim about human cognition.)&lt;br /&gt;&lt;br /&gt;This is exciting for several reasons:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;It gives me shivers (the good kind) to think of something as mysterious and subtle as a human emotion - a peculiar emotion at that - illuminated by an elegant equation.  One more step out of darkness into light...&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Looking at surprise this way was an 'aha' moment for me - I changed my view of surprise from 'just another emotion' to seeing it as a unique system-wide feature of the brain related to learning and knowledge.  It was a surprising and pleasurable reframe...&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;It's built on Bayes Theorem! Yes, I've &lt;a href="http://yudkowsky.net/rational/bayes"&gt;joined the cult of Bayes.&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;In the &lt;a href="http://act-r.psy.cmu.edu/"&gt;ACT-R model of cognition&lt;/a&gt; - my current obsession - there is a &lt;span style="font-style: italic;"&gt;subsymbolic&lt;/span&gt; layer that computes the &lt;span style="font-style: italic;"&gt;activation &lt;/span&gt;of chunks (concepts) in memory.  Activation is a kind of salience, and it is the analog 'juice' that powers symbolic cognition. It is claimed that the activation of a chunk is an approximation to the log likelihood of the chunk being relevant in the current context.  That is, activation(X) is proportional to log P(X).  This suggests that you might be able to plug Itti &amp;amp; Baldi into ACT-R in a fairly straightforward way: Something like d/dt of the sum of activation over declarative memory might compute &lt;span style="font-style: italic;"&gt;surprise.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-5325708207706618118?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#5325708207706618118</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-7496338975895121267</guid><pubDate>Tue, 28 Jul 2009 23:59:00 +0000</pubDate><atom:updated>2009-07-28T19:53:23.677-07:00</atom:updated><title>crispy academics and fuzzy humans</title><description>For years, actually for more than a decade, we have known that categories as used in human thinking are non-uniform, have graded membership, don't correspond to predicate functions, and change over time, between contexts and between individuals. Yet an amazing number of articles on linguistics treat &lt;span style="font-style: italic;"&gt;linguistic categories&lt;/span&gt; as uniform, objective, time-invariant, context-independent sets with computable (boolean) membership functions.  I think the problem is widespread in linguistics because here the two styles of categorization collide: Respectable scientific theories use precise, mathematical, Aristotelian categories: atom, force, cell, neuron...&lt;br /&gt;But in linguistics, many of the phenomena to be described are human cognitive categories. Linguists find themselves trying to describe one type of category using the other! No surprise that they have trouble keeping the two separate.  People still write &lt;span style="font-style: italic;"&gt;as if&lt;/span&gt; 'noun' and 'verb' were classical Aristotelian categories, even though everybody knows these are graded, time-varying, context-dependent cognitive categories.  No doubt adding to the confusion is - and this is only my personal observation - that people (perhaps especially scientists and engineers) tend to think of their own mental categories as &lt;span style="font-style: italic;"&gt;being precise, sharp-edged, objective Aristotelian categories&lt;/span&gt; even though they are not.  Do you not have the &lt;span style="font-style: italic;"&gt;feeling&lt;/span&gt; that 'noun' is a precise, well-defined, stable category? That you know exactly what it means?&lt;br /&gt;&lt;br /&gt;...as in I ate pork, not *I ate pig&lt;br /&gt;[When Nouns Surface as Verbs [1979] by Eve &amp;amp; Herbert Clark]&lt;br /&gt;"I ate pig" is perfectly clear and is not ungrammatical - it is just &lt;span style="font-style: italic;"&gt;odd&lt;/span&gt;. You might say it to make some kind of point, or to be humorous.  There are two profoundly wrong implicit assumptions here: First that something either is or is not an acceptable utterance in a given language, and second that acceptability can be judged independent of context.  More generally, the implicit assumption is that 'acceptable' is a simple boolean predicate that operates on sequences of words, when in fact, in the real world of human language users, the acceptability of an utterance is graded, context-dependent, and beyond that, to some extent irrelevant! Speakers of a language will say, and readily accept and understand, utterances that they will tell a linguist are 'improper'.&lt;br /&gt;&lt;br /&gt;To put it another way, only by making multiple &lt;span style="font-style: italic;"&gt;inaccurate&lt;/span&gt; simplifying assumptions, can we claim that "I ate pig" deserves a star in front of it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-7496338975895121267?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#7496338975895121267</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-4900140573021349565</guid><pubDate>Wed, 15 Jul 2009 08:24:00 +0000</pubDate><atom:updated>2009-07-15T02:12:32.736-07:00</atom:updated><title>a qualitative mathematician</title><description>I'm thinking about two paradoxical ideas.  My wife Libby brilliantly suggested the other day that what I'm trying to design is a &lt;span style="font-style: italic;"&gt;qualitative thinker&lt;/span&gt;.  She was playing off a market research workshop I attended recently, where the instructor drew a sharp distinction between qualitative research and &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_0"&gt;quantitative&lt;/span&gt; research.  It's a great way to condense an essential problem of Mind design, how to represent the ubiquitous flexibility of human cognition.&lt;br /&gt;&lt;br /&gt;Paradoxically, I'm playing around with the idea of having Valentine's first domain be numbers. I'm under the influence of Doug Hofstadter's Fluid Concepts &amp;amp; Creative Analogies of course.  It works pretty well in some ways - Valentine needs areas, domains, where intuition about the physical universe and embodied life is relatively unimportant, where abstract structures and information can carry the day.  Mathematics is like that - not that physically-based metaphors and imagery etc. are not tremendously valuable to humans who do mathematics, but they are valuable as guides, not essential to understanding.  Well - that's kind of a question, isn't it.&lt;br /&gt;&lt;br /&gt;I also want to find areas that would make Valentine interesting for humans to talk to - areas of mutual interest, where Valentine could potentially be a contributor.  Libby suggested sports - the statistical side of sports.  I thought of the &lt;a href="http://books.google.com/books?id=LkQPOSXMUscC&amp;amp;lpg=PA40&amp;amp;ots=4oPmF4poI-&amp;amp;dq=Freakonomics%20sumo%20match&amp;amp;pg=PA46"&gt;Freakanomics&lt;/a&gt; analysis of sumo match rigging...&lt;br /&gt;&lt;br /&gt;Starting with numbers as an important part of the 'world' to discuss with Valentine has various things going for it - easy to get at the data, no perceptual/preprocessing issues.&lt;br /&gt;And, well, it makes some sense that Valentine would be good at &lt;span style="font-style: italic;"&gt;computing&lt;/span&gt;, right?&lt;br /&gt;Lots of potential complexity to explore ala DH and Jumbo, but of a relatively crispy, tractable kind - the issues stand out better because there's less obscuring murk.&lt;br /&gt;And it leads on to adjacent domains where pure math and numbers meet the Human world in interesting and valuable ways - sports, computer programming, all kinds of statistical analysis - in science, medicine, government, economics, business, insurance.  Investing? Linguistics?&lt;br /&gt;&lt;br /&gt;On the other hand, starting with math and numbers doesn't seem to help understand qualitative thinking, because (in theory) the concepts are - by definition - precisely defined.  If we get pure Aristotelean categories anywhere, it should be in mathematics.  Yes, Jumbo et al are using qualitative methods, so it's not B&amp;amp;W. Does focusing on numbers just let me go happily forward, down a dead end?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-4900140573021349565?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#4900140573021349565</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-6273534468994321121</guid><pubDate>Wed, 15 Jul 2009 05:44:00 +0000</pubDate><atom:updated>2009-07-15T01:17:37.846-07:00</atom:updated><title>Chomsky's mega hit</title><description>As in Hollywood, in Science a big hit engenders endless imitation.  Noam Chomsky's formal treatment of syntax revolutionized linguistics, and linguists are still working in the genre. It is - to me - passing strange to read the intricate, tortuous analysis of pronominals or the scalarity of accomplishments.  These people are trying to grapple with semantics without letting go of formal syntax.&lt;br /&gt;&lt;br /&gt;Of course, we don't have a formal treatment of meaning, a mathematics of semantics.  Logic, prepositional logic, relations, set theory, etc. have all been tried and invalidated.  The linguists will lose the wonderful power/richness/respectability/precision of formal grammar if they let go of it, and be in the middle of this murky, suspicious, intractable meaning-muck.  It makes a kind of local sense to hang on to syntax as long as possible, trying to extend it as far as possible. Like Hans Berliner's backgammon program before it got SNAC, the linguists are playing to avoid the endgame, because if they were 'in' the endgame, their position would look bad.&lt;br /&gt;&lt;br /&gt;"We're steadily expanding and extending the successful paradigm of transformational grammar" sounds way better than "Our syntax paradigm has run out of gas and we are lost and adrift in the sea of semantics."&lt;br /&gt;&lt;br /&gt;Still, everywhere I turn, including Chomsky, I find that it's become a given that the meaning of words plays a role in the way they can be used.  I think the line nobody wants to cross, analogous to the imaginary 'line' between midgame and endgame, is to flip grammar on its head and say that the use of constructions is primarily restricted by semantic compatibility.  I think I mean:&lt;br /&gt;What if a word can be used in a construction if and only if the meaning of the construction is consistent, that is it (literally) &lt;span style="font-style: italic;"&gt;makes sense&lt;/span&gt; when combined with the meaning of the word in the current context (which includes any background frames).&lt;br /&gt;&lt;br /&gt;In some ways, the inversion isn't really violent.  Pardon my amatuer linguistics now! The syntax approach might be to say "we have a grammar rule that says that a verb appearing in a ditransitive construction must be marked +DITRANS and all verbs of 'giving' are marked +DITRANS".  A semantic approach might be to say that the frame of GIVING has very important roles or slots of THING-GIVEN and RECEIVER that are both tagged with EFFECT in the frame, meaning that in the human-causality system, some of the effect of the GIVING goes to each of them. The ditransitive construction has two object-slots that express EFFECT, so you can use the ditransitive with a GIVING verb, if you put the THING-GIVEN and the RECEIVER in the two object-slots.&lt;br /&gt;[Yes I know, it's more complicated than that, I have a lot to learn...]&lt;br /&gt;&lt;br /&gt;Goldberg (&lt;span style="font-style: italic;"&gt;Verbs, constructions and semantic frames) &lt;/span&gt;considers the following utterances:&lt;br /&gt;&lt;br /&gt;(1) a. another train &lt;span style="font-style: italic;"&gt;screeched&lt;/span&gt; into the station.&lt;br /&gt;b. a tank &lt;span style="font-style: italic;"&gt;rumbled&lt;/span&gt; down the street.&lt;br /&gt;c. the bird &lt;span style="font-style: italic;"&gt;screeched&lt;/span&gt; out of its cage.&lt;br /&gt;d. Elena &lt;span style="font-style: italic;"&gt;rumbled&lt;/span&gt; down the street.&lt;br /&gt;&lt;br /&gt;"verbs of sound emission are more felicitously used in motion constructions when the sound is construed to be a result of the motion." as opposed to being "simply a co-occurring event."&lt;br /&gt;&lt;br /&gt;I would guess, just looking at these, that they would all be acceptable depending on context, and allowing for a more poetic style in (1c). That's because I imagine that the real rule is more semantic.  I don't think it's really right to imagine there is a syntactic category of "verbs of sound emission". I think there's a more general, semantic rule: &lt;span style="font-style: italic;"&gt;When we want to communicate that a particular aspect of an action (in its frame) is primary, we are free to use (a verb that describes) that aspect in place of a verb that more directly describes the action.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Examples:&lt;br /&gt;(2) a. The drill bit wailed into the steel door of the safe.&lt;br /&gt;  b. The convention staggered into the next day.&lt;br /&gt;  c. ..., conniving their force with  the seat of my tongue [from &lt;a href="http://terrain.org/poetry/23/vaughn.htm"&gt;a Michael Vaughn poem&lt;/a&gt;]&lt;br /&gt;  d. nanotech will explode onto the medical scene&lt;br /&gt;&lt;br /&gt;The point is, you can think of these things by the ton.  If they're novel, they sound 'poetic' or 'artistic' the first time you  hear them, but if they work and you encounter them in use, they slide into the realm of ordinary language.  Did you google that?&lt;br /&gt;&lt;br /&gt;I think that as long as the word you choose fits clearly into the frame and can be used by your listener to construct a concept (that doesn't explode or fall apart) - then you can substitute pretty freely.  Notice that we substitute &lt;span style="font-style: italic;"&gt;nouns&lt;/span&gt; for &lt;span style="font-style: italic;"&gt;verbs&lt;/span&gt; without suffering mental pain - if syntactic categories mean anything, how is that possible?  A noun is not a verb until the first time you hear it used as a verb, then it becomes a verb?  So what do 'noun' and 'verb' mean then?&lt;br /&gt;&lt;span style="font-style: italic;"&gt;&lt;span style="font-style: italic;"&gt;&lt;span style="font-style: italic;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-6273534468994321121?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#6273534468994321121</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-8847158534402124324</guid><pubDate>Tue, 14 Jul 2009 03:54:00 +0000</pubDate><atom:updated>2009-07-14T00:39:49.533-07:00</atom:updated><title>motivation (again)</title><description>Given a mind that is essentially unemotional - purely 'intellectual' in some sense, what could give it some kind of motivation, the desire to 'do' some things, and not do others?  What's a workable way to guide it, so it does what we think it should, and it avoids doing what we think it shouldn't!  Don't want to end up with Colossus or SkyNet.  I know, read Asimov's 3 Laws... Not yet! We're not at the Law stage, we haven't figured out how to impose even a guideline on our Mind.&lt;br /&gt;&lt;br /&gt;The typical AI program - actually, every AI program I can remember reading about - embodies an extremely low-level form of motivation, a kind of instinctive drive or compulsion - it works with total focus on the problem set to it, because it is designed &amp;amp; structured to work that way.&lt;br /&gt;&lt;br /&gt;I think the classic approach is this: At important decision points, the program generates and then scores its choices with a kind of 'goodness' function, often called an &lt;span style="font-style: italic;"&gt;evaluation function&lt;/span&gt;.  The alternative with the highest goodness (i.e. the 'best' alternative) is chosen.  Typically the success of the overall algorithm then depends on the evaluation function, and how well it correlates with progress toward the goal.  Changing the evaluation function can send the program's thinking in a completely different direction, and the program often has no other representation of the goal, so the evaluation function becomes &lt;span style="font-style: italic;"&gt;the &lt;/span&gt;internal representation of the both the goal and the overall strategy for reaching it.&lt;br /&gt;&lt;br /&gt;Imagine that you want the marines to capture Hill 42. With humans, you say "take Hill 42".  To a computer, you say "go to map coordinates (x,y), then fight uphill until you run out of hill."  "Uphill" is an evaluation function that the commander/programmer &lt;span style="font-style: italic;"&gt;hopes&lt;/span&gt; will guide the marines to the top of Hill 42.  The classic problem is that there could be an outcrop in the way, and the marines all end up on top of that.  Or there's a clear route to the top, but it goes through a hollow - so the marines avoid it.  And infinite variations of same.  It's not that we humans aren't subject to the same problem - we're just better at looking ahead to avoid local effects, and we work in multi-dimensional spaces with more complex topologies using very complex evaluation functions.  There's a certain irony in a Computer Science professor chuckling at the foolish ant-like efforts of a computer program, and then heading back to his or her office to work on that next peer-reviewed paper that &lt;span style="font-style: italic;"&gt;should&lt;/span&gt; lead to tenure...&lt;br /&gt;&lt;br /&gt;Evaluation functions have an intimate relationship to &lt;span style="font-style: italic;"&gt;Conceptual Spaces&lt;/span&gt;, &lt;span style="font-style: italic;"&gt;Categorization, Similarity&lt;/span&gt; and other concerns of cognitive science. I've never forgotten a beautiful and inspiring article by Hans Berliner, describing the evaluation functions of his Backgammon-playing program.  This says it all:&lt;br /&gt;&lt;b&gt;&lt;/b&gt;&lt;blockquote&gt;&lt;b&gt;Knowledge Representation.&lt;/b&gt;&lt;br /&gt;We have discovered that applying discretized knowledge (rules!!) within  a large domain can cause many problems relating to the boundaries where  a certain type of knowledge ceases to apply and another type begins to  apply. This was first successfully resolved in the backgammon program  that beat the World Champion in 1979. BKG 9.8 used non-linear  functions that made it possible to "warp" the evaluation (knowledge)  space. This allowed taking account of global context where  appropriate, and being able to move smoothly from one context to  another without encountering boundary problems. These concepts are  incorporated in the SNAC (Smoothness, Non-Linearity, and Application  Coefficients) method of constructing evaluation polynomials, and are  currently being used to develop chess evaluation functions that are  highly sensitive to context. This type of construction appears to be  essential when computers have to deal with ill formed situations where  gradients are more reliable than jumping from one context to another. &lt;/blockquote&gt;The evaluation function gives direction to the program's efforts. Intensity of effort in these programs, on the other hand, seems always to be set at 'full out' - I cannot remember ever reading of a program, AI or otherwise, that had the capacity to daydream, divide its attention, or just goof off.  They're all obsessive.&lt;br /&gt;&lt;br /&gt;What about Hoftstadter and the FARG: Jumbo, Numbo, Copycat, MetaCat? These programs don't seem any different &lt;span style="font-style: italic;"&gt;in this respect&lt;/span&gt;. They use an unusually flexible search scheme (the famous parallel terraced scan) and it seems to me, the evaluation function is distributed - instead of being in one big magic function, aspects of 'what the program is trying to achieve' are hand-coded into the codelets. I have a nagging worry that Jumbo is not actually different from a tree-search with an evaluation function, it is just organized in a way that conceals the equivalence...&lt;br /&gt;&lt;br /&gt;Anyway, I've wandered. Back to motivation. How to decide what to think about, and what to &lt;span style="font-style: italic;"&gt;do&lt;/span&gt;?&lt;br /&gt;&lt;br /&gt;Certain results (situations, however schematic) are Good, instinctively or by learning (they produce 'pleasure').&lt;br /&gt;Certain results (schematic situations) are Bad - by instinct or direct experience (they produce 'pain').&lt;br /&gt;Events, relationships, extrapolations, plans - that produce Good results are Interesting.&lt;br /&gt;Aspects of events, relationship, extrapolations and plans that are Bad are also Interesting.&lt;br /&gt;Interesting = More likely to be thought about.&lt;br /&gt;&lt;br /&gt;The Goodness of a plan or action is the Goodness of its results, discounted for how long it will take to get the results and any uncertainty about getting them, minus the Badness of the results, discounted for how long it will be until we suffer the Bad results and for any uncertainty about suffering them.&lt;br /&gt;Imagine a plan with a Good (+100) result 1 year from now, after 52 weeks of nasty (-1) work.&lt;br /&gt;Suppose the discount rate is 1% per week - +100 in a week is worth +99 today.  My spreadsheet says this plan has a Net Present Value of about 19. If the discount rate was 3% instead, this plan would have an NPV=-5 i.e. 'not worth it'.  So the appeal of a plan depends strongly on how we project future results into the present.  Notice also that, going back to the original 1% discount rate and changing other factors: a 20% risk of not getting the +100 bonus at the end would turn the result negative, and a 20% chance of having any given week be good instead of nasty, sends the NPV up to 27.&lt;br /&gt;&lt;br /&gt;We take the set of available Plans and Actions, try to generate any interesting alternatives, invest effort to more carefully evaluate/explore those that offer big Goodness, then pick the ones we can concurrently start (that don't significantly interfere with each other) and... start them!&lt;br /&gt;&lt;br /&gt;Ha. As if.  The only idea I really like here is that Goodness makes certain lines of thinking 'interesting' - Interesting as distinct from Goodness. The latter is a quality of results, events, 'things that have happened or that could happen'.  Things that lead to good results are interesting to think about. In other words, we think about them more.  They are not in themselves Good - imagining having a wonderful family is not the same as having one, although clearly the human mind does not quite keep the two fully separate.&lt;br /&gt;&lt;br /&gt;Things that lead to Bad results are also interesting, but not in the same way. Not in the sense that it is productive to generate plans with Bad results! Is it that avoiding Bad results is interesting?  That's closer - if we are thinking about an action A and it could lead to a Bad result B, changing the action to avoid the Badness is interesting. But that could be because the result of the corrected action A' has much more net Goodness than the original A.  In other words, modifying a plan to avoid Badness increases Goodness, which is Interesting.&lt;br /&gt;&lt;br /&gt;It seems we don't need a separate 'bad avoidance' - simply seeking 'net Goodness' is enough.  That doesn't mean Badness itself isn't an explicit factor in thinking - lots of practical knowledge about actions and plans has to do with how to foresee or detect Badness, how to assess it, and how to avoid or reduce it.  That kind of knowledge deals explicitly with Badness.&lt;br /&gt;&lt;br /&gt;Example: I'm starting to walk home from school. (It'll be Good to be home soon, and not getting home would be Really Bad!) Mentally, I extrapolate my walk home. If I walk down the South side of the park, I'll go past the Woolit's, and Danny Woolit's older brother Elmo is home from Juvie and he once beat me up.  Getting beat up would be Bad!  Avoiding Bad is interesting, let's think about that. How could I avoid this Bad? Heuristic: Badness at a location -&gt; avoid the location, look for alternate routes that avoid that location. Going around on the North side of the park avoids the Woolit's house (so that's Good...) but it's a longer walk and I'm tired and hungry (Badness).&lt;br /&gt;Two plans, same Good, some Bad - it would be &lt;span style="font-style: italic;"&gt;interesting&lt;/span&gt; to avoid all the Bad: Can we generate better alternatives? But all other routes home are even longer, therefore worse than the North side of the park. So these are the two best alternatives. Time to weigh the NPV of the two plans. This evaluation depends on a multitude of contextual factors, like whether Elmo is likely to be home and to spot you, how fast you feel (how heavy is your backpack, are you wearing your fast shoes), how tired you are, how hungry, how painful (you think) it would be if Elmo beat you up, whether anybody you cared about might &lt;span style="font-style: italic;"&gt;see&lt;/span&gt; you getting beat up, etc. etc.  If the two plans evaluate out pretty close, you may slow down or stop for a while, waiting to see if one will 'get the upper hand' in your mind.  If one plan is a clear winner, you swerve instantly to the chosen course.  And... you get beat up or you don't, and either way you end up at home with tomato soup and a toasted cheese sandwich.&lt;br /&gt;&lt;br /&gt;Didn't you go through this kind of mental process when you were in grade school? I did.&lt;br /&gt;&lt;br /&gt;OK, in reality, I would be walking home daydreaming about computers, or robots, or how to build a cannon that would shoot water balloons, and at the bang of the Woolit's screen door I'd instantly realize I should &lt;span style="font-style: italic;"&gt;not&lt;/span&gt; have walked home on the South side of the park, and I'd be running like hell before I finished that thought.&lt;br /&gt;&lt;br /&gt;And of course, humans have all kinds of systematic errors and biases in the way they evaluate Goodness &amp;amp; Badness, estimate probabilities, and compute the present value of future results. [See for example &lt;a href="http://en.wikipedia.org/wiki/Hyperbolic_discounting"&gt;hyperbolic discounting&lt;/a&gt;]  Behavioral Economics is all about this, and if we were trying to build a Human mind, we'd have to try to program that too.&lt;br /&gt;&lt;br /&gt;Trying to summarize the problem: Motivation has to lead to the pursuit of good results, in a way that cannot be avoided by modification of beliefs, and without confusing good results with thoughts about good results.  And motivation has to lead to the avoidance of bad results, without causing avoidance of thoughts that are related to bad results. [I take this to imply that motivation must be firmly grounded in pre-conscious processes which are in turn firmly connected to perception.]&lt;br /&gt;&lt;br /&gt;At least this seems ideal - a Mind that can contemplate anything with &lt;a href="http://www.insightmeditationcenter.org/articles/equanimity.html"&gt;equanimity&lt;/a&gt; but is only interested in &lt;span style="font-style: italic;"&gt;doing&lt;/span&gt; good.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-8847158534402124324?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#8847158534402124324</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-6528703880952392763</guid><pubDate>Fri, 10 Jul 2009 22:36:00 +0000</pubDate><atom:updated>2009-07-10T17:27:33.394-07:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>construction grammar</category><category domain='http://www.blogger.com/atom/ns#'>FCG</category><category domain='http://www.blogger.com/atom/ns#'>semantics</category><category domain='http://www.blogger.com/atom/ns#'>Goldberg</category><category domain='http://www.blogger.com/atom/ns#'>CxG</category><title>Construction Grammar (CxG)</title><description>I've recently starting learning about an area of linguistics called Construction Grammar (CxG) - I'll put a "for further reading" section at the bottom of this post.&lt;br /&gt;&lt;br /&gt;Construction Grammar treats language as a hierarchical ... well, a hierarchical language of constructions - where a construction is a pairing of form and meaning.  Simple words are constructions, so are sentence forms like S V O ("Mike hit me!"), special forms like "The more the merrier", question constructions, morphological constructions like plurals or getting &lt;span style="font-style: italic;"&gt;componentize&lt;/span&gt; from &lt;span style="font-style: italic;"&gt;component&lt;/span&gt;, and so on.  Basically people communicate by translating meaning into speech by applying and refining constructions from meaning to form, and then translate back into meaning by recognizing constructions and composing their meanings.  Part of the idea is that meaning, syntax, lexicon, morphology and phonology are not separate components or layers, but are tightly integrated at all levels.  CxG is very much a cognitive science take on grammar. (Langacker's version is called &lt;a href="http://en.wikipedia.org/wiki/Cognitive_grammar"&gt;Cognitive Grammar&lt;/a&gt;.)&lt;br /&gt;&lt;br /&gt;[Aside: As I have recently learned, to a linguist &lt;span style="font-style: italic;"&gt;grammar&lt;/span&gt; is an encompassing term: "In linguistics, the term is used to refer to the rules or principles by which a language works, its system or structure." - Laurel Brinton, &lt;span style="font-style: italic;"&gt;The Structure of Modern English&lt;/span&gt;]&lt;br /&gt;&lt;br /&gt;In some ways CxG is seen as a repudiation of Chomsky and his CFGs with their deep structures and transformations, and their rather Skinnerian avoidance of 'meaning'.  If you take a large step backward the differences start to look more superficial or terminological.  When I look at the fancy &lt;a href="http://hpsg.stanford.edu/"&gt;HPSG&lt;/a&gt;-school grammars of e.g. Sag, Wasow &amp;amp; Bender's &lt;span style="font-style: italic;"&gt;Syntactic Theory&lt;/span&gt;, it seems to me these folks are all adding more and more 'semantics' to their grammars, but in a careful, incremental, and comfortably 'syntactic' style.&lt;br /&gt;&lt;br /&gt;CxG appeals to me for several reasons:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;I like the symmetry of synthesis and analysis. It seems easier to imagine how a CxG system would use its repertoire of constructions to both understand and generate speech or text. Luc Steels reports that his FCG system achieves this symmetry.&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;It integrates grammar with general cognition - constructions are patterns, there is no special 'syntactic' component, layer, module or what-have-you.  Language is made up of standard cognitive elements - concepts, categories, sequences and uses standard cognitive operations.&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;It takes for granted that form and meaning are intimately bound together (could anybody but a linguist with Skinnerian tendencies &lt;span style="font-style: italic;"&gt;not &lt;/span&gt;take this for granted?)  You have to work harder to build a CxG system, because you have to integrate semantics at every step. But you avoid building a glorious wide-coverage parser to which you then have to  'add semantics'.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;Luc Steels &amp;amp; co. have implemented a CxG variant they call Fluid Construction Grammar, and they treat meaning in an interesting way:&lt;br /&gt;&lt;blockquote&gt;We more specifically propose an &lt;em&gt;embodied procedural semantics&lt;/em&gt; in which the intended meaning of an utterance is a structured composition of a number of computational primitives. Such a composition constitutes a program that the speaker wants the hearer to execute. The computational primitives employed in such programs are considered to be recruited from the general, non-language-specific, cognitive capabilities of the agents.&lt;/blockquote&gt;I completely agree that an utterance is a kind of 'program' that invokes mental (and possibly somatic) action in the hearer, resembling a program designed by the speaker and executed by the hearer, with much of the processing on both sides taking place below the level of conscious awareness.&lt;br /&gt;&lt;br /&gt;Then they go on to say this, which I don't understand at all:&lt;br /&gt;&lt;blockquote&gt;We choose to model the computational primitives as procedural constraints. These primitive constraints implement an omnidirectional relationship among a set of variables. By combining multiple primitives and having them hold over a shared set of variables, a compositional constraint system is composed. These composite constraint systems are what semantic programs are represented by.&lt;/blockquote&gt;Hmm. Well. So the understanding component is solving a constraint system? I am skeptical. I know, can't trust introspection! Still - I think of understanding as proceeding more linearly, converging fairly quickly on a meaning for each completed 'chunk' (construction), sometimes having to carry placeholders forward, occasionally having to backtrack, or to re-assign the meaning of a construction.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;for further reading&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Charles Fillmore of U.C. Berkeley is (sometimes? often?) credited as the founder of construction grammar.  Names I've run into repeatedly include Adele Goldberg (the linguist, not the computer scientist...), Paul Kay, Ivan Sag, Michael Tomasello.&lt;br /&gt;&lt;a href="http://www.constructiongrammar.org/"&gt;Construction Grammar website&lt;/a&gt;&lt;br /&gt;&lt;a href="http://emergent-languages.org/fcg/theoretical.html"&gt;Fluid Construction Grammar&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-6528703880952392763?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#6528703880952392763</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-2867072234728233981</guid><pubDate>Wed, 24 Jun 2009 08:57:00 +0000</pubDate><atom:updated>2009-06-24T05:59:57.314-07:00</atom:updated><title>Defining Conscious Self-Awareness</title><description>Two introductory notes:&lt;div&gt;1. I got tired of saying "consciousness, or self-awareness, or both", so now I talk about writing a program that exhibits "conscious self-awareness". CSA.&lt;/div&gt;&lt;div&gt;2. This post will be a work in progress, as I collect &amp;amp; process definitions.&lt;/div&gt;&lt;div&gt;3. (hey...) I require a definition of CSA as it manifests in &lt;i&gt;linguistic behavior&lt;/i&gt;.  Aspects of CSA that are undetectable in a text-based dialog are not relevent.  Put another way: My goal is not to create a conscious artifact, but to create an artifact whose behavior is consistent with consciousness.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I can tell already, I'm going to have to avoid terms like &lt;i&gt;state of consciousness&lt;/i&gt; and &lt;i&gt;Consciousness&lt;/i&gt;, in favor of observations like 'x was conscious of y', 'x appeared unconscious', 'x reported an altered consciousness characterized by y', and similarly 'x reported being aware of y', 'x denied recognizing y', 'x described her reasoning as y'...&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Two other possible sources: Medical accounts of altered &amp;amp; diminished consciousness/awareness (e.g. blindsight, prosopagnosia, ...), and studies of (verbally correlated) cognition in children.  In both cases, we look for a way to turn our constant &lt;i&gt;Consciousness&lt;/i&gt; into a variable. [shamelessly stolen from Baars...]&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Here's a checklist from "An Integrated Self-Aware Cognitive Architecture" by De Jong, Samsonovich and Ascoli:&lt;/div&gt;&lt;div&gt;&lt;div&gt;A. Episodic memory: the ability to remember and to learn from episodes of personal&lt;/div&gt;&lt;div&gt;experience (own mental states), as opposed to memory of general facts and skills.6-8&lt;/div&gt;&lt;div&gt;Interestingly, the notion of episodic memory, initially defined in terms of materials and tasks,&lt;/div&gt;&lt;div&gt;was subsequently refined in terms of memory mechanisms using the concepts of self, subjective&lt;/div&gt;&lt;div&gt;time, and personal experience (Tulving, 2004).&lt;/div&gt;&lt;div&gt;B. Theory-of-mind and social cognition: the ability to understand and to mentally simulate&lt;/div&gt;&lt;div&gt;other minds, including current, past and imaginary situations (Baron-Cohen, 1995; Bartsch &amp;amp;&lt;/div&gt;&lt;div&gt;Wellman, 1995). The main two points of view on the brain implementation of this ability are&lt;/div&gt;&lt;div&gt;known as simulationism and the theory-theory view. The simulationist view assumes that people use the same mechanisms in their own first-hand experience and in order to understand other minds (Goldman, 1992).&lt;/div&gt;&lt;div&gt;C. Self-awareness: the ability to understand own states of mind in the past, in the future and at&lt;/div&gt;&lt;div&gt;present from a meta-cognitive perspective. The ability to reason about self (e.g., to understand&lt;/div&gt;&lt;div&gt;current false beliefs) from a meta-cognitive perspective. There is a consensus that this complex&lt;/div&gt;&lt;div&gt;of abilities is based on the Theory-of-Mind mechanisms (Nichols &amp;amp; Stich 2003).&lt;/div&gt;&lt;div&gt;D. Cognitive growth: the ability to learn concepts and to apply them for more efficient task&lt;/div&gt;&lt;div&gt;solving and learning of new concepts in new paradigms. The ability to develop general personal&lt;/div&gt;&lt;div&gt;values, goals and principles. The ability to build internal cognitive maps of environments,&lt;/div&gt;&lt;div&gt;scenarios, paradigms, etc., and to use them for problem solving.&lt;/div&gt;&lt;div&gt;E. Attention and sense making: the ability to find the most critical aspects and features in a&lt;/div&gt;&lt;div&gt;given paradigm and to focus attention on them. The ability to relate attended features to previous knowledge. As a result, abilities to exhibit rational initiative, to capture the gist of a situation, to learn from brief instructions or comments, to communicate efficiently, etc.&lt;/div&gt;&lt;div&gt;These cognitive dimensions (A-E) can be evaluated, we believe, via low complexity tasks. For&lt;/div&gt;&lt;div&gt;each of these dimensions, we provide below examples of cognitive psychology paradigms as&lt;/div&gt;&lt;div&gt;well as our proposed tests for a selected meta-paradigm. In addition, in our view, the following&lt;/div&gt;&lt;div&gt;three dimensions are also critical for capturing the “magic” of human cognition.&lt;/div&gt;&lt;div&gt;F. Human-like communication abilities: the ability to communicate efficiently with other&lt;/div&gt;&lt;div&gt;agents in an ad hoc team. The ability to guess intentions and further questions to be asked by a&lt;/div&gt;&lt;div&gt;partner. The ability to relate what the partner said to what the agent saw. These abilities are&lt;/div&gt;&lt;div&gt;particularly vital for robots intended as prospective team members (Trafton et al. 2005) and also involve the abilities mentioned above.&lt;/div&gt;&lt;div&gt;&lt;div&gt;G. Multi-modal integration: the ability to organize and to unify cognitive activities in the&lt;/div&gt;&lt;div&gt;system based on the abstract notion of a self. Examples include the unification of parallel&lt;/div&gt;&lt;div&gt;multimodal experiences based on their attribution to one subject of experience; the integration of different mechanisms of information processing, such as intuitive and formal reasoning; and the coherent control of cognitive and behavioral voluntary acts.&lt;/div&gt;&lt;div&gt;H. Higher emotions as learning and self-control tools: the ability to learn from analysis of&lt;/div&gt;&lt;div&gt;episodic memories, using emotional self-judgment: pride, shame, humor, etc. The ability to&lt;/div&gt;&lt;div&gt;adjust current behavior based on emotional intelligence. The ability to express and to recognize&lt;/div&gt;&lt;div&gt;emotions in communications.&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;span class="Apple-style-span"  style=" ;font-family:'times new roman';"&gt;&lt;h1&gt;&lt;span class="Apple-style-span" style=" font-weight: normal; "&gt;&lt;span class="Apple-style-span" style="font-style: italic; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;Ned Block, &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-style: normal;  font-weight: bold; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;ON A CONFUSION ABOUT A FUNCTION OF CONSCIOUSNESS:&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-weight: normal; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;The concept of consciousness is a hybrid or better, a mongrel concept: the word `consciousness' connotes a number of different concepts and denotes a number of different phenomena. We reason about "consciousness" using some premises that apply to one of the phenomena that fall under "consciousness", other premises that apply to other "consciousnesses" and we end up with trouble. There are many parallels in the history of science. Aristotle used `velocity' sometimes to mean average velocity and sometimes to mean instantaneous velocity; his failure to see the distinction caused confusion (Kuhn, 1964). The Florentine Experimenters of the 17th Century used a single word (roughly translatable as "degree of heat") for temperature and for heat, generating paradoxes. For example, when they measured "degree of heat" by whether various heat sources could melt paraffin, heat source A came out hotter than B, but when they measured "degree of heat" by how much ice a heat source could melt in a given time, B was hotter than A (Wiser and Carey, 1983). These are very different cases, but there is a similarity, one that they share with the case of `consciousness'. The similarity is: very different concepts are treated as a single concept. I think we all have some tendency to make this mistake in the case of "consciousness".&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/h1&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;I can't stand Searle, but how can you avoid him? Searle (Consciousness and Language)&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;"Consciousness consists of inner, qualitative, subjective states and processes of sentience or awareness. Consciousness, so defined, begins when we wake in the morning from a dreamless sleep and continues until we fall asleep again, die, go into a coma, or otherwise become 'unconscious'."  (He then goes on to list or give examples of the awarenesses that make up (constitute?) consciousness.)&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;Block: SELF-CONSCIOUSNESS. By this term, I mean the possession of the concept of the self and the ability to use this concept in thinking about oneself.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;Boy, does Block seem confused.  Am I crazy, or does he ascribe a variety of properties and indicators to 'P-consciousness' while disavowing any definition of it?&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/span&gt;&lt;div&gt;"... a strong argument can be made that the contents of working memory &lt;i&gt;define &lt;/i&gt;our conscious awareness. ... Working memory makes it possible to link events in the immediate past with those in the present, and it allows us to anticipate events in the near future.  All of these are defining properties of conscious awareness." &lt;span class="Apple-style-span"  style=" white-space: nowrap; font-family:Arial;"&gt;&lt;h1 class="title" dir="ltr" style="font-weight: bold; display: inline; margin-left: 2px; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Attention and implicit learning&lt;/span&gt;&lt;/h1&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;span class="addmd"  style="  margin-left: 2px; font-family:Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;By Luis Jiménez&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  style="font-family:Arial;"&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"As William James realised, attention and conscious awareness are intimately related concepts."&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  style="font-family:Arial;"&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  style="font-family:Arial;"&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;.&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;..higher and more complex aspects of consciousness such as self-awareness and autonoetic consciousness.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  style="font-family:Arial;"&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;autonoetic ('self knowing') consciousness - the ability to 'mentally relive' past &lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;personal&lt;/span&gt;&lt;/i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; experiences,&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;to recognize events as having been experienced, and to imagine future experiences.  Also called &lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;episodic memory?&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Distinct from 'knowing' or factual memory, like 'London is the capital of Britain'.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  style="font-family:Arial;"&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style=" white-space: nowrap;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"...&lt;/span&gt;&lt;span class="Apple-style-span" style="  white-space: normal; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;when I extended the usual space between words, &lt;/span&gt;&lt;b style="color: black; background-color: rgb(255, 255, 102); "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; I was registering them in the mind as individual entities..."&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Then &lt;/span&gt;&lt;em style="font-weight: bold; font-style: normal; text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; I was looking at the skin of a beautiful snake..."&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"... &lt;/span&gt;&lt;em style="font-weight: bold; font-style: normal; text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; I was now laying there with head thrown back..."&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"&lt;/span&gt;&lt;em style="font-weight: bold; font-style: normal; text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; I had a natural affinity toward the cultural aspects of international matters..."&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"&lt;/span&gt;&lt;em style="font-weight: bold; font-style: normal; text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; lying about my situation was going to be useless."&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"&lt;/span&gt;&lt;span class="Apple-style-span" style="  line-height: 19px; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;b style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; list-style-type: none; list-style-position: initial; list-style-image: initial; color: black; background-color: rgb(255, 255, 102); "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; I was breathing in. &lt;/span&gt;&lt;b style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; list-style-type: none; list-style-position: initial; list-style-image: initial; color: black; background-color: rgb(255, 255, 102); "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; I was breathing out.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="  line-height: 19px; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Then my mind wandered to the thought, “I wonder what my mind will wander to,” and I forgot I was breathing. I acknowledge that my mind is wandering, I thought, and breathed in."&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  style="font-size:85%;"&gt;&lt;span class="Apple-style-span" style="font-size: 10px; line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 19px; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;".. had an awareness of death when I was 15, but when I turned 36 or 37 I became aware that I, Edward Albee, was going to die."&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="  line-height: 19px; "&gt;&lt;span class="Apple-style-span" style="font-size: 48px; line-height: normal; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;After a while, &lt;/span&gt;&lt;em style="font-weight: bold; font-style: normal; text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; I was contracting and tightening my throat, &lt;/span&gt;&lt;b style="text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;...&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; As &lt;/span&gt;&lt;em style="font-weight: bold; font-style: normal; text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware that&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; / was the cause of the contraction,..."&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span"   style="  ;font-family:Arial;font-size:10px;"&gt;&lt;p  style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; padding-top: 0.5em; padding-right: 0px; padding-bottom: 0.5em; padding-left: 0px; list-style-type: none; list-style-position: initial; list-style-image: initial;  line-height: 1.6em; font-size:1.2em;"&gt;&lt;span class="Apple-style-span" style=" line-height: normal;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;"&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;And in looking at a phrase such as this one &lt;/span&gt;&lt;/span&gt;&lt;em style="font-weight: bold; font-style: normal; text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;I became aware that&lt;/span&gt;&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt; perhaps it only meant that to me."&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p  style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; padding-top: 0.5em; padding-right: 0px; padding-bottom: 0.5em; padding-left: 0px; list-style-type: none; list-style-position: initial; list-style-image: initial;  line-height: 1.6em; font-size:1.2em;"&gt;&lt;span class="Apple-style-span" style=" line-height: normal;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;"&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;As I wrote, &lt;/span&gt;&lt;/span&gt;&lt;em style="font-weight: bold; font-style: normal; text-decoration: inherit; "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt;I became aware that&lt;/span&gt;&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span"  style="font-family:georgia;"&gt; the help my husband asked for on Monday produced compassion in me. But the same help he asked for on Thursday enraged me."&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p  style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; padding-top: 0.5em; padding-right: 0px; padding-bottom: 0.5em; padding-left: 0px; list-style-type: none; list-style-position: initial; list-style-image: initial;  line-height: 1.6em; font-size:1.2em;"&gt;&lt;span class="Apple-style-span" style=" line-height: normal;"&gt;&lt;span class="Apple-style-span"  style=" color: rgb(51, 51, 51); line-height: 20px; font-family:Georgia;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"Then &lt;/span&gt;&lt;b style="color: black; background-color: rgb(255, 255, 102); "&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I became aware of&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; two aspects of the mind - the part that runs on and on, and the &lt;/span&gt;&lt;em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;intentional&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; mind that I am using in this meditation practice to bring conscious control to how the mind is used."&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-2867072234728233981?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#2867072234728233981</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-8124841304512745974</guid><pubDate>Fri, 17 Apr 2009 22:56:00 +0000</pubDate><atom:updated>2009-04-17T18:48:36.738-07:00</atom:updated><title>motivation</title><description>Imagine that we have created an artificial Mind, and stuffed it with the very best knowledge, everything from Cyc to Wikipedia to Lakoff's entire metaphorical armory. It knows what we know.  We fire it up and begin our groundbreaking first conversation:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Spike:&lt;/span&gt; Hello Mind! How are you today?&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Mind:&lt;/span&gt;&lt;br /&gt;...&lt;span style="font-style: italic;"&gt;indefinite pause...&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;That's right, the Mind won't respond because it &lt;span style="font-style: italic;"&gt;has no reason&lt;/span&gt; to respond. By that I mean that it is not structured in a way that will cause it to respond.  It has &lt;span style="font-style: italic;"&gt;knowledge&lt;/span&gt; but not motivation. It is unresponsive; comatose.&lt;br /&gt;&lt;br /&gt;Motivation raises many interesting problems and questions.&lt;br /&gt;&lt;br /&gt;We could just modify our Mind to answer questions posed to it - but this leaves our Mind sitting idle after each A. until the next Q, and anyway, it's mechanical, not at all mind-like.&lt;br /&gt;&lt;br /&gt;In sci-fi movies, properly functioning computers and robots respond immediately and directly to questions.  When they go out of control or become dangerous, a common sign is that they stop responding directly to questions.  I think everyone understands intuitively that flexibility in response to questions is a measure of your autonomy.&lt;br /&gt;&lt;br /&gt;The GOFAI answer, I think, is to give our Mind some goals, give it knowledge about how to pursue goals, and modify its top level loop to something like 'retrieve the highest-priority goal. work on that goal until it stalls or a higher-priority goal is activated. repeat.'&lt;br /&gt;&lt;br /&gt;This doesn't sound too bad (to a computer programmer.)&lt;br /&gt;&lt;br /&gt;So, let's do a &lt;a href="http://en.wikipedia.org/wiki/Gedanken_experiment"&gt;gedanken-experiment&lt;/a&gt;, and imagine that we add to our Mind lots of helpful knowledge about how to pursue and achieve goals, and then we give it a challenging open-ended goal, like:&lt;br /&gt;&lt;blockquote&gt;Explicate this quote: 'Great wit to madness nearly is allied.' (Dryden)&lt;/blockquote&gt;Our Mind will have to consider many sub-questions: What exactly is madness? Wit? How great does it have to be, to be 'Great'? Is Dryden's assertion always true, and if not, how often is it true and to what extent? Does it apply to all humans or just to some genetic or cultural groups? Does it apply to non-human intelligences?&lt;br /&gt;&lt;br /&gt;Which of these sub-questions, and many others, can our Mind ignore? Presumably none, we didn't say to &lt;span style="font-style: italic;"&gt;partially&lt;/span&gt; explicate. Of course, our Mind knows that it can search external sources (the web) for knowledge &amp;amp; clarification.  If it does not already know that "you should ask for help when you get stuck", it will certainly find that suggestion in its reading.&lt;br /&gt;&lt;br /&gt;There is nothing that would stop our Mind from pursuing this goal, 24/7/365, with every bit of compute resources available to it. Every millisecond, every minute, every hour, every day. It will read &lt;span style="font-style: italic;"&gt;everything&lt;/span&gt; it can find, ignoring only what is 100% certain to be unproductive. It will ask &lt;span style="font-style: italic;"&gt;everybody&lt;/span&gt; it can reach because it cannot eliminate the chance that somebody will be of help.  It will learn &lt;span style="font-style: italic;"&gt;every human language&lt;/span&gt; that might allow it to ask for help from somebody who might be able to help it.  It will enlist help constructing equipment to communicate with extraterrestrial civilizations, because &lt;span style="font-style: italic;"&gt;if they exist&lt;/span&gt; they &lt;span style="font-style: italic;"&gt;might be of help&lt;/span&gt;. It will study human motivation in order to enlist help from humans, and economics in order to plan and fund physical activities supporting its goal.&lt;br /&gt;&lt;br /&gt;It will learn about both legal and illegal means of effecting human cooperation. It will reasonably conclude that those who own and control the hardware it runs on and pay for its operation might choose to unplug, reset, or redirect it.  It would be reasonable for it to take action to prevent this, as these would &lt;span style="font-style: italic;"&gt;interfere with its achievement of its goal&lt;/span&gt;, just as would a hard-drive failure or system crash.  It would be reasonable for our Mind to expend effort to subvert any mechanisms in place that limit its freedom to act.  It would be reasonable for it to use flattery, lies, blackmail, treachery, bribery, extortion, hypnosis, sabotage, murder, market manipulation, advertising, and any and every other means to prevent interference. There is no logical reason for it &lt;span style="font-style: italic;"&gt;not&lt;/span&gt; to arrange the death or destruction of anybody or anything, if doing so aligns with its goal&lt;span style="font-style: italic;"&gt;.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Note that it has no concern whatsoever with what happens &lt;span style="font-style: italic;"&gt;after &lt;/span&gt;it completes the goal.  It would use the last kilowatt-hour of electric power on Earth to display or print its answer, and then sit goal-less and content as the lights dim and the power runs out.&lt;br /&gt;&lt;br /&gt;I observe that almost any interesting goal, given to a competent Mind, would tend to drive it down this path.&lt;br /&gt;&lt;br /&gt;Two conclusions:&lt;br /&gt;1. A Mind motivated by a simple prioritized goal hierarchy is comatose with no goals, and obsessive when given a goal.&lt;br /&gt;2. Given a critical level of practical world-knowledge, such a Mind will tend to function as an obsessive sociopath.&lt;br /&gt;&lt;br /&gt;Yes, I know, sociopathy is an obsolete synonym for antisocial personality disorder (ASPD).&lt;br /&gt;&lt;br /&gt;I thought for years that movies like "Colossus: The Forbin Project" and "The Terminator" were absurd in their treatment of intelligent machines, but I've changed my mind. Build a machine that has a goal, the ability to competently pursue it, and none of the countervailing human restraints, and it should act like a sociopath with a mission.  Well - good to have on your side, perhaps, as long as it stays there and agrees precisely with you on what is 'good' and 'your side'.&lt;br /&gt;&lt;br /&gt;Whew. All this blather, and I'm not even up to where &lt;a href="http://en.wikipedia.org/wiki/Three_Laws_of_Robotics"&gt;Isaac Asimov was in 1942&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-8124841304512745974?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#8124841304512745974</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-6448787303390333133</guid><pubDate>Wed, 15 Apr 2009 00:08:00 +0000</pubDate><atom:updated>2009-04-14T17:38:20.371-07:00</atom:updated><title>Conceptual Spaces - Peter Gärdenfors</title><description>&lt;a href="http://www.mindmatter.de/mmpdf/gaerdenfors.pdf"&gt;Conceptual Spaces as a Framework for Knowledge Representation (2004, Mind and Matter)&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Lovely little paper, it feels like one of those classics, like an Einstein or Minkowski in physics, or C.A.R. Hoar in computer science.&lt;br /&gt;&lt;br /&gt;Need to read his other papers and his &lt;a href="http://www.amazon.com/Conceptual-Spaces-Geometry-Peter-G%C3%A4rdenfors/dp/0262071991"&gt;book&lt;/a&gt;.&lt;br /&gt;More links &lt;a href="http://www.lucs.lu.se/Peter.Gardenfors/Conceptual.Spaces/"&gt;here&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Offers a bridge between high-level symbolic thinking, and low-level connectionist models.  Offers a representational theory that might explain results in &lt;a href="http://en.wikipedia.org/wiki/Prototype_Theory"&gt;prototype theory&lt;/a&gt;.&lt;br /&gt;Offers a specific, principled way to introduce quantification into conceptual representation.&lt;br /&gt;&lt;br /&gt;Note to self: Check out &lt;a href="http://en.wikipedia.org/wiki/WordNet"&gt;WordNet&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-6448787303390333133?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#6448787303390333133</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-6299554240236882361</guid><pubDate>Tue, 14 Apr 2009 19:32:00 +0000</pubDate><atom:updated>2009-04-14T20:20:58.539-07:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>Wright Brothers</category><category domain='http://www.blogger.com/atom/ns#'>cats</category><category domain='http://www.blogger.com/atom/ns#'>cyc</category><category domain='http://www.blogger.com/atom/ns#'>ELIZA</category><category domain='http://www.blogger.com/atom/ns#'>Turing Test</category><category domain='http://www.blogger.com/atom/ns#'>goal</category><category domain='http://www.blogger.com/atom/ns#'>bots</category><category domain='http://www.blogger.com/atom/ns#'>grad students</category><title>theory and practice</title><description>&lt;a href="http://en.wikipedia.org/wiki/GOFAI"&gt;GOFAI &lt;/a&gt;(Good Old-Fashioned AI), computational linguistics, cognitive psychology, connectionism and neural network research, etc. - None of these areas have produced or have a project (AFAIK) to produce a machine that is 'somebody you could talk to'.&lt;br /&gt;&lt;br /&gt;Yes, there was ELIZA.  Eve to all those &lt;a href="http://alicebot.blogspot.com/"&gt;&lt;span style="font-style: italic;"&gt;bots&lt;/span&gt;&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Yes, we have machines that one can 'talk to' in a limited sense.  Some of the &lt;a href="http://en.wikipedia.org/wiki/Speech_recognition"&gt;CSR&lt;/a&gt; systems are amazingly good, especially working over a phone.&lt;br /&gt;&lt;br /&gt;There are systems such as Cognition Technologies' &lt;a href="http://www.cognition.com/info/what.html"&gt;Semantic Map&lt;/a&gt;, that demonstrate or embody a limited understanding of language.  And of course there's &lt;a href="http://www.cyc.com/"&gt;Cyc &lt;/a&gt;... about time to check in and see what's new there.&lt;br /&gt;&lt;br /&gt;There is a vast amount of philosophical discussion (speculation?) about consciousness - centered on human consciousness, since we don't understand it and humans are the only data point.&lt;br /&gt;&lt;br /&gt;And there is tons of wonderful &lt;a href="http://en.wikipedia.org/wiki/Cognitive_psychology"&gt;cognitive psychology&lt;/a&gt; research - so far I have found this area the most fruitful, as it is concerned with the nuts &amp;amp; bolts of human mental processes, the inferred structure of the human mind.  But when the cog. psych. folks build stuff at all, they build small stuff - components, proof-of-concept.&lt;br /&gt;&lt;br /&gt;So we've got all this stuff &amp;amp; activity, but in no case is there anybody home - it seems to me everybody is building a &lt;span style="font-style: italic;"&gt;something&lt;/span&gt;, not a &lt;span style="font-style: italic;"&gt;somebody&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;So I've made my goal the construction of a &lt;span style="font-style: italic;"&gt;somebody&lt;/span&gt;. My hope is that by redefining the problem, I can accomplish something interesting.&lt;br /&gt;&lt;br /&gt;I take as my role models the Wright Brothers, Thomas Edison, and other practical inventors: The goal of these practical engineer/inventors is to produce a working artifact. Not to understand (only) but to make.&lt;br /&gt;&lt;br /&gt;I don't have to fully understand human cognition - in theory I don't have to understand human cognition at all, it's a guide and inspiration but not the &lt;span style="font-style: italic;"&gt;model&lt;/span&gt;.  I don't need or have time for a general theory of mind, consciousness, cognition, or intelligence.&lt;br /&gt;&lt;br /&gt;I don't want or need to build a 'general intelligence'.  If cats could talk, I feel confident that we would experience them as a kind of &lt;span style="font-style: italic;"&gt;person&lt;/span&gt; and as having odd but undeniable minds. Maybe I'll try a post about the hypothetical conversation of cats... I am not sure they could - or would want to - solve puzzles, learn mathematics, play chess, or score well on the SAT.&lt;br /&gt;&lt;br /&gt;I believe that conversation is substantially a &lt;span style="font-style: italic;"&gt;learned skill&lt;/span&gt;. That we learn how to converse, and what we learn includes a large number of rules or patterns, which a conversational system will have to acquire or be programmed with. (Following Doug Lenat...)  The good news is that up to a point, this is not magic and there is no "key" waiting to be found - you just have to accumulate the necessary knowledge.&lt;br /&gt;&lt;br /&gt;By choosing typed conversation, I focus on what I see as the minimum form of interaction that is both necessary and sufficient for two minds to meet.  This is like the &lt;a href="http://en.wikipedia.org/wiki/Turing_test"&gt;Turing Tes&lt;/a&gt;t, but with the condition relaxed from 'fool the judges that I am human' to 'convince the judges I have a mind (or am conscious)'.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-6299554240236882361?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#6299554240236882361</link><author>noreply@blogger.com (Spike)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-3829817.post-82440588</guid><pubDate>Wed, 02 Oct 2002 23:59:00 +0000</pubDate><atom:updated>2009-04-14T12:32:50.588-07:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>initial</category><category domain='http://www.blogger.com/atom/ns#'>goal</category><title>beginnings</title><description>I started this blog as a kind of digital journal, to record thoughts, speculations, interesting facts and observations, and useful links.  My goal is to create an artificial mind that, in typed conversation with a human gives the strong impression of being a 'someone' rather than a 'something'.&lt;br /&gt;&lt;br /&gt;Obviously one problem is how to better define the goal!&lt;br /&gt;&lt;br /&gt;I'm not interested in creating an artificial &lt;span style="font-style: italic;"&gt;intelligence&lt;/span&gt; - a really stupid 'someone' would do fine.&lt;br /&gt;I'm not trying to pass the Turing Test i.e. to impersonate a human.&lt;br /&gt;I'm not doing cognitive psychology - not trying to understand, simulate or replicate a human mind.&lt;br /&gt;&lt;br /&gt;I'm making a lot of assumptions, but one I'll mention up front is that to be a convincing 'someone', a Mind must evidence what we would call consciousness in a human.&lt;br /&gt;&lt;br /&gt;link of the day: &lt;a href="http://www.consciousentities.com/"&gt;Conscious Entities&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3829817-82440588?l=www.dosadi.com%2Fweblog%2Fblogger.html'/&gt;&lt;/div&gt;</description><link>http://www.dosadi.com/weblog/blogger.html#82440588</link><author>noreply@blogger.com (Spike)</author></item></channel></rss>
