Cognitive Linguistics for AI — Complete Course

How the Human Mind Shapes Artificial Intelligence

A rigorous, applied course bridging cognitive science, linguistics, and AI engineering — from metaphor theory to transformer architectures.

11 Modules
32 Lessons
90+ Key Concepts
Applications
NLP LLMs Frame Semantics Metaphor Theory Prototype Theory Embodied AI Prompt Engineering Cognitive Semantics Transformers AI Ethics Neuro-Symbolic AI Centaur Models Interpretability Cognitive Diversity CMT-CoT Prompting ChatGPT vs Claude vs Gemini Cognitive Engineering
MODULE 01
Foundations of CL & AI
4 lessons
MODULE 02
Embodied Cognition & NLP
3 lessons
MODULE 03
Conceptual Metaphor
2 lessons
MODULE 04
Frame Semantics
2 lessons
MODULE 05
Prototype Theory
2 lessons
MODULE 06
Construction Grammar
2 lessons
MODULE 07
Cognitive Semantics
3 lessons
MODULE 08
Discourse & Pragmatics
2 lessons
MODULE 09
AI Applications
3 lessons
MODULE 10
Frontiers & Future
4 lessons
NEW
MODULE 11
CL in the Era of Modern LLMs
4 lessons
Module 1 · Lesson 1.1

What is Cognitive Linguistics?

Understanding the discipline that changed how we think about language, meaning, and mind — and why it's the key to better AI.

Definition & Origins Core

Cognitive Linguistics (CL) is an interdisciplinary approach to language that emerged in the 1970s–1980s, pioneered by George Lakoff, Mark Johnson, Ronald Langacker, and Charles Fillmore. Unlike formal linguistics (which treats language as an autonomous, rule-based system), CL holds that language is inseparable from cognition.

The central thesis: meaning is not arbitrary symbols mapped to external reality, but is structured by the way the human mind experiences and conceptualizes the world.

🧠
Meaning is Embodied
Language meaning is rooted in bodily experience. Abstract concepts are understood through physical schemas derived from interacting with the world.
🗂
Categories are Fuzzy
Human conceptual categories are not binary but gradient, organized around prototypes rather than necessary-and-sufficient conditions.
🔀
Grammar Encodes Meaning
Grammatical structures are not arbitrary — they reflect conceptual structures. Syntax is motivated by semantics and pragmatics.
🌐
Context is Constitutive
Meaning is always context-dependent. The same expression can mean radically different things depending on frames and construals.

The Major Sub-fields Overview

Sub-fieldFocusKey ScholarAI Relevance
Conceptual Metaphor TheoryHow abstract thought uses bodily metaphorsLakoff & JohnsonLLM metaphor processing, reasoning
Frame SemanticsKnowledge structures that activate meaningCharles FillmoreKnowledge graphs, FrameNet, NLU
Prototype TheoryGradient category membershipEleanor RoschClassification, fuzzy ML systems
Cognitive GrammarGrammar as conceptual organizationRonald LangackerSyntax parsing, construction grammars
Construction GrammarForm-meaning pairings at all levelsGoldberg, KayTransformer architectures, parsing
Image Schema TheoryPre-conceptual, spatial reasoning patternsJohnson, MandlerSpatial AI, robotic reasoning
💡
Key Insight: Cognitive Linguistics treats language as a window into the mind. For AI researchers, this means language data is not just text — it's a record of human conceptual structure, ready to be learned from and engineered with.
Module 1 · Lesson 1.2

Why Cognitive Linguistics Matters for AI

The gap between statistical language modelling and genuine understanding — and how CL helps bridge it.

The Understanding Problem Critical

Large Language Models are extraordinarily good at predicting text. But prediction is not the same as understanding. CL gives us a vocabulary and a framework for asking: what would it actually mean for an AI to understand language the way a human does?

The gap between statistical competence and semantic understanding is precisely where cognitive linguistics has the most to offer AI engineering and AI safety research.

THE UNDERSTANDING GAP
Raw Text Data
Statistical Patterns
LLM Output
Grounded Meaning
Human Understanding
🎯
Prompt Engineering
Understanding how AI models activate frames, metaphors, and prototypes allows engineers to craft prompts that reliably elicit the desired cognitive "space" in the model.
🔍
Interpretability
CL-derived concepts like frame activation and prototype gradience give us testable hypotheses about what is actually happening inside transformer attention heads.
⚖️
Bias Detection
Conceptual metaphors and prototypes embedded in training data encode social biases. CL tools help identify and measure these systematically.
🤖
Human-AI Communication
CL's pragmatics research (Gricean maxims, relevance theory) directly informs how conversational AI systems should manage cooperative dialogue.
🌍
Multilingual AI
Cross-linguistic conceptual structure — what's universal vs. language-specific — is critical for building AI that works across languages without cultural distortion.
Why now? The explosion of LLMs has made CL more relevant than ever. These models learn from human language at scale — which means they implicitly learn cognitive-linguistic structures. Understanding those structures is essential for alignment, evaluation, and improvement.
Module 1 · Lesson 1.3

Core Principles & AI Parallels

Mapping the founding commitments of Cognitive Linguistics onto the architecture of modern AI systems.

Principle 1: The Cognitive Commitment Foundational

CL is committed to describing language in a way that is consistent with what we know about the mind from cognitive science. Language is not a self-contained formal system — it reflects general cognitive principles.

AI Parallel

Neural language models that share architectural features with biological neural processing (attention, distributed representations) may be capturing something cognitively real — not just a convenient approximation.

Principle 2: The Generalization Commitment Foundational

CL seeks general principles that apply across all aspects of language — phonology, morphology, syntax, semantics, pragmatics, and discourse are all subject to the same cognitive principles.

AI Parallel

Transformer architectures apply the same attention mechanism across all levels of linguistic structure — an architectural choice that mirrors CL's commitment to unified cognitive principles.

Principle 3: Encyclopedic Semantics Foundational

Word meaning is not a minimal dictionary definition but an access point to vast encyclopedic knowledge. The word bird activates not just "feathered biped" but nesting behavior, song, migration, cultural associations, and more.

AI Parallel

Word embeddings in LLMs encode rich, context-sensitive semantic neighborhoods — operationalizing encyclopedic semantics computationally. A token's embedding captures far more than a dictionary definition.

Principle 4: Usage-Based Model Foundational

Linguistic knowledge is built up from actual instances of language use, not from innate abstract rules. Grammar is the statistical residue of countless acts of communication.

AI Parallel

This is precisely how LLMs are trained. Pre-training on massive corpora of usage instances is a direct computational implementation of the usage-based model. CL predicted this architecture decades before it existed.

🎯 Check Your Understanding

Which CL principle most directly predicted the success of large language models trained on vast text corpora?
Module 2 · Lesson 2.1

Embodied Meaning in AI

How the body shapes meaning — and the profound challenge this poses for disembodied AI systems.

Embodied Cognition — The Thesis Core Theory

George Lakoff and Mark Johnson argued in Philosophy in the Flesh (1999) that human concepts are not abstract, disembodied symbols. Instead, our conceptual system is grounded in sensorimotor experience. We understand "warmth" emotionally because we have felt physical warmth. We understand "UP" as positive because erect posture correlates with health and power.

This is not metaphor — it is the literal claim that the brain's conceptual system uses the same neural structures as sensorimotor processing.

🔥
Thermal Concepts
"Warm welcome," "cool reception," "heated argument" — temperature maps directly onto social and emotional concepts because of embodied thermal experience.
↕️
Vertical Orientation
"High status," "feeling low," "rising expectations" — verticality structures social, emotional, and evaluative concepts through the body's experience of gravity.
📦
Container Schemas
"In trouble," "out of ideas," "within reason" — the in/out bodily experience of containers structures abstract reasoning about categories, states, and boundaries.
🫀
Force Dynamics
"Pushing an argument," "resisting change," "breaking through barriers" — physical force and resistance structure how we reason about causation, agency, and obligation.

The Grounding Problem for AI Challenge

AI language models learn language from text — a disembodied medium. They have never felt heat, experienced gravity, or been inside a container. Yet they successfully use and manipulate all the conceptual structures that depend on these experiences.

This raises profound questions: Are LLMs using genuine conceptual structure, or are they sophisticated mimics? Can meaning that was originally grounded in the body survive transmission through text alone?

Research Finding

Probing studies have shown that LLMs encode systematic directional biases consistent with verticality metaphors (MORE IS UP, HAPPY IS UP). They learned these from text patterns alone — suggesting that embodied structure is recoverable from language data even without a body.

⚠️
Open Question: Multimodal models (GPT-4V, Gemini) that process images alongside text may achieve a form of "virtual grounding" through visual experience. Whether this constitutes genuine embodied understanding or a richer form of simulation remains an active research debate.
Module 2 · Lesson 2.2

Grounding Language in AI Models

Approaches to solving the symbol grounding problem — from robotics to multimodal transformers.

The Symbol Grounding Problem Core Problem

Harnad (1990) formalized the symbol grounding problem: if all symbols in a system are defined only in terms of other symbols, how does the system connect to the world? A dictionary that defines all words with other words is a closed loop — it only makes sense to someone who already understands some words through non-symbolic experience.

Pure text-based LLMs face this challenge: their representations are grounded only in relations to other tokens, not in the world itself.

🤖
Robotics Grounding
Embodied AI systems (robots) ground language in sensorimotor experience. The word "push" is grounded in the robot's actual experience of applying force to objects.
🖼️
Vision-Language Models
CLIP, DALL-E, GPT-4V ground language in visual representations. "Red apple" connects linguistic tokens to pixel distributions, providing a form of perceptual grounding.
🎵
Audio Grounding
Models trained on audio alongside text (Whisper, audio LLMs) ground language in acoustic features — "loud," "sharp," "resonant" gain non-textual referents.
🌐
World Models
Systems like DreamerV3 learn world models from interaction, grounding predictions in simulated physical consequences — a step toward CL-style embodied understanding.

Degrees of Grounding Framework

LevelSystem TypeGrounding SourceExample
Symbolic OnlyRule-based NLPSymbolic rules onlyEarly GOFAI systems
Statistical TextualText-only LLMsCo-occurrence statisticsGPT-2, BERT
PerceptualVision-Language ModelsImage-text alignmentGPT-4V, Gemini
Action-BasedEmbodied AI / RoboticsSensorimotor feedbackRT-2, PaLM-E
Full EmbodiedBiological cognitionFull body-world interactionHumans

🎯 Check Your Understanding

A vision-language model that connects the word "red" to pixel distributions in images represents which level of grounding?
Module 3 · Lesson 3.1

Conceptual Metaphor Theory & LLMs

How systematic metaphorical mappings structure human reasoning — and how AI models learn and use them.

Conceptual Metaphor Theory Core Theory

Lakoff and Johnson's Metaphors We Live By (1980) is one of the most cited books in cognitive science. Its central claim: metaphor is not a literary device but a fundamental cognitive mechanism. We understand abstract domains by systematically mapping structure from concrete, embodied source domains.

These are not random poetic flourishes — they are systematic, consistent, and culturally shared conceptual structures.

💼
ARGUMENT IS WAR
"He attacked my position." "She defended her argument." "I demolished his thesis." The entire conceptual domain of argument is structured by military conflict metaphors.
TIME IS MONEY
"Don't waste my time." "I've invested years in this." "Save time by automating." Abstract time is understood through the concrete domain of financial resources.
🛤️
LIFE IS A JOURNEY
"She's at a crossroads." "He's lost his way." "We're headed in the right direction." Life progress is understood through spatial movement toward destinations.
💡
IDEAS ARE LIGHT
"An illuminating thought." "She shed light on the problem." "I'm in the dark about this." Abstract understanding maps onto the embodied experience of visual perception.

How LLMs Represent Conceptual Metaphors Research

Multiple studies have used probing classifiers and attention analysis to investigate how LLMs represent conceptual metaphors. Key findings:

  • LLMs encode systematic metaphorical relationships in their embedding spaces — metaphorically related words cluster in ways consistent with CMT predictions
  • Models trained on more data show stronger alignment with human conceptual metaphor judgments, suggesting metaphor is largely learnable from text
  • Attention patterns in BERT-like models cluster metaphorical source and target domain tokens — suggesting metaphorical mappings are encoded in transformer weights
  • LLMs can generate novel metaphors that are judged as natural by humans, implying productive metaphorical competence beyond mere pattern matching
  • Cross-lingual studies show metaphorical representations partially align across languages in multilingual models, consistent with the universality hypothesis
🔬
Research Frontier: The MetaNet project at ICSI and TroFi dataset have created annotated corpora of metaphorical language use that are now being used to benchmark LLM metaphor understanding. GPT-4 achieves near-human performance on novel metaphor interpretation tasks.
Module 3 · Lesson 3.2

Metaphor in Prompt Engineering

Using conceptual metaphor theory to write better prompts and understand how framing shapes AI output.

Metaphor Priming in Prompts Applied

Because LLMs have internalized conceptual metaphors from training data, the metaphors you activate in a prompt shape how the model reasons about the topic. This is not a trick — it reflects genuine metaphorical structure in the model's representations.

// ARGUMENT IS WAR frame → combative, defensive reasoning Prompt A: "Attack the weaknesses in this business plan." // ARGUMENT IS EXPLORATION frame → collaborative, curious reasoning Prompt B: "Explore what this business plan might be missing." // ARGUMENT IS CONSTRUCTION frame → building, structural reasoning Prompt C: "Help me strengthen the foundations of this business plan." // All three prompts ask for critique — but activate different cognitive frames // and will produce measurably different outputs in tone, structure, and content

Practical Metaphor Techniques Applied

  • Role metaphors: "Act as a detective" (REASONING IS INVESTIGATION) vs "Act as a doctor" (REASONING IS DIAGNOSIS) activates different inference patterns
  • Spatial metaphors: "Step back and look at the big picture" activates wide-scope overview processing; "zoom into the details" activates close analysis
  • Journey metaphors: "Walk me through your reasoning" structures output as sequential narration
  • Container metaphors: "What's inside this concept? What's outside its scope?" explicitly structures boundary analysis
  • Temperature metaphors: "What's the hot take? What's the cold, hard truth?" activates different evaluative registers
Advanced Technique: Metaphor Conflict

Deliberately activating conflicting metaphors can generate richer analysis: "Using both a medical diagnosis frame AND a detective investigation frame, analyze why this product launch failed." The model must reconcile competing conceptual structures, often producing more nuanced output.

🎯 Applied Challenge

A prompt says: "Tear apart the logic in this argument." Which conceptual metaphor does this primarily activate?
Module 4 · Lesson 4.1

Frames & Knowledge Structures

Charles Fillmore's Frame Semantics — the theory that every word activates a rich background knowledge structure.

What is a Frame? Core Theory

A frame is a structured knowledge schema that represents a type of event, relationship, or object in the world. When you understand a word, you don't just access a definition — you activate a whole frame with roles, relationships, and expectations.

Fillmore's classic example: the word COMMERCIAL TRANSACTION activates a frame with roles: Buyer, Seller, Goods, Money, and Place — plus expectations about what typically happens.

COMMERCIAL TRANSACTION FRAME
Buyer
Seller
Goods
Money
Place
Transaction

All of "buy," "sell," "purchase," "vendor," "customer," "price," "shop" activate this same frame — but highlight different roles within it.

Frame Inheritance & Hierarchy Structure

Frames are organized in hierarchical networks. The COMMERCIAL TRANSACTION frame inherits from TRANSACTION, which inherits from EXCHANGE, which inherits from TRANSFER. Each level adds specificity while inheriting the structure above it.

This hierarchy allows for frame-based inference: if X is a shop (COMMERCIAL TRANSACTION), then there must be goods, a price, and an expectation of payment — even if never mentioned.

🔑
Lexical Access
Every content word is a frame-evoking element. Verbs are particularly rich frame-activators, specifying participant roles and event structure.
🧩
Frame Elements
Participant roles within a frame (Agent, Patient, Theme, Goal, etc.) are frame elements — the semantic "slots" that syntactic arguments fill.
📡
Null Instantiation
Frame elements can be absent from text but still active in understanding. "She ate" implies food exists even without mention — a powerful inference mechanism.
Module 4 · Lesson 4.2

FrameNet & AI Applications

How Fillmore's frame semantics became a computational resource — and how it powers modern NLU systems.

FrameNet: The Computational Resource Applied

FrameNet (framenet.icsi.berkeley.edu) is a lexical database built on Fillmore's frame semantics. It contains over 1,200 semantic frames, 13,000 lexical units, and 200,000+ annotated sentences. It is one of the most important resources in computational linguistics.

🔍
Semantic Role Labeling
SRL systems identify frame elements in text. "John sold Mary a book for $20" → Seller:John, Buyer:Mary, Goods:book, Money:$20. Used in QA, IE, and summarization.
📰
Information Extraction
Frame-based IE extracts structured event information. ATTACK frames extract Attacker, Target, Weapon, Place from news — powering intelligence analysis systems.
Question Answering
Frame element slots guide what information to extract to answer "Who sold what to whom for how much?" — frame structure maps directly to question types.
🎭
Sentiment Analysis
Frame-aware sentiment is more accurate than bag-of-words approaches. "The medicine killed the pain" vs "The medicine killed the patient" requires frame knowledge to parse correctly.
// Frame-based analysis pipeline Input: "Apple acquired the startup for $2 billion." FrameNet Lookup: "acquired" → COMMERCE_BUY frame Buyer: "Apple" Goods: "the startup" Money: "$2 billion" Seller: [null-instantiated — inferrable] // Frame elements = structured knowledge for downstream tasks // QA: "Who bought what?" → Buyer + Goods // IE: Extract all M&A events → COMMERCE_BUY frames

Frame Semantics in LLMs Research

Modern LLMs implicitly learn frame-like structures from training data — but in an unstructured way. Research directions include:

  • FrameNet-guided fine-tuning to make implicit frame knowledge explicit and controllable
  • Frame-conditioned generation: constraining LLM output to fill specific frame element slots
  • Probing LLMs for frame element representation — do attention heads specialize in specific frame roles?
  • Automatic FrameNet extension using LLMs to generate new frame annotations at scale
Module 5 · Lesson 5.1

Categories & Prototypes in AI

Why human categories are not boolean — and the implications for AI classification systems.

Classical vs. Prototype Theory Core Theory

The classical theory of categories (dominant until the 1970s) holds that categories are defined by necessary and sufficient conditions. Something is a BIRD if and only if it has features F1, F2, F3...

Eleanor Rosch's pioneering experiments in the 1970s showed this is wrong for human cognition. Categories are gradient, with some members being better examples than others. A robin is a "better" bird than a penguin, even though both are equally birds by classical definition.

BIRD CATEGORY — Prototype Structure
ROBIN
Prototype
Typicality: 1.0
EAGLE
Near Prototype
Typicality: 0.8
OSTRICH
Peripheral
Typicality: 0.5
PENGUIN
Marginal
Typicality: 0.3

Prototype Theory Principles Theory

  • Family resemblance: Members share overlapping features with each other, not a common core — like Wittgenstein's family members who share some but not all traits
  • Graded membership: Category membership is a matter of degree, not yes/no — measured by typicality ratings in psychological experiments
  • Prototype effects: Prototypical members are processed faster, remembered better, and used in reasoning more readily than peripheral members
  • Basic level categories: Intermediate-level categories (DOG, not ANIMAL or POODLE) are cognitively privileged — the level at which most knowledge is organized
Module 5 · Lesson 5.2

Fuzzy Logic & AI Classification

How prototype theory maps onto fuzzy sets, vector space models, and modern ML classification.

Prototype Theory → Computational Models Applied

Prototype theory directly inspired fuzzy set theory (Zadeh, 1965) and has profound implications for how we design and evaluate AI classification systems. Classical binary classifiers violate the cognitive reality of graded category membership.

🌫️
Fuzzy Sets
Elements have membership degrees between 0 and 1, not binary 0/1. A penguin has membership 0.3 in BIRD, not 0 — better modeling cognitive category structure.
📐
Vector Space Models
Word embeddings create prototype-like structures: category centroids emerge from averaging member vectors. Distance to centroid ≈ typicality rating.
🏔️
Softmax Probabilities
Neural classifiers outputting probability distributions naturally implement graded membership. A 70% confidence classification is cognitively more honest than forced binary output.
🔗
Clustering Algorithms
Fuzzy c-means clustering explicitly implements graded membership. K-means' hard assignment violates prototype theory; fuzzy approaches align better with human categorization.

Implications for AI Evaluation Critical

If human categories are gradient, then binary accuracy metrics (correct/wrong) are a poor measure of AI classification quality. A system that classifies a penguin as "not bird" is wrong — but it's less wrong than classifying a robin as "not bird."

Better Evaluation Approaches

Typicality-weighted accuracy: weight errors by prototype distance. Calibration: measure whether model confidence reflects category gradient. Human-model correlation: compare model confidence gradients to human typicality ratings. These metrics better capture cognitive reality than binary accuracy.

🎯 Check Your Understanding

A text classifier must categorize "She runs a small catering business" as BUSINESS type. Prototype theory suggests which approach is most cognitively valid?
Module 6 · Lesson 6.1

Constructions & Language Models

Construction Grammar's insight that form and meaning are inseparable — and why this matters for transformers.

What is a Construction? Core Theory

Construction Grammar (Goldberg 1995, Kay & Fillmore 1999) proposes that grammar consists of constructions: form-meaning pairings at all levels, from morphemes to sentence patterns. Crucially, constructions contribute meaning that cannot be derived from the words alone.

The Ditransitive Construction

Pattern: [Subj V Obj1 Obj2] → meaning: TRANSFER

"She gave him a book." → standard use (GIVE is inherently a transfer verb)
"She sneezed him the napkin." → unusual, but the CONSTRUCTION forces a transfer reading
"She talked him into compliance." → completely non-physical, but construction = transfer of state

The TRANSFER meaning comes from the CONSTRUCTION, not the verb "sneeze" or "talk."

🔄
Ditransitive
[Subj V Obj Obj2] = CAUSED MOTION / TRANSFER. Forces a transfer reading regardless of the verb used.
💥
Resultative
[Subj V Obj Adj/PP] = CAUSE TO BECOME. "She painted the wall red" — the object reaches the result state.
🚀
Caused Motion
[Subj V Obj PP] = CAUSE TO MOVE TO. "She sneezed the napkin off the table" — motion reading imposed by construction.
🔥
Way Construction
"She elbowed her way through the crowd." — idiomatic construction with MANNER + PATH meaning. Not compositional.
Module 6 · Lesson 6.2

Syntax–Semantics in Transformers

Evidence that transformer models learn construction-like representations — and how to leverage this.

Constructions in Transformer Representations Research

A rich body of probing research has investigated what syntactic and semantic knowledge is encoded in transformer layers. Construction Grammar predicts that form-meaning pairings should be holistically represented — and this is largely what is found.

  • Early layers encode local syntactic patterns; middle layers encode construction-level form-meaning associations; upper layers encode discourse-level pragmatics
  • Attention heads specialize: some track subject-verb agreement (syntactic), others track semantic role relationships (construction-level)
  • Contextual embeddings for the same verb differ systematically across constructions — "gave" in ditransitive vs. simple transitive has measurably different representations
  • Models generalize construction patterns to novel verbs — "She blicked him the widget" receives a transfer interpretation, consistent with construction-level learning

Construction-Aware Prompt Engineering Applied

Understanding which constructions you activate in a prompt allows more precise control over AI output structure and meaning.

// Resultative construction → forces focus on end-state "Explain quantum computing [until it is simple enough for a child]." // → Resultative: [V until Adj] → process aimed at achieving a result state // Ditransitive → frames output as explicit transfer of knowledge "Teach me the key insights of quantum computing." // → [teach + me + Obj] → TRANSFER frame: teacher → learner → content // Way construction → frames output as navigating obstacles "Work your way through the complexities of quantum computing." // → [V way PP] → MANNER + PATH: implies obstacles to overcome
🔬
Key Research: Ettinger (2020) and Linzen et al. (2016) demonstrated that BERT-like models learn construction-level form-meaning associations, not just surface statistics. This validates Construction Grammar's prediction that grammar is a network of meaningful patterns, not a derivational rule system.
Module 7 · Lesson 7.1

Image Schemas & AI Reasoning

Pre-conceptual spatial patterns that structure all human thought — and their surprising presence in AI systems.

What are Image Schemas? Core Theory

Mark Johnson proposed that image schemas are pre-linguistic, prelinguistic patterns of sensorimotor experience that structure all higher-level cognition. They are abstract, skeletal patterns derived from repeated bodily interactions with the environment.

Image schemas are not mental images — they are dynamic, structured patterns of interaction that can be elaborated metaphorically to structure abstract domains.

📦
CONTAINER Schema
Interior, boundary, exterior. "In love," "out of scope," "within limits." All bounded regions, categories, and states use this schema.
🛤️
PATH Schema
Source, trajectory, goal. "Achieving a goal," "following a plan," "progress toward success." All goal-directed processes map onto this schema.
FORCE Schema
Agent, force, resistance. "Pushing an argument," "social pressure," "resistance to change." Causation and agency are structured by force dynamics.
⚖️
BALANCE Schema
Symmetry, equilibrium. "Balancing trade-offs," "equilibrium," "offsetting costs." All equilibrium reasoning uses the balance schema.
🔗
LINK Schema
Connected entities. "Relationship," "association," "bond," "connection." All relational concepts use linking schemas.

Image Schemas in AI Spatial Reasoning Applied

Image schemas are directly relevant to spatial AI, robotic planning, and visual reasoning:

  • Robotic navigation: PATH schema (source, trajectory, goal) maps directly onto motion planning algorithms — A* search is a computational PATH schema
  • Scene understanding: CONTAINER schema structures how vision models parse spatial relationships — objects are "in" or "on" or "outside" containers
  • Causal reasoning: FORCE schema structures how AI models represent cause-effect relationships in knowledge graphs
  • LLM spatial tasks: Probing studies show LLMs encode image-schematic spatial relationships — "above," "between," "through" activate schema-consistent representations
Module 7 · Lesson 7.2

Semantic Roles & Natural Language Understanding

Thematic roles, VerbNet, PropBank — and how semantic role labeling powers modern NLU.

Thematic Roles Core Theory

Every verb frames an event with participant roles. These thematic roles (also called theta roles or semantic roles) capture the relationship between participants and the event — independent of their syntactic position.

RoleDefinitionExample
AgentIntentional causer of event"John broke the window."
PatientEntity undergoing change"John broke the window."
ThemeEntity moving or described"She sent the package."
ExperiencerEntity mentally affected"She feared the storm."
GoalEndpoint of motion/transfer"He gave it to her."
SourceStart point of motion/transfer"She came from Paris."
InstrumentMeans by which event occurs"She cut it with scissors."
BeneficiaryEntity benefiting from event"She baked a cake for him."

Semantic Role Labeling in NLU Applied

SRL systems automatically identify who did what to whom, where, when, and how. This provides structured semantic representations for downstream tasks.

"The company quietly laid off 500 workers last quarter." Predicate: laid_off Agent: "The company" → WHO acted Patient: "500 workers" → WHO was affected Manner: "quietly" → HOW it was done Time: "last quarter" → WHEN it happened // SRL output enables: // - QA: "Who did the company lay off?" → Patient // - Bias detection: Is the Agent always named when Patient is? // - Summarization: Extract key predicate-argument structures
Module 8 · Lesson 8.1

Coherence & Discourse Models

How texts cohere beyond the sentence level — and what AI needs to understand discourse structure.

Discourse Coherence Core Theory

Coherent discourse is more than a sequence of grammatical sentences. It requires that sentences stand in coherence relations to each other — logical, causal, temporal, and rhetorical connections that make the text hang together as a unified whole.

Hobbs (1979) and Mann & Thompson's Rhetorical Structure Theory (RST, 1988) formalized these relations. RST identifies 30+ coherence relations including ELABORATION, CONTRAST, CAUSE, EVIDENCE, CONCESSION, and BACKGROUND.

🔍
ELABORATION
A clause gives more detail about what was said in the previous clause. Most body paragraphs use this relation to expand on a topic sentence.
CAUSE / RESULT
"Sales dropped. Therefore, we cut budgets." Causal relations are fundamental to explanatory discourse and argumentative structure.
↔️
CONTRAST
"Model A is fast. But Model B is accurate." Sets up comparison that guides the reader's evaluation across the discourse.
🔗
EVIDENCE
A subsequent clause supports or justifies a preceding claim. Critical for argumentative discourse and scientific writing.

Discourse in LLMs Research

LLMs generate locally coherent text remarkably well but struggle with global discourse structure in long documents. Key failure modes:

  • Entity tracking errors over long spans — losing track of what pronoun refers to whom across many paragraphs
  • Inconsistent discourse relations — claiming CAUSE in one section and CONTRAST in another for the same relationship
  • Missing BACKGROUND information — assuming the reader knows context not yet established
  • Failures in RST-based summarization — missing the nucleus-satellite structure that identifies what is most vs. least important
Module 8 · Lesson 8.2

Grice's Maxims & Conversational AI

The cooperative principle and how pragmatic competence shapes the quality of human-AI dialogue.

The Cooperative Principle Core Theory

Paul Grice (1975) proposed that human communication is governed by a Cooperative Principle: "Make your conversational contribution such as is required, at the stage at which it occurs, by the accepted purpose or direction of the talk exchange." This breaks down into four maxims:

Quality
Do not say what you believe to be false. Do not say that for which you lack adequate evidence. → AI hallucination is a massive Maxim of Quality violation.
📏
Quantity
Be as informative as required. Do not be more informative than required. → Verbose AI responses and over-hedged disclaimers violate Quantity.
🎯
Relation
Be relevant. → AI responses that provide tangentially related content instead of what was actually asked violate Relation (Relevance).
💎
Manner
Be perspicuous: avoid obscurity, avoid ambiguity, be brief, be orderly. → Convoluted AI prose and unnecessary hedging violates Manner.

Implicature in AI Applied

Grice's greatest insight: speakers routinely mean more than they say, and listeners infer this via the cooperative principle. Implicature is this additional meaning.

Implicature Example for AI

User: "Can you write this in simpler terms?"
Literal meaning: A yes/no question about capability.
Implied meaning: Please actually rewrite this in simpler terms.

A pragmatically competent AI (like current LLMs) correctly reads the implicature and rewrites, rather than answering "Yes, I can." Grice explains why this is rational and cooperative.

Conversational AI failures often reduce to violations of Gricean maxims or failures to compute implicature. CL-informed evaluation of AI dialogue should explicitly test for pragmatic competence using Gricean criteria.

🎯 Check Your Understanding

An AI assistant is asked "What time is the meeting?" and responds with a 3-paragraph explanation of how meetings are generally scheduled in organizations. Which Gricean maxim does this most violate?
Module 9 · Lesson 9.1

CL-Informed Prompt Design

Synthesizing the entire course into a systematic framework for cognitively-grounded prompt engineering.

The CL Prompt Engineering Framework Applied

Every effective prompt operates simultaneously at multiple CL levels. Expert prompt engineers — whether they know it or not — are applying cognitive-linguistic principles. This framework makes those principles explicit and systematic.

🗂
Frame Level
What frame does your prompt activate? "Review this code" activates an EVALUATION frame. "Debug this code" activates a PROBLEM-SOLVING frame. Choose deliberately.
🌀
Metaphor Level
What source domain are you invoking? Judicial metaphors (judge, verdict) → formal evaluation. Surgical metaphors (cut, remove) → precise deletion tasks.
🎭
Prototype Level
Specify the prototype when precision matters: "Write a formal email (think annual report language)" vs "Write a formal email (think polite request)."
🔨
Construction Level
Use constructions that presuppose the desired output structure. Resultative: "Explain X until it is clear to a 10-year-old." Constrains the process and end state.
💬
Pragmatic Level
Maximize Gricean cooperativity. Be as specific as required (Quantity), keep it relevant (Relation), be clear and ordered (Manner), ground claims in evidence (Quality).

Prompt Analysis: Before & After Applied

// BEFORE: Cognitively vague "Tell me about this business plan." // Frame: unclear (evaluation? summary? advice?) // Metaphor: none activated // Prototype: "tell me about" is maximally vague // Construction: simple declarative, no result state specified // Pragmatics: under-specified (violates Quantity) // AFTER: CL-informed "Act as a venture capitalist who has 10 minutes before a pitch meeting. Identify the three biggest structural weaknesses in this business plan, then suggest one concrete fix for each — until each fix is specific enough to implement this week." // Frame: VC EVALUATION + TIME PRESSURE // Metaphor: STRUCTURE IS ARCHITECTURE (weaknesses, structural) // Prototype: "VC with 10 min" = specific, actionable evaluator // Construction: Resultative (until specific enough) + LIST OF THREE // Pragmatics: Relation (focused), Quantity (3 items), Manner (implement this week)
Module 9 · Lesson 9.2

Bias, Ethics & Language in AI

How cognitive-linguistic structures encode and perpetuate social biases in AI systems.

Language Encodes Ideology Critical

CL has always been attentive to how language doesn't just describe reality but actively constructs it. The frames, metaphors, and prototypes embedded in language carry ideological content — and AI trained on human language inherits this content at scale.

⚖️
Metaphor & Framing Bias
Describing crime as a "beast" (vs. a "virus") in LLM training data shifts model recommendations toward enforcement (vs. prevention). Lakoff's framing research shows how metaphor choice constrains reasoning.
👤
Prototype Bias
If the prototype "doctor" in training data is male, models exhibit gender bias in medical contexts. Prototype biases are more insidious than lexical biases because they structure entire inference patterns.
🗂
Frame Asymmetry
Historical texts frame different social groups in systematically different frames (AGENT vs. PATIENT, ACTOR vs. OBJECT). LLMs trained on this data inherit asymmetric role assignments.
📏
Semantic Change Bias
Words change meaning over time — often reflecting social change. Training data that spans time periods without temporal weighting may mix outdated conceptual structures with current ones.

CL-Based Bias Detection Methods Applied

  • Metaphor auditing: Systematically identifying which source domains are used to describe which social groups — revealing conceptual hierarchies
  • Frame asymmetry analysis: Using SRL to measure whether members of different groups are disproportionately assigned Agent vs. Patient roles in generated text
  • Prototype probing: Testing model assumptions about "typical" members of social categories using cloze tests and embedding proximity
  • WEAT testing: Word Embedding Association Test (Caliskan et al. 2017) measures associations between concepts in embedding spaces — directly operationalizing prototype theory
  • Counter-framing: Deliberately prompting alternative frames to measure how easily LLMs shift conceptual structures — revealing how deeply biases are embedded
⚠️
Ethical Imperative: CL-informed bias detection is not just about fairness metrics — it reveals the conceptual structures through which AI systems model social reality. Fixing surface-level lexical bias without addressing deeper frame and metaphor biases leads to systems that are superficially fair but cognitively discriminatory.
Module 10 · Lesson 10.1

Multimodal Cognition & AI

When language meets vision, sound, and action — and how CL frameworks extend to multimodal AI.

Multimodal Meaning Construction Advanced

Human cognition is fundamentally multimodal — we understand the word "red" not as a dictionary entry but as the integrated product of visual experience, emotional associations, cultural frames, and linguistic co-occurrence. CL's embodiment thesis predicts that richer AI cognition requires multimodal grounding.

👁️
Vision-Language Models
CLIP, GPT-4V, Gemini Vision learn cross-modal associations. The CL prediction: shared image-schema representations should emerge at the intersection — "above," "through," "expanding" should have consistent cross-modal representations.
🎧
Audio-Language Models
Thermal, tactile, and auditory schemas should be better grounded in audio-language models. "Loud," "sharp," "resonant" gain acoustic referents beyond text statistics.
🤲
Embodied Agents
RT-2, PaLM-E, and other vision-language-action models begin to approach full embodied grounding. Force, motion, and contact schemas gain genuine sensorimotor reference.
🧩
Cross-Modal Coherence
CL's insight: the same image schemas underlie all modalities. A successful multimodal AI should show unified schema representations that are consistent across vision, language, and action.

Testing CL Predictions in Multimodal Models Research

  • Do vision-language models show consistent VERTICALITY metaphor activation across image (visual up/down) and language (positive/negative) modalities?
  • Can multimodal models solve CL's "conceptual conflict" problems — cases where visual and linguistic information activate conflicting frames?
  • Do multimodal models show more human-like prototype effects than text-only models — specifically for perceptually grounded categories like COLOR and SHAPE?
  • Do embodied AI agents develop action-grounded semantic representations that differ systematically from text-only representations in cognitively predicted ways?
Module 10 · Lesson 10.2 — Final

Future: AGI & Cognitive Depth

What would it take for AI to achieve genuine cognitive-linguistic competence? And what does CL tell us about the path forward?

The Cognitive Depth Problem Frontier

Current LLMs have impressive cognitive width — they can perform adequately across an enormous range of linguistic tasks. But they may lack cognitive depth — the rich, embodied, frame-structured, prototype-organized conceptual system that underlies human language understanding.

CL gives us the vocabulary to specify precisely what this depth consists of — and therefore what tests we would need to pass to claim genuine AI language understanding.

🧬
Embodied Grounding
Full AGI will require sensorimotor grounding of abstract concepts. The field is moving toward this with embodied AI agents — but bodily experience at human richness remains distant.
🌐
Cultural Frame Knowledge
Frames are not universal — they are culturally constructed. AGI will need not one universal frame system but a multilingual, multicultural network of frames — a Pangloss of conceptual worlds.
Dynamic Categorization
Human categories are constructed on the fly in context — "Is a briefcase a piece of furniture if someone is using it as a table?" AGI needs dynamic, context-sensitive prototype construction.
🔮
Metaphorical Creativity
Novel conceptual metaphor creation — genuinely new mappings between domains — is a hallmark of human intelligence. Testing AI for this goes far beyond metaphor recognition.

CL Research Agenda for AI — 2025–2035 Roadmap

  • Benchmark development: Create CL-theoretically motivated benchmarks for frame activation, prototype gradience, metaphor creativity, and image-schematic reasoning
  • Interpretability through CL: Use frame semantics and construction grammar as theoretical lenses for mechanistic interpretability research in transformers
  • Cross-cultural AI: Build multilingual, frame-aware systems that respect the cultural specificity of conceptual systems rather than defaulting to Western cognitive defaults
  • Bias through a CL lens: Move beyond lexical bias to frame-level and metaphor-level debiasing methods
  • Dialogue systems: Design conversational AI using Relevance Theory and Gricean pragmatics as explicit evaluation criteria
  • Multimodal schema learning: Test whether shared image-schema representations emerge in multimodal models trained on diverse sensory inputs
🎓
Course Complete! You've now covered the full landscape of Cognitive Linguistics for AI — from foundational theories to frontier research questions. The field is young, the questions are profound, and the tools CL provides are uniquely suited to making AI systems that are not just statistically impressive, but cognitively principled.
Key Takeaways
LLMs are implicit cognitive systems — understanding them requires cognitive-linguistic theory
Frame semantics and prototype theory are directly operationalizable in computational systems
Conceptual metaphors shape AI reasoning — and can be deliberately engineered in prompts
The grounding problem is the central challenge — multimodal and embodied AI are the path forward
AI bias is fundamentally cognitive-structural — not just lexical
CL provides the theoretical vocabulary for AI interpretability and evaluation
Module 1 · Lesson 1.4 — NEW

The Centaur Model: AI as a Cognitive Surrogate

A new class of foundation model trained not on text, but on millions of human decisions — and what it means for AI as a simulator of cognition.

Beyond Language: Cognitive Foundation Models Cutting Edge

In 2024, researchers published the "Centaur" model in Nature — a foundation model trained on data from millions of decisions collected from psychological experiments. Unlike LLMs trained on text, Centaur is trained on the structured outputs of human cognition: choices made under uncertainty, reaction times, error patterns, and behavioral tendencies across cognitive tasks.

This represents a fundamental shift: from AI that simulates language to AI that simulates the mind itself. The model can predict what a new human participant will do in an unseen cognitive task with remarkable accuracy — a true cognitive surrogate.

CENTAUR VS. STANDARD LLM — TRAINING DATA COMPARISON
Standard LLM
Web text, books, code
Learns: linguistic patterns, world knowledge, reasoning traces in text
Centaur Model
Psych experiment decisions
Learns: human decision patterns, cognitive biases, behavioral tendencies, error profiles

Key Concepts Theory

  • Cognitive foundation models: Pre-trained on behavioral data rather than linguistic data — or fine-tuned LLMs adapted to predict human cognitive outputs across diverse tasks
  • Behavior prediction: The ability to forecast what a specific type of person (defined by their prior decision profile) will do in a novel task — a computational theory of mind at scale
  • Fine-tuning on psychological datasets: Adapting base models using annotated cognitive task data: memory tasks, attention experiments, decision-making under risk, learning paradigms
  • Limitations of instruction understanding: Centaur highlights a critical gap — a model can simulate the outputs of cognition without understanding the instructions that framed the task from a human perspective
🧪
Clinical Applications
Cognitive surrogates could model how patients with specific cognitive profiles (ADHD, early dementia, dyslexia) will respond to interventions — before clinical trials are run.
🎯
UX & Interface Design
Predict how different user profiles will behave when navigating a new interface — replacing expensive A/B testing with cognitive simulation.
⚠️
Cognitive Privacy
If AI can simulate individual cognitive profiles, it raises profound privacy concerns: prediction of mental states, vulnerabilities, and decision-making under manipulation.
🔬
Theory Testing
Cognitive models can serve as computational implementations of psychological theories — testable, falsifiable, and scalable in ways lab experiments cannot be.
⚠️
CL Implication: If Centaur-style models can simulate human cognitive behavior, they implicitly capture the conceptual structures CL describes — frame activations, prototype effects, metaphor use — not as learned linguistic patterns but as behavioral tendencies. This is a new kind of empirical validation for CL theory.

🎯 Check Your Understanding

What fundamentally distinguishes a "Centaur" cognitive foundation model from a standard large language model?
Module 2 · Lesson 2.3 — NEW

Sensorimotor Grounding in Robotics

Where CL meets the physical world — how embodied AI systems ground abstract language in motor control and affordance.

The Physical Symbol System Hypothesis vs. Connectionism Foundational Debate

Newell and Simon's Physical Symbol System Hypothesis (1976) claimed that physical symbol manipulation is both necessary and sufficient for general intelligent action. This positioned intelligence as substrate-independent: any system that can manipulate symbols can be intelligent.

Connectionism — and later CL — pushed back: symbols without grounding are meaningless. Intelligence is not substrate-independent; it is rooted in the physical interaction of a body with an environment. Robotics has become the empirical battleground for this debate.

🖐️
Affordance Learning
Gibson's concept of affordances — what an object offers for action — is being implemented in embodied AI. A robot learns "graspable," "pushable," "stackable" through physical interaction, not definitional rules.
🗣️
Grounding Commands in Motor Control
RT-2, PaLM-E and similar models translate linguistic commands ("pick up the red block") into motor control sequences. Language is grounded in the robot's kinematic space.
🔄
Feedback Loops
Sensorimotor grounding is not one-way. Physical outcomes update semantic representations: "fragile" is grounded by experiencing breakage. The body teaches language.
🌍
Sim-to-Real Transfer
Models trained in simulation then deployed in the physical world face a "grounding gap." CL's emphasis on actual bodily experience predicts this challenge and suggests why pure simulation may never fully substitute.

CL Concepts in Robotic Language Grounding Applied

CL ConceptRobotics ImplementationExample System
Image Schema: CONTAINER3D spatial bounding volumes for object placementSpatialVLA, RT-2
Image Schema: FORCEHaptic feedback learning for manipulationDexterous manipulation models
Image Schema: PATHMotion planning from source to goalA* / RRT + language grounding
Affordance (Gibson)Object interaction possibility learningAffordanceNet, Contact-GraspNet
Prototype TheoryCategory generalization across object variantsOpen-vocabulary detection models
Frame SemanticsAction schema libraries for task planningSayCan, Code as Policies
Case Study: SayCan (Google DeepMind)

SayCan grounds language instructions in robot affordances: it combines an LLM (which ranks possible actions by linguistic plausibility) with a value function (which ranks actions by physical feasibility given the current scene). The result: "Can you bring me something to clean up a spill?" → semantically plausible AND physically executable actions are ranked highest. This is CL's embodied grounding operationalized in a real system.

💡
The Key Tension: CL predicts that full linguistic competence requires full embodied grounding. But current robot systems achieve impressive language-grounded behavior with only partial grounding. Does this suggest CL's embodiment thesis is too strong? Or that these systems are exploiting text-pre-trained linguistic scaffolding while remaining fundamentally ungrounded? This is one of the most active debates in embodied AI research.
Module 7 · Lesson 7.3 — NEW

Event Semantics & Temporal Reasoning

How AI represents time, duration, and the internal structure of events — and why temporal reasoning remains a persistent challenge.

Event Structure in Cognitive Linguistics Core Theory

Every verb in human language encodes not just an action but an event structure — the internal temporal shape of the event. This is aspectual framing: the same real-world event can be described as ongoing, completed, repeated, or instantaneous depending on how the speaker "frames" it.

Zeno Vendler (1957) proposed the foundational four-way classification of event types that underlies all subsequent event semantics:

Achievements
Instantaneous, telic. "She won the race." "He recognized her." No internal duration — they happen at a point. An achievement either happened or it didn't.
🏁
Accomplishments
Extended, telic. "She ran a marathon." "He built a house." Have duration AND a natural endpoint. Cancelable: "She ran a marathon but didn't finish" is coherent.
🌊
Activities
Extended, atelic. "She ran." "He swam." Have duration but no natural endpoint. Not cancelable: "She ran but didn't finish" makes no sense.
🏔️
States
Extended, atelic, non-dynamic. "She knows French." "He loves her." Stable conditions without inherent change. Resist progressive forms: "?She is knowing French."

Situation Models & Temporal Anchoring in LLMs Research

Humans build situation models when reading text — mental representations of the described events, including their temporal location and duration. LLMs must implicitly do something similar to answer temporal questions correctly.

  • Temporal anchoring failures: LLMs frequently confuse the temporal order of events described in non-chronological text — they struggle to maintain a coherent timeline separate from sentence order
  • Aspect blindness: Models often treat achievement and accomplishment sentences the same, failing to infer that "she arrived" (achievement) implies completion while "she was arriving" implies non-completion
  • Duration estimation: Without grounded bodily experience of time, LLMs show systematic biases in duration estimation — underestimating geological and overestimating social time scales
  • Temporal knowledge cutoff: The training cutoff imposes an artificial "event horizon" — events after the cutoff do not exist in the model's situation model of the world
// Aspectual framing changes inference — critical for AI QA systems "John was crossing the street when the car hit him." // Progressive aspect → event in progress, NOT completed // Correct inference: John may NOT have finished crossing // Common LLM error: assuming crossing was completed "John crossed the street." // Simple past → accomplishment completed // Correct inference: John DID finish crossing // Temporal reasoning benchmarks (TimeML, TempEval, TORQUE) // probe exactly these aspectual distinctions in LLM outputs

Event-Based Knowledge Representation Applied

Event semantics directly informs how AI systems structure knowledge graphs and temporal databases. The TimeML annotation scheme and EventCorefBank implement linguistic event structure in computational form:

  • Events are typed by Vendler class and linked to temporal intervals (TimeML TIMEX3)
  • Temporal relations (BEFORE, AFTER, DURING, SIMULTANEOUS) are explicitly annotated between event pairs
  • Aspect and modality markers determine whether events are actual, hypothetical, or negated
  • LLMs fine-tuned on TimeML data show significantly improved temporal reasoning, demonstrating the value of CL-informed annotation schemes

🎯 Check Your Understanding

A user asks an AI: "Did Maria finish the report?" The context is: "Maria was writing the report when the power went out." What is the correct inference, and why do LLMs often get this wrong?
Module 9 · Lesson 9.3 — NEW

Cognitive Homogenization & The Future of Thought

The "homogenizing effect" of LLMs on human expression — and why cognitive diversity may be the most important AI safety issue nobody is talking about.

The Homogenization Effect Critical Issue

A 2024 analysis in Trends in Cognitive Sciences identified a disturbing pattern: as millions of people rely on the same LLMs for writing, reasoning, and communication assistance, their outputs are converging toward the statistical center of the training distribution. The diversity of human expression — the conceptual edges, minority framings, and culturally specific reasoning patterns — is being smoothed away.

This is not a future concern. It is measurable now in text corpora, in standardized writing assistance platforms, and in the outputs of AI-augmented professional communication.

🌀
Epistemic Collapse
When AI systems trained on majority-language, WEIRD (Western, Educated, Industrialized, Rich, Democratic) data become the default reasoning assistant, minority epistemologies lose their computational support infrastructure.
📝
Linguistic Standardization
LLM writing assistance pushes users toward the statistical mean of "good writing." Dialects, registers, and stylistic signatures that deviate from the mean are flagged and corrected out of existence.
🧩
Loss of Minority Reasoning Strategies
Different cultures have different default problem-solving frames — collectivist vs. individualist, cyclical vs. linear temporal reasoning, relational vs. categorical logic. LLMs privilege the majority strategy.
🌐
Collective Intelligence Risk
Cognitive diversity is to collective intelligence what genetic diversity is to ecosystem resilience. Homogenization creates brittleness — when the dominant cognitive framework fails, no alternatives remain.

De-Westernizing AI: Cross-Cultural Cognitive Frames Applied

CL has long documented that different languages encode different conceptual worlds. The Sapir-Whorf hypothesis in its moderate form — that language shapes (but doesn't fully determine) cognition — is now well-supported. AI trained predominantly on English encodes English conceptual defaults.

  • Multilingual frame networks: Building frame semantic databases that are culturally parallel — not translations of English frames but indigenous conceptual structures. Projects like FrameNet Brasil and JapaneseFN demonstrate this approach
  • Cultural-specific metaphors: Mandarin TIME IS A VERTICAL AXIS (future is below, past is above) conflicts with English TIME IS HORIZONTAL. AI systems need to maintain cultural metaphor inventories, not just translate
  • Diverse training curation: Actively over-representing low-resource languages, oral traditions, and non-WEIRD cultural texts in pre-training corpora — not as a fairness gesture but as a cognitive diversity imperative
  • Cognitive pluralism by design: Building AI systems that can deliberately shift their default reasoning frame based on cultural context — a "Pangloss" of conceptual worlds rather than a single universal cognitive architecture
Measurement: How to Detect Homogenization

Researchers measure cognitive homogenization by: (1) tracking lexical diversity metrics (type-token ratio, vocabulary richness) in AI-assisted vs. unassisted writing over time; (2) comparing frame activation distributions in AI outputs across cultural contexts; (3) measuring conceptual metaphor convergence — are users from different cultures converging on the same source domains when writing with AI assistance? Early results suggest the answer is yes, and the convergence is toward English-language defaults.

⚠️
Urgent Research Priority: The window for preserving cognitive diversity is closing quickly. As AI writing assistance becomes ubiquitous in education and professional settings, the next generation of language users may develop their cognitive frameworks through AI — making the AI's default conceptual structures their own. CL researchers have a unique responsibility to document and protect cognitive diversity before this window closes.
Module 10 · Lesson 10.3 — NEW

Neuro-Symbolic Reasoning in LLMs

The resurgence of symbolic AI — and how hybridizing logic with neural networks addresses LLMs' deepest reasoning limitations.

Why Pure Neural Networks Fall Short Motivation

LLMs exhibit remarkable linguistic fluency but systematic failures in compositional, logical, and causal reasoning — exactly the kinds of structured inference that symbolic AI excels at. The neuro-symbolic paradigm attempts to get the best of both: neural networks' flexibility and grounding + symbolic systems' logical rigor and explicit structure.

NEURAL VS. SYMBOLIC VS. NEURO-SYMBOLIC
Neural Only
✓ Flexible, robust to noise
✓ Learns from data
✗ Brittle logical reasoning
✗ Opaque, hard to verify
Symbolic Only
✓ Logically sound
✓ Transparent, verifiable
✗ Brittle to noise
✗ Requires manual rules
Neuro-Symbolic
✓ Flexible + structured
✓ Data-driven + logical
✓ Partially verifiable
~ Still maturing

Architecture Approaches Technical

  • Logic-infused LLMs: Constrain LLM outputs using formal logic rules. The model generates candidate inferences; a symbolic verifier accepts or rejects them. Systems like Logic-LM use this approach for math and commonsense reasoning
  • Knowledge graphs for frame semantics: CL frames (FrameNet, VerbNet) can be represented as knowledge graphs that LLMs query at inference time — providing explicit relational structure that the neural component lacks internally. Wikidata, ConceptNet, and FrameNet are commonly integrated this way
  • Grounding symbolic rules in vector space: Neural Theorem Provers (NTPs) and Differentiable Inductive Logic Programming (DILP) learn logic-like rules from data but represent them as differentiable operations in vector space — enabling gradient-based learning of symbolic structure
  • Chain-of-Thought as soft symbolism: CoT prompting encourages LLMs to produce explicit reasoning steps — a lightweight neuro-symbolic approach where the "symbolic" component is natural language reasoning traces rather than formal logic
🗺️
Frame + Graph Integration
Linking LLM outputs to FrameNet knowledge graphs provides structured frame element verification — checking if generated text correctly instantiates the expected frame roles.
🔢
Mathematical Reasoning
Neuro-symbolic systems like AlphaGeometry (DeepMind) combine neural intuition with formal geometric theorem proving — achieving olympiad-level performance neither component achieves alone.
⚖️
Legal & Medical AI
High-stakes domains require verifiable reasoning chains. Neuro-symbolic approaches allow LLMs to generate candidate reasoning while symbolic components verify logical consistency with domain rules.
🔬
CL Connection: CL's frame semantics and construction grammar are already symbolic structures — formal representations of meaning that are directly implementable as knowledge graph schemas. Neuro-symbolic AI that integrates FrameNet and VerbNet is, in a very real sense, implementing CL theory computationally. The hybrid architecture CL has always described (embodied grounding + structured knowledge) maps onto neuro-symbolic architectures (neural grounding + symbolic structure).
Module 10 · Lesson 10.4 — NEW · Final

CL-Based Mechanistic Interpretability

Using cognitive linguistics as a theoretical lens to open the black box — mapping construction grammar and frame semantics onto transformer weights.

The Interpretability Crisis Motivation

Mechanistic interpretability is one of the central challenges of modern AI safety: understanding why a model produces a given output in terms of its internal computations. Current approaches often lack theoretical grounding — they identify circuits and features empirically without a cognitive theory to predict what should be found.

Cognitive Linguistics offers exactly this: a theoretically grounded, empirically validated map of how human linguistic knowledge is organized. CL structures are testable hypotheses about what should exist inside a well-trained language model.

🔌
Construction Neurons
If transformers learn Construction Grammar, specific attention heads or MLP neurons should preferentially activate for specific constructions. "Construction neurons" are units that reliably fire for the ditransitive, resultative, or way-construction patterns regardless of lexical content.
🗺️
Frame Activation Mapping
Probing classifiers trained on FrameNet labels can identify which model layers encode which frame elements. Frame activation maps reveal where in the computational graph COMMERCE_BUY frames become distinct from COMMERCE_SELL frames.
📏
CL-Based Benchmarks
CL theory generates precise, falsifiable predictions about model behavior: prototype gradience, metaphor consistency, frame inheritance, aspectual inference. These predictions become evaluation benchmarks — moving beyond accuracy to cognitive validity.
🧲
Semantic Role Circuits
Attention patterns for subject-verb-object should differ systematically from agent-verb-patient assignments when passive voice shifts syntactic roles without changing semantic roles. CL-informed probing can isolate these circuits.

Methodology: CL-Informed Probing Technical

// CL-Informed Probing Pipeline Step 1: Generate CL-hypothesis // Hypothesis: Layer 8–12 of BERT encodes frame element roles // Prediction: AGENT and PATIENT representations differ in a learnable direction Step 2: Build probe dataset // FrameNet sentences annotated with frame elements // "Apple [Buyer] bought [COMMERCE_BUY] the startup [Goods]" // Extract hidden states at element positions Step 3: Train linear probe probe = LinearClassifier(hidden_states → frame_element_label) // High probe accuracy = model encodes the distinction // Low accuracy = distinction not represented at this layer Step 4: Causal intervention // Activation patching: swap AGENT vector with PATIENT vector // Does the model's output change as CL predicts? // → Tests whether the representation is causally active, not just correlated

Current Findings & Open Questions Research Frontier

  • Layer specialization: Consistent with CL predictions, early transformer layers encode phonological/morphological patterns, middle layers encode syntactic constructions, and upper layers encode discourse-pragmatic structures
  • Prototype geometry: Category centroids in embedding space do exhibit prototype structure — members cluster around central prototypes with graded distance, validating prototype theory in computational form
  • Metaphor circuits: Preliminary evidence suggests metaphorical source-target mappings are encoded in specific attention head clusters — "construction neurons" for metaphor are being actively researched
  • Frame element binding: Attention heads that track subject-verb-object also show sensitivity to semantic role distinctions, but only partially — suggesting frame elements are distributed across multiple circuits
  • Open question: Do models that are better aligned with CL structures (as measured by CL-based benchmarks) also perform better on downstream tasks? Establishing this link would validate the CL interpretability program empirically
🎓 Course Complete — Updated Edition

You have completed all 28 lessons across 10 modules — including the 6 new lessons added from the expert review. The field is moving fast. The concepts you now hold give you both the theoretical vocabulary and the empirical tools to navigate it.

Centaur models show behavioral CL is now empirically testable at scale
Robotic grounding is the physical test of embodied cognition theory
Event semantics reveals temporal reasoning as a persistent AI gap
Cognitive homogenization is the most urgent socio-cognitive AI risk
Neuro-symbolic AI implements CL's hybrid vision computationally
CL gives interpretability research its theoretical backbone
Module 11 · Lesson 11.1 — NEW MODULE

CMT as an LLM Prompting Paradigm

Research-backed techniques for using Conceptual Metaphor Theory to dramatically improve LLM reasoning accuracy, coherence, and creative depth — with live comparisons across ChatGPT, Claude, and Gemini.

From Theory to Prompting Practice Research-Backed

A landmark 2024 study by Kramer demonstrated that CMT-based prompting significantly enhances LLM reasoning accuracy, clarity, and metaphorical coherence across a range of complex tasks. This lesson operationalizes that finding into a concrete, replicable prompting framework applicable to any frontier model.

The core insight: LLMs have already internalized conceptual metaphor structures from training data. CMT prompting doesn't teach them new knowledge — it activates the cognitive structures they already have in a deliberate, structured way.

CMT-INSPIRED CHAIN-OF-THOUGHT (CoT) STRUCTURE
Step 1
Identify Source Domain
+ its properties
Step 2
Identify Target Domain
(abstract concept)
Step 3
Create Inference
via mapping
Output
Coherent, grounded
response

The Three CMT-CoT Examples (Kramer, 2024) Applied Research

These are the canonical examples from the research demonstrating how CMT CoT structures reasoning:

Example 1
TIME IS MONEY
Source: Money — a valuable, finite resource that can be spent, saved, wasted, or invested
Target: Time — an abstract flow
Inference: Time is valuable, should be spent wisely, can be wasted or invested — guiding time management reasoning
Example 2
HEART OF STONE
Source: Stone — hard, cold, unfeeling, impervious to impact
Target: A person's emotional character
Inference: The person is emotionally unresponsive, unsympathetic — maps stone's physical properties onto personality
Example 3
WORLD IS A STAGE
Source: Stage — a platform for scripted performances, roles, audiences
Target: Human life and daily activity
Inference: Life involves roles to perform — daily activities are performances, implying social scripting and audience awareness

LLM Configuration for CMT Prompting Practical

The research specifies optimal configurations for activating CMT reasoning in frontier models:

// OPTIMAL CMT SYSTEM MESSAGE (Kramer, 2024) System: "You are a cognitive agent utilizing Conceptual Metaphor Theory (CMT). When interpreting abstract concepts, always map them to concrete physical experiences. Structure your reasoning as follows: 1. Identify the Source Domain and its concrete properties 2. Identify the Target Domain (the abstract concept) 3. Perform the Inference: explain the abstract via the concrete mapping Maintain metaphorical consistency throughout your response." // RECOMMENDED TEMPERATURE temperature: 0.7 // Balanced creativity + coherence // Too low (0.1–0.3): literal, loses metaphorical richness // Too high (0.9–1.0): incoherent, loses systematic mapping // FEW-SHOT PRIMING ALTERNATIVE User (shot 1): "Explain procrastination using CMT." Assistant: "Source: A heavy burden. Target: Unfinished tasks. Inference: Procrastination accumulates weight — the longer you delay, the heavier the burden becomes, until movement is nearly impossible."

ChatGPT vs. Claude vs. Gemini — CMT Performance Profile LLM Comparison

ModelCMT StrengthsCMT WeaknessesBest Use
ChatGPT (GPT-4o) Superior syntactic explanation of metaphors; consistent source-target mapping; strong few-shot CMT adoption Can become formulaic with repeated CMT prompting; less creative in novel metaphor generation Technical CMT-CoT reasoning tasks; step-by-step metaphor analysis
Claude (Sonnet/Opus) Actively plans coherent structures; manifests "thinking" process aligned with cognitive schemas; strong metaphor consistency across long outputs May over-elaborate inferences; sometimes adds meta-commentary about the metaphor rather than using it Long-form CMT-structured writing; complex domain mapping; nuanced emotional metaphors
Gemini (Pro/Ultra) Academic precision in metaphor identification; complex grammatical constructions; strong cross-lingual CMT Tends toward academic lexis — less accessible; sometimes prioritizes technical accuracy over metaphorical richness Cross-cultural metaphor analysis; academic and research writing; multilingual CMT tasks
DeepSeek Strong mathematical/logical metaphor domains; efficient token use in CMT prompts Less training on creative metaphorical corpora; weaker emotional domain mapping Technical/scientific domains where CMT is applied to formal concepts
📊
Research Finding (Kramer, 2024): Models preconditioned with CMT system messages showed measurable improvements in reasoning accuracy, clarity of explanation, and metaphorical coherence versus standard prompting baselines. The effect was consistent across GPT-4, Claude, and Gemini — suggesting CMT activates cross-model cognitive structures, not model-specific quirks.

🎯 Check Your Understanding

According to the Kramer (2024) research, what temperature setting is recommended for CMT-based LLM prompting, and why?
Module 11 · Lesson 11.2 — NEW MODULE

Frame Semantics & Knowledge Representation in Modern LLMs

How ChatGPT, Claude, and Gemini encode, identify, and leverage semantic frames — from latent implicit knowledge to In-Context Learning for structured extraction.

Latent Frame Knowledge in LLMs Research Finding

Recent studies (2024) confirm that major LLMs — ChatGPT, Claude, and Gemini — encode latent knowledge of Frame Semantics without explicit training on FrameNet. This knowledge is implicit in the statistical patterns of their training corpora: because humans consistently use frame-evoking language in consistent ways, the models learn frame structures as a byproduct of language modeling.

This means every frontier LLM is already a partial frame semanticist — and the question for AI engineers is how to activate and direct this latent knowledge rather than how to install it from scratch.

🔎
Frame Identification
Modern LLMs can identify semantic frames within text with high accuracy. Given "John bought the startup," they correctly identify the COMMERCE_BUY frame and its elements — Buyer: John, Goods: startup — without explicit instruction.
🧲
In-Context Learning (ICL)
LLMs can be prompted with a few FrameNet-style examples and will then extract frame-semantic arguments from new sentences with high accuracy — often outperforming traditional statistical parsers trained specifically for this task.
🔄
Backward Reasoning
Modern models use frame structures to "think backward" — given a desired output state (a frame's goal element), they can plan token sequences that consistently fill the frame toward that outcome. Planning is frame navigation.
🌐
Relational Reasoning
Frames provide the relational backbone for coherent generation. When asked to generate a story about a transaction, models activate Commerce frames and maintain consistent role assignments across the entire narrative.

ICL Frame Extraction — Practical Protocol Applied

The most powerful practical application of frame semantics in modern LLMs is using In-Context Learning to turn any frontier model into a high-accuracy semantic parser:

// IN-CONTEXT LEARNING: Frame Semantic Extraction Protocol System: "You are a semantic frame analyst. For each sentence, identify: 1. The TARGET WORD (frame-evoking element) 2. The FRAME it evokes (use FrameNet names where possible) 3. The FRAME ELEMENTS present (with their text spans) 4. Any NULL-INSTANTIATED elements (implied but absent)" // Few-shot examples (2-3 shots sufficient for high accuracy) Shot 1 Input: "Apple acquired the startup for $2B." Shot 1 Output: Target Word: "acquired" Frame: COMMERCE_BUY Buyer: "Apple" Goods: "the startup" Money: "$2B" Seller: [NULL — inferrable from context] // New input — model performs accurately without further training New Input: "She donated her library to the university." // LLM output: GIVING frame — Donor: She, Theme: library, // Recipient: university, [Purpose: null-instantiated]

Frame Semantics Across the Three Major LLMs LLM Comparison

CapabilityChatGPTClaudeGemini
Frame Identification Accuracy High — strong on common frames; weaker on specialized domain frames High — particularly strong on discourse-level frame consistency across long texts High — strong cross-lingual frame identification; strong on technical frames
Null-Instantiation Detection Moderate — identifies implied roles inconsistently Strong — actively infers implied participants; consistent with "thinking" process Moderate — better when prompted explicitly to consider missing elements
ICL Frame Extraction Excellent — adopts FrameNet format rapidly with 2–3 shots Excellent — maintains format across many examples; adds useful uncertainty flags Good — slightly more formal output format; strong for structured data pipelines
Relational Reasoning via Frames Strong — uses frames for coherent narrative generation Very strong — plans coherent multi-frame narratives; explicit schema activation Strong — particularly good at multi-step frame chains in academic contexts
💡
Key Practical Insight: You do not need to fine-tune LLMs on FrameNet data to get frame-semantic extraction. ICL with 2–3 well-formatted shots activates the latent frame knowledge that already exists in the model. This makes FrameNet-quality semantic parsing accessible to any team with API access — no training pipeline required.
Module 11 · Lesson 11.3 — NEW MODULE

Construction Grammar & Syntactic-Semantic Integration in Transformers

How frontier LLMs demonstrate Construction Grammar principles in practice — with a detailed comparison of how ChatGPT, Claude, and Gemini differ in constructional understanding.

CxG in Transformer Behavior — What the Research Shows Research

Researchers have found that frontier LLMs demonstrate a strong connection between syntactic form and semantic meaning that closely aligns with Construction Grammar's core thesis. Rather than treating syntax as an independent module, these models appear to have learned form-meaning pairings holistically — exactly as CxG predicts.

The evidence comes from three converging sources: grammaticality judgment tasks, constructional meaning identification, and syntactic explanation ability — where models must articulate why a construction means what it means.

The CxG Diagnostic Tasks Methodology

  • Constructional meaning identification: Given "She sneezed the napkin off the table," can the model identify that the CAUSED-MOTION meaning comes from the construction, not the verb "sneeze"?
  • Grammatical judgment: Is "She talked him into compliance" grammatical? If yes, which construction licenses it? Models must identify the CAUSED-CHANGE-OF-STATE construction
  • Constructional slot analysis: What fills the [Subj V Obj Adj] resultative slot and what meaning emerges? "She painted the wall red" vs. "She painted the wall quickly" — only the first is a true resultative
  • Cross-constructional inference: Does "He baked her a cake" (ditransitive) imply transfer? What about "He baked a cake for her" (prepositional dative)? Models should detect the subtle pragmatic difference CxG predicts
🔷
ChatGPT on CxG
Superior syntactic explanation — can clearly articulate why a given construction produces its meaning. Strongest on grammatical judgment tasks. Identifies "construction slots" explicitly when prompted. Best for pedagogical CxG explanation tasks.
🔶
Claude on CxG
Actively plans coherent structures that respect constructional constraints. Manifests a "thinking" process aligned with cognitive schemas — less likely to violate construction-level meaning even in long outputs. Best for CxG-constrained creative writing and generation tasks.
🔹
Gemini on CxG
Tends toward academic lexis with complex grammatical constructions. Strong cross-lingual CxG awareness. Performs best on formal linguistic analysis tasks. Particularly good at identifying CxG structures in non-English texts.

Implications for LLM Design — Leveraging CxG Applied

Construction Grammar's insights can be directly leveraged to improve LLM output quality in three ways:

// TECHNIQUE 1: Construction-Constrained Generation // Force a specific construction to shape the output structure "Rewrite this explanation using only resultative constructions ([Subject V Object to Adj-state]) to emphasize outcomes over processes." // TECHNIQUE 2: Constructional Diagnosis Prompting // Use CxG to probe model understanding of unusual sentences "Analyze: 'The professor lectured the class into a stupor.' Identify: (a) which construction licenses this, (b) what meaning the construction contributes that the verb alone doesn't, (c) which frame elements are present." // TECHNIQUE 3: Cross-Construction Contrastive Prompting // Exploit constructional meaning differences for precise communication "Compare: 'She gave him the keys' vs 'She gave the keys to him.' In what contexts would each be more natural? What subtle differences in perspective or presupposition does each carry?" // → Activates CxG's insight that ditransitive vs. prepositional dative // encode different speaker perspectives on the transfer event

🎯 Applied Challenge

A user prompts: "She laughed him out of the room." According to Construction Grammar — which the research confirms LLMs have internalized — where does the CAUSED-MOTION meaning of this sentence come from?
Module 11 · Lesson 11.4 — FINAL LESSON

Cognitive Engineering & Ethical AI

Designing AI systems with human-like cognitive architectures — and the ethical responsibilities that come with building machines that think like us.

What is Cognitive Engineering? Emerging Field

Cognitive Engineering is the discipline of designing AI systems that not only process information efficiently but do so through architectures that are explicitly informed by human cognitive structure. Rather than treating LLMs as black-box function approximators, cognitive engineering asks: how should we design, train, evaluate, and deploy AI so that its internal representations and reasoning processes map onto what we know about human cognition from CL and cognitive science?

This is the synthesis of everything this course has covered — it is CL applied not just as analysis but as design philosophy.

🏗️
CL-Informed Architecture
Design choices guided by CL: training on usage-based corpora (usage-based model), attention mechanisms that respect frame-element boundaries, output constraints that honor construction-level semantics.
🔍
CL-Informed Evaluation
Moving beyond perplexity and BLEU: benchmarks that test prototype gradience, metaphor consistency, frame inference, aspectual reasoning, and construction-level meaning — the dimensions CL theory tells us matter for genuine understanding.
🛡️
CL-Informed Alignment
Using frame semantics and prototype theory to detect and measure misalignment: does the model's conceptual structure match human conceptual structure? Divergence is a measurable alignment failure.
⚖️
CL-Informed Ethics
Frame asymmetry, prototype bias, metaphor ideology — CL's tools for detecting the conceptual structures embedded in language directly serve AI ethics auditing at a level deeper than surface-level content filtering.

Neuro-Symbolic Integration for Explainable AI Technical

The most practically important frontier in cognitive engineering is building AI systems whose reasoning is explainable through CL structures. Rather than post-hoc attribution methods that highlight input tokens, CL-informed explainability maps AI decisions onto human-interpretable conceptual structures:

  • Frame-level explanation: "This recommendation was made because the COMMERCE_BUY frame was activated with you as Buyer and Product X as Goods" — more interpretable than attention weight visualizations
  • Metaphor-level explanation: "This risk assessment used the JOURNEY metaphor: you are currently at a crossroads with two paths of different risk profiles" — grounds AI reasoning in human cognitive structures
  • Construction-level explanation: "This instruction was parsed as a CAUSED-CHANGE-OF-STATE construction, interpreting X as the intended result state" — enables disambiguation of ambiguous instructions
  • Prototype-level explanation: "This classification has 0.7 confidence because the input has 70% overlap with the category prototype, not 100%" — communicates uncertainty in human-interpretable terms

The Role of CL in Achieving AGI Frontier

The final question: is CL-informed AI a path toward Artificial General Intelligence — or a richer form of narrow AI that is better aligned with human cognition?

The CL-AGI Thesis

AGI, if it means AI that understands language the way humans do, requires the cognitive structures CL describes: embodied grounding, frame-organized knowledge, prototype-structured categories, metaphor-based abstract reasoning, and pragmatic competence. A system that achieves all of these at human level would, by definition, have human-level language understanding — and language understanding may be the clearest path to general intelligence. CL is not just a tool for building better NLP systems. It is a map of what general intelligence looks like.

🎓
Course Complete — Final Edition. You have now completed all 32 lessons across 11 modules. From the foundational principles of CL to the cutting edge of CMT prompting research, cognitive engineering, and LLM comparison — this course gives you both the theory and the tools to work at the intersection of human cognition and artificial intelligence.
Module 11 — Core Takeaways

The industry-ready bridge between CL theory and modern AI practice.

CMT-CoT prompting at temperature 0.7 measurably improves LLM reasoning across all major models
All frontier LLMs encode latent frame knowledge — ICL activates it without fine-tuning
Claude plans via schemas; ChatGPT explains constructions; Gemini excels cross-lingually
Cognitive engineering is the design philosophy that unifies all CL applications in AI
CL-informed explainability is more human-interpretable than attention attribution
CL is not just a tool — it is a map of what AGI must achieve