Cognitive Linguistics for AI — Complete Course

How the Human Mind Shapes Artificial Intelligence

A rigorous, applied course bridging cognitive science, linguistics, and AI engineering — from metaphor theory to transformer architectures.

11 Modules

32 Lessons

90+ Key Concepts

∞ Applications

NLP LLMs Frame Semantics Metaphor Theory Prototype Theory Embodied AI Prompt Engineering Cognitive Semantics Transformers AI Ethics Neuro-Symbolic AI Centaur Models Interpretability Cognitive Diversity CMT-CoT Prompting ChatGPT vs Claude vs Gemini Cognitive Engineering

MODULE 01

Foundations of CL & AI

4 lessons

MODULE 02

Embodied Cognition & NLP

3 lessons

MODULE 03

Conceptual Metaphor

2 lessons

MODULE 04

Frame Semantics

2 lessons

MODULE 05

Prototype Theory

2 lessons

MODULE 06

Construction Grammar

2 lessons

MODULE 07

Cognitive Semantics

3 lessons

MODULE 08

Discourse & Pragmatics

2 lessons

MODULE 09

AI Applications

3 lessons

MODULE 10

Frontiers & Future

4 lessons

NEW

MODULE 11

CL in the Era of Modern LLMs

4 lessons

Module 1 · Lesson 1.1

What is Cognitive Linguistics?

Understanding the discipline that changed how we think about language, meaning, and mind — and why it's the key to better AI.

Definition & Origins Core

Cognitive Linguistics (CL) is an interdisciplinary approach to language that emerged in the 1970s–1980s, pioneered by George Lakoff, Mark Johnson, Ronald Langacker, and Charles Fillmore. Unlike formal linguistics (which treats language as an autonomous, rule-based system), CL holds that language is inseparable from cognition.

The central thesis: meaning is not arbitrary symbols mapped to external reality, but is structured by the way the human mind experiences and conceptualizes the world.

🧠

Meaning is Embodied

Language meaning is rooted in bodily experience. Abstract concepts are understood through physical schemas derived from interacting with the world.

🗂

Categories are Fuzzy

Human conceptual categories are not binary but gradient, organized around prototypes rather than necessary-and-sufficient conditions.

🔀

Grammar Encodes Meaning

Grammatical structures are not arbitrary — they reflect conceptual structures. Syntax is motivated by semantics and pragmatics.

🌐

Context is Constitutive

Meaning is always context-dependent. The same expression can mean radically different things depending on frames and construals.

The Major Sub-fields Overview

Sub-field	Focus	Key Scholar	AI Relevance
Conceptual Metaphor Theory	How abstract thought uses bodily metaphors	Lakoff & Johnson	LLM metaphor processing, reasoning
Frame Semantics	Knowledge structures that activate meaning	Charles Fillmore	Knowledge graphs, FrameNet, NLU
Prototype Theory	Gradient category membership	Eleanor Rosch	Classification, fuzzy ML systems
Cognitive Grammar	Grammar as conceptual organization	Ronald Langacker	Syntax parsing, construction grammars
Construction Grammar	Form-meaning pairings at all levels	Goldberg, Kay	Transformer architectures, parsing
Image Schema Theory	Pre-conceptual, spatial reasoning patterns	Johnson, Mandler	Spatial AI, robotic reasoning

💡

Key Insight: Cognitive Linguistics treats language as a window into the mind. For AI researchers, this means language data is not just text — it's a record of human conceptual structure, ready to be learned from and engineered with.

Module 1 · Lesson 1.2

Why Cognitive Linguistics Matters for AI

The gap between statistical language modelling and genuine understanding — and how CL helps bridge it.

The Understanding Problem Critical

Large Language Models are extraordinarily good at predicting text. But prediction is not the same as understanding. CL gives us a vocabulary and a framework for asking: what would it actually mean for an AI to understand language the way a human does?

The gap between statistical competence and semantic understanding is precisely where cognitive linguistics has the most to offer AI engineering and AI safety research.

THE UNDERSTANDING GAP

Raw Text Data

→

Statistical Patterns

→

LLM Output

≠

Grounded Meaning

→

Human Understanding

🎯

Prompt Engineering

Understanding how AI models activate frames, metaphors, and prototypes allows engineers to craft prompts that reliably elicit the desired cognitive "space" in the model.

🔍

Interpretability

CL-derived concepts like frame activation and prototype gradience give us testable hypotheses about what is actually happening inside transformer attention heads.

⚖️

Bias Detection

Conceptual metaphors and prototypes embedded in training data encode social biases. CL tools help identify and measure these systematically.

🤖

Human-AI Communication

CL's pragmatics research (Gricean maxims, relevance theory) directly informs how conversational AI systems should manage cooperative dialogue.

🌍

Multilingual AI

Cross-linguistic conceptual structure — what's universal vs. language-specific — is critical for building AI that works across languages without cultural distortion.

✅

Why now? The explosion of LLMs has made CL more relevant than ever. These models learn from human language at scale — which means they implicitly learn cognitive-linguistic structures. Understanding those structures is essential for alignment, evaluation, and improvement.

Module 1 · Lesson 1.3

Core Principles & AI Parallels

Mapping the founding commitments of Cognitive Linguistics onto the architecture of modern AI systems.

Principle 1: The Cognitive Commitment Foundational

CL is committed to describing language in a way that is consistent with what we know about the mind from cognitive science. Language is not a self-contained formal system — it reflects general cognitive principles.

AI Parallel

Neural language models that share architectural features with biological neural processing (attention, distributed representations) may be capturing something cognitively real — not just a convenient approximation.

Principle 2: The Generalization Commitment Foundational

CL seeks general principles that apply across all aspects of language — phonology, morphology, syntax, semantics, pragmatics, and discourse are all subject to the same cognitive principles.

AI Parallel

Transformer architectures apply the same attention mechanism across all levels of linguistic structure — an architectural choice that mirrors CL's commitment to unified cognitive principles.

Principle 3: Encyclopedic Semantics Foundational

Word meaning is not a minimal dictionary definition but an access point to vast encyclopedic knowledge. The word bird activates not just "feathered biped" but nesting behavior, song, migration, cultural associations, and more.

AI Parallel

Word embeddings in LLMs encode rich, context-sensitive semantic neighborhoods — operationalizing encyclopedic semantics computationally. A token's embedding captures far more than a dictionary definition.

Principle 4: Usage-Based Model Foundational

Linguistic knowledge is built up from actual instances of language use, not from innate abstract rules. Grammar is the statistical residue of countless acts of communication.

AI Parallel

This is precisely how LLMs are trained. Pre-training on massive corpora of usage instances is a direct computational implementation of the usage-based model. CL predicted this architecture decades before it existed.

🎯 Check Your Understanding

Which CL principle most directly predicted the success of large language models trained on vast text corpora?

Module 2 · Lesson 2.1

Embodied Meaning in AI

How the body shapes meaning — and the profound challenge this poses for disembodied AI systems.

Embodied Cognition — The Thesis Core Theory

George Lakoff and Mark Johnson argued in Philosophy in the Flesh (1999) that human concepts are not abstract, disembodied symbols. Instead, our conceptual system is grounded in sensorimotor experience. We understand "warmth" emotionally because we have felt physical warmth. We understand "UP" as positive because erect posture correlates with health and power.

This is not metaphor — it is the literal claim that the brain's conceptual system uses the same neural structures as sensorimotor processing.

🔥

Thermal Concepts

"Warm welcome," "cool reception," "heated argument" — temperature maps directly onto social and emotional concepts because of embodied thermal experience.

↕️

Vertical Orientation

"High status," "feeling low," "rising expectations" — verticality structures social, emotional, and evaluative concepts through the body's experience of gravity.

📦

Container Schemas

"In trouble," "out of ideas," "within reason" — the in/out bodily experience of containers structures abstract reasoning about categories, states, and boundaries.

🫀

Force Dynamics

"Pushing an argument," "resisting change," "breaking through barriers" — physical force and resistance structure how we reason about causation, agency, and obligation.

The Grounding Problem for AI Challenge

AI language models learn language from text — a disembodied medium. They have never felt heat, experienced gravity, or been inside a container. Yet they successfully use and manipulate all the conceptual structures that depend on these experiences.

This raises profound questions: Are LLMs using genuine conceptual structure, or are they sophisticated mimics? Can meaning that was originally grounded in the body survive transmission through text alone?

Research Finding

Probing studies have shown that LLMs encode systematic directional biases consistent with verticality metaphors (MORE IS UP, HAPPY IS UP). They learned these from text patterns alone — suggesting that embodied structure is recoverable from language data even without a body.

⚠️

Open Question: Multimodal models (GPT-4V, Gemini) that process images alongside text may achieve a form of "virtual grounding" through visual experience. Whether this constitutes genuine embodied understanding or a richer form of simulation remains an active research debate.

Module 2 · Lesson 2.2

Grounding Language in AI Models

Approaches to solving the symbol grounding problem — from robotics to multimodal transformers.

The Symbol Grounding Problem Core Problem

Harnad (1990) formalized the symbol grounding problem: if all symbols in a system are defined only in terms of other symbols, how does the system connect to the world? A dictionary that defines all words with other words is a closed loop — it only makes sense to someone who already understands some words through non-symbolic experience.

Pure text-based LLMs face this challenge: their representations are grounded only in relations to other tokens, not in the world itself.

🤖

Robotics Grounding

Embodied AI systems (robots) ground language in sensorimotor experience. The word "push" is grounded in the robot's actual experience of applying force to objects.

🖼️

Vision-Language Models

CLIP, DALL-E, GPT-4V ground language in visual representations. "Red apple" connects linguistic tokens to pixel distributions, providing a form of perceptual grounding.

🎵

Audio Grounding

Models trained on audio alongside text (Whisper, audio LLMs) ground language in acoustic features — "loud," "sharp," "resonant" gain non-textual referents.

🌐

World Models

Systems like DreamerV3 learn world models from interaction, grounding predictions in simulated physical consequences — a step toward CL-style embodied understanding.

Degrees of Grounding Framework

Level	System Type	Grounding Source	Example
Symbolic Only	Rule-based NLP	Symbolic rules only	Early GOFAI systems
Statistical Textual	Text-only LLMs	Co-occurrence statistics	GPT-2, BERT
Perceptual	Vision-Language Models	Image-text alignment	GPT-4V, Gemini
Action-Based	Embodied AI / Robotics	Sensorimotor feedback	RT-2, PaLM-E
Full Embodied	Biological cognition	Full body-world interaction	Humans

🎯 Check Your Understanding

A vision-language model that connects the word "red" to pixel distributions in images represents which level of grounding?

Module 3 · Lesson 3.1

Conceptual Metaphor Theory & LLMs

How systematic metaphorical mappings structure human reasoning — and how AI models learn and use them.

Conceptual Metaphor Theory Core Theory

Lakoff and Johnson's Metaphors We Live By (1980) is one of the most cited books in cognitive science. Its central claim: metaphor is not a literary device but a fundamental cognitive mechanism. We understand abstract domains by systematically mapping structure from concrete, embodied source domains.

These are not random poetic flourishes — they are systematic, consistent, and culturally shared conceptual structures.

💼

ARGUMENT IS WAR

"He attacked my position." "She defended her argument." "I demolished his thesis." The entire conceptual domain of argument is structured by military conflict metaphors.

⏳

TIME IS MONEY

"Don't waste my time." "I've invested years in this." "Save time by automating." Abstract time is understood through the concrete domain of financial resources.

🛤️

LIFE IS A JOURNEY

"She's at a crossroads." "He's lost his way." "We're headed in the right direction." Life progress is understood through spatial movement toward destinations.

💡

IDEAS ARE LIGHT

"An illuminating thought." "She shed light on the problem." "I'm in the dark about this." Abstract understanding maps onto the embodied experience of visual perception.

How LLMs Represent Conceptual Metaphors Research

Multiple studies have used probing classifiers and attention analysis to investigate how LLMs represent conceptual metaphors. Key findings:

LLMs encode systematic metaphorical relationships in their embedding spaces — metaphorically related words cluster in ways consistent with CMT predictions
Models trained on more data show stronger alignment with human conceptual metaphor judgments, suggesting metaphor is largely learnable from text
Attention patterns in BERT-like models cluster metaphorical source and target domain tokens — suggesting metaphorical mappings are encoded in transformer weights
LLMs can generate novel metaphors that are judged as natural by humans, implying productive metaphorical competence beyond mere pattern matching
Cross-lingual studies show metaphorical representations partially align across languages in multilingual models, consistent with the universality hypothesis

🔬

Research Frontier: The MetaNet project at ICSI and TroFi dataset have created annotated corpora of metaphorical language use that are now being used to benchmark LLM metaphor understanding. GPT-4 achieves near-human performance on novel metaphor interpretation tasks.

Module 3 · Lesson 3.2

Metaphor in Prompt Engineering

Using conceptual metaphor theory to write better prompts and understand how framing shapes AI output.

Metaphor Priming in Prompts Applied

Because LLMs have internalized conceptual metaphors from training data, the metaphors you activate in a prompt shape how the model reasons about the topic. This is not a trick — it reflects genuine metaphorical structure in the model's representations.

// ARGUMENT IS WAR frame → combative, defensive reasoning Prompt A: "Attack the weaknesses in this business plan." // ARGUMENT IS EXPLORATION frame → collaborative, curious reasoning Prompt B: "Explore what this business plan might be missing." // ARGUMENT IS CONSTRUCTION frame → building, structural reasoning Prompt C: "Help me strengthen the foundations of this business plan." // All three prompts ask for critique — but activate different cognitive frames // and will produce measurably different outputs in tone, structure, and content

Practical Metaphor Techniques Applied

Role metaphors: "Act as a detective" (REASONING IS INVESTIGATION) vs "Act as a doctor" (REASONING IS DIAGNOSIS) activates different inference patterns
Spatial metaphors: "Step back and look at the big picture" activates wide-scope overview processing; "zoom into the details" activates close analysis
Journey metaphors: "Walk me through your reasoning" structures output as sequential narration
Container metaphors: "What's inside this concept? What's outside its scope?" explicitly structures boundary analysis
Temperature metaphors: "What's the hot take? What's the cold, hard truth?" activates different evaluative registers

Advanced Technique: Metaphor Conflict

Deliberately activating conflicting metaphors can generate richer analysis: "Using both a medical diagnosis frame AND a detective investigation frame, analyze why this product launch failed." The model must reconcile competing conceptual structures, often producing more nuanced output.

🎯 Applied Challenge

A prompt says: "Tear apart the logic in this argument." Which conceptual metaphor does this primarily activate?

Module 4 · Lesson 4.1

Frames & Knowledge Structures

Charles Fillmore's Frame Semantics — the theory that every word activates a rich background knowledge structure.

What is a Frame? Core Theory

A frame is a structured knowledge schema that represents a type of event, relationship, or object in the world. When you understand a word, you don't just access a definition — you activate a whole frame with roles, relationships, and expectations.

Fillmore's classic example: the word COMMERCIAL TRANSACTION activates a frame with roles: Buyer, Seller, Goods, Money, and Place — plus expectations about what typically happens.

COMMERCIAL TRANSACTION FRAME

Buyer

Seller

Goods

Money

Place

Transaction

All of "buy," "sell," "purchase," "vendor," "customer," "price," "shop" activate this same frame — but highlight different roles within it.

Frame Inheritance & Hierarchy Structure

Frames are organized in hierarchical networks. The COMMERCIAL TRANSACTION frame inherits from TRANSACTION, which inherits from EXCHANGE, which inherits from TRANSFER. Each level adds specificity while inheriting the structure above it.

This hierarchy allows for frame-based inference: if X is a shop (COMMERCIAL TRANSACTION), then there must be goods, a price, and an expectation of payment — even if never mentioned.

🔑

Lexical Access

Every content word is a frame-evoking element. Verbs are particularly rich frame-activators, specifying participant roles and event structure.

🧩

Frame Elements

Participant roles within a frame (Agent, Patient, Theme, Goal, etc.) are frame elements — the semantic "slots" that syntactic arguments fill.

📡

Null Instantiation

Frame elements can be absent from text but still active in understanding. "She ate" implies food exists even without mention — a powerful inference mechanism.

Module 4 · Lesson 4.2

FrameNet & AI Applications

How Fillmore's frame semantics became a computational resource — and how it powers modern NLU systems.

FrameNet: The Computational Resource Applied

FrameNet (framenet.icsi.berkeley.edu) is a lexical database built on Fillmore's frame semantics. It contains over 1,200 semantic frames, 13,000 lexical units, and 200,000+ annotated sentences. It is one of the most important resources in computational linguistics.

🔍

Semantic Role Labeling

SRL systems identify frame elements in text. "John sold Mary a book for $20" → Seller:John, Buyer:Mary, Goods:book, Money:$20. Used in QA, IE, and summarization.

📰

Information Extraction

Frame-based IE extracts structured event information. ATTACK frames extract Attacker, Target, Weapon, Place from news — powering intelligence analysis systems.

❓

Question Answering

Frame element slots guide what information to extract to answer "Who sold what to whom for how much?" — frame structure maps directly to question types.

🎭

Sentiment Analysis

Frame-aware sentiment is more accurate than bag-of-words approaches. "The medicine killed the pain" vs "The medicine killed the patient" requires frame knowledge to parse correctly.

// Frame-based analysis pipeline Input: "Apple acquired the startup for $2 billion." FrameNet Lookup: "acquired" → COMMERCE_BUY frame Buyer: "Apple" Goods: "the startup" Money: "$2 billion" Seller: [null-instantiated — inferrable] // Frame elements = structured knowledge for downstream tasks // QA: "Who bought what?" → Buyer + Goods // IE: Extract all M&A events → COMMERCE_BUY frames

Frame Semantics in LLMs Research

Modern LLMs implicitly learn frame-like structures from training data — but in an unstructured way. Research directions include:

FrameNet-guided fine-tuning to make implicit frame knowledge explicit and controllable
Frame-conditioned generation: constraining LLM output to fill specific frame element slots
Probing LLMs for frame element representation — do attention heads specialize in specific frame roles?
Automatic FrameNet extension using LLMs to generate new frame annotations at scale

Module 5 · Lesson 5.1

Categories & Prototypes in AI

Why human categories are not boolean — and the implications for AI classification systems.

Classical vs. Prototype Theory Core Theory

The classical theory of categories (dominant until the 1970s) holds that categories are defined by necessary and sufficient conditions. Something is a BIRD if and only if it has features F1, F2, F3...

Eleanor Rosch's pioneering experiments in the 1970s showed this is wrong for human cognition. Categories are gradient, with some members being better examples than others. A robin is a "better" bird than a penguin, even though both are equally birds by classical definition.

BIRD CATEGORY — Prototype Structure

ROBIN

Prototype
Typicality: 1.0

→

EAGLE

Near Prototype
Typicality: 0.8

→

OSTRICH

Peripheral
Typicality: 0.5

→

PENGUIN

Marginal
Typicality: 0.3

Prototype Theory Principles Theory

Family resemblance: Members share overlapping features with each other, not a common core — like Wittgenstein's family members who share some but not all traits
Graded membership: Category membership is a matter of degree, not yes/no — measured by typicality ratings in psychological experiments
Prototype effects: Prototypical members are processed faster, remembered better, and used in reasoning more readily than peripheral members
Basic level categories: Intermediate-level categories (DOG, not ANIMAL or POODLE) are cognitively privileged — the level at which most knowledge is organized

Module 5 · Lesson 5.2

Fuzzy Logic & AI Classification

How prototype theory maps onto fuzzy sets, vector space models, and modern ML classification.

Prototype Theory → Computational Models Applied

Prototype theory directly inspired fuzzy set theory (Zadeh, 1965) and has profound implications for how we design and evaluate AI classification systems. Classical binary classifiers violate the cognitive reality of graded category membership.

🌫️

Fuzzy Sets

Elements have membership degrees between 0 and 1, not binary 0/1. A penguin has membership 0.3 in BIRD, not 0 — better modeling cognitive category structure.

📐

Vector Space Models

Word embeddings create prototype-like structures: category centroids emerge from averaging member vectors. Distance to centroid ≈ typicality rating.

🏔️

Softmax Probabilities

Neural classifiers outputting probability distributions naturally implement graded membership. A 70% confidence classification is cognitively more honest than forced binary output.

🔗

Clustering Algorithms

Fuzzy c-means clustering explicitly implements graded membership. K-means' hard assignment violates prototype theory; fuzzy approaches align better with human categorization.

Implications for AI Evaluation Critical

If human categories are gradient, then binary accuracy metrics (correct/wrong) are a poor measure of AI classification quality. A system that classifies a penguin as "not bird" is wrong — but it's less wrong than classifying a robin as "not bird."

Better Evaluation Approaches

Typicality-weighted accuracy: weight errors by prototype distance. Calibration: measure whether model confidence reflects category gradient. Human-model correlation: compare model confidence gradients to human typicality ratings. These metrics better capture cognitive reality than binary accuracy.

🎯 Check Your Understanding

A text classifier must categorize "She runs a small catering business" as BUSINESS type. Prototype theory suggests which approach is most cognitively valid?

Module 6 · Lesson 6.1

Constructions & Language Models

Construction Grammar's insight that form and meaning are inseparable — and why this matters for transformers.

What is a Construction? Core Theory

Construction Grammar (Goldberg 1995, Kay & Fillmore 1999) proposes that grammar consists of constructions: form-meaning pairings at all levels, from morphemes to sentence patterns. Crucially, constructions contribute meaning that cannot be derived from the words alone.

The Ditransitive Construction

Pattern: [Subj V Obj1 Obj2] → meaning: TRANSFER

"She gave him a book." → standard use (GIVE is inherently a transfer verb)
"She sneezed him the napkin." → unusual, but the CONSTRUCTION forces a transfer reading
"She talked him into compliance." → completely non-physical, but construction = transfer of state

The TRANSFER meaning comes from the CONSTRUCTION, not the verb "sneeze" or "talk."

🔄

Ditransitive

[Subj V Obj Obj2] = CAUSED MOTION / TRANSFER. Forces a transfer reading regardless of the verb used.

💥

Resultative

[Subj V Obj Adj/PP] = CAUSE TO BECOME. "She painted the wall red" — the object reaches the result state.

🚀

Caused Motion

[Subj V Obj PP] = CAUSE TO MOVE TO. "She sneezed the napkin off the table" — motion reading imposed by construction.

🔥

Way Construction

"She elbowed her way through the crowd." — idiomatic construction with MANNER + PATH meaning. Not compositional.

Module 6 · Lesson 6.2

Syntax–Semantics in Transformers

Evidence that transformer models learn construction-like representations — and how to leverage this.

Constructions in Transformer Representations Research

A rich body of probing research has investigated what syntactic and semantic knowledge is encoded in transformer layers. Construction Grammar predicts that form-meaning pairings should be holistically represented — and this is largely what is found.

Early layers encode local syntactic patterns; middle layers encode construction-level form-meaning associations; upper layers encode discourse-level pragmatics
Attention heads specialize: some track subject-verb agreement (syntactic), others track semantic role relationships (construction-level)
Contextual embeddings for the same verb differ systematically across constructions — "gave" in ditransitive vs. simple transitive has measurably different representations
Models generalize construction patterns to novel verbs — "She blicked him the widget" receives a transfer interpretation, consistent with construction-level learning

Construction-Aware Prompt Engineering Applied

Understanding which constructions you activate in a prompt allows more precise control over AI output structure and meaning.

// Resultative construction → forces focus on end-state "Explain quantum computing [until it is simple enough for a child]." // → Resultative: [V until Adj] → process aimed at achieving a result state // Ditransitive → frames output as explicit transfer of knowledge "Teach me the key insights of quantum computing." // → [teach + me + Obj] → TRANSFER frame: teacher → learner → content // Way construction → frames output as navigating obstacles "Work your way through the complexities of quantum computing." // → [V way PP] → MANNER + PATH: implies obstacles to overcome

🔬

Key Research: Ettinger (2020) and Linzen et al. (2016) demonstrated that BERT-like models learn construction-level form-meaning associations, not just surface statistics. This validates Construction Grammar's prediction that grammar is a network of meaningful patterns, not a derivational rule system.

Module 7 · Lesson 7.1

Image Schemas & AI Reasoning

Pre-conceptual spatial patterns that structure all human thought — and their surprising presence in AI systems.

What are Image Schemas? Core Theory

Mark Johnson proposed that image schemas are pre-linguistic, prelinguistic patterns of sensorimotor experience that structure all higher-level cognition. They are abstract, skeletal patterns derived from repeated bodily interactions with the environment.

Image schemas are not mental images — they are dynamic, structured patterns of interaction that can be elaborated metaphorically to structure abstract domains.

📦

CONTAINER Schema

Interior, boundary, exterior. "In love," "out of scope," "within limits." All bounded regions, categories, and states use this schema.

🛤️

PATH Schema

Source, trajectory, goal. "Achieving a goal," "following a plan," "progress toward success." All goal-directed processes map onto this schema.

⚡

FORCE Schema

Agent, force, resistance. "Pushing an argument," "social pressure," "resistance to change." Causation and agency are structured by force dynamics.

⚖️

BALANCE Schema

Symmetry, equilibrium. "Balancing trade-offs," "equilibrium," "offsetting costs." All equilibrium reasoning uses the balance schema.

🔗

LINK Schema

Connected entities. "Relationship," "association," "bond," "connection." All relational concepts use linking schemas.

Image Schemas in AI Spatial Reasoning Applied

Image schemas are directly relevant to spatial AI, robotic planning, and visual reasoning:

Robotic navigation: PATH schema (source, trajectory, goal) maps directly onto motion planning algorithms — A* search is a computational PATH schema
Scene understanding: CONTAINER schema structures how vision models parse spatial relationships — objects are "in" or "on" or "outside" containers
Causal reasoning: FORCE schema structures how AI models represent cause-effect relationships in knowledge graphs
LLM spatial tasks: Probing studies show LLMs encode image-schematic spatial relationships — "above," "between," "through" activate schema-consistent representations

Module 7 · Lesson 7.2

Semantic Roles & Natural Language Understanding

Thematic roles, VerbNet, PropBank — and how semantic role labeling powers modern NLU.

Thematic Roles Core Theory

Every verb frames an event with participant roles. These thematic roles (also called theta roles or semantic roles) capture the relationship between participants and the event — independent of their syntactic position.

Role	Definition	Example
Agent	Intentional causer of event	"John broke the window."
Patient	Entity undergoing change	"John broke the window."
Theme	Entity moving or described	"She sent the package."
Experiencer	Entity mentally affected	"She feared the storm."
Goal	Endpoint of motion/transfer	"He gave it to her."
Source	Start point of motion/transfer	"She came from Paris."
Instrument	Means by which event occurs	"She cut it with scissors."
Beneficiary	Entity benefiting from event	"She baked a cake for him."

Semantic Role Labeling in NLU Applied

SRL systems automatically identify who did what to whom, where, when, and how. This provides structured semantic representations for downstream tasks.

"The company quietly laid off 500 workers last quarter." Predicate: laid_off Agent: "The company" → WHO acted Patient: "500 workers" → WHO was affected Manner: "quietly" → HOW it was done Time: "last quarter" → WHEN it happened // SRL output enables: // - QA: "Who did the company lay off?" → Patient // - Bias detection: Is the Agent always named when Patient is? // - Summarization: Extract key predicate-argument structures

Module 8 · Lesson 8.1

Coherence & Discourse Models

How texts cohere beyond the sentence level — and what AI needs to understand discourse structure.

Discourse Coherence Core Theory

Coherent discourse is more than a sequence of grammatical sentences. It requires that sentences stand in coherence relations to each other — logical, causal, temporal, and rhetorical connections that make the text hang together as a unified whole.

Hobbs (1979) and Mann & Thompson's Rhetorical Structure Theory (RST, 1988) formalized these relations. RST identifies 30+ coherence relations including ELABORATION, CONTRAST, CAUSE, EVIDENCE, CONCESSION, and BACKGROUND.

🔍

ELABORATION

A clause gives more detail about what was said in the previous clause. Most body paragraphs use this relation to expand on a topic sentence.

⚡

CAUSE / RESULT

"Sales dropped. Therefore, we cut budgets." Causal relations are fundamental to explanatory discourse and argumentative structure.

↔️

CONTRAST

"Model A is fast. But Model B is accurate." Sets up comparison that guides the reader's evaluation across the discourse.

🔗

EVIDENCE

A subsequent clause supports or justifies a preceding claim. Critical for argumentative discourse and scientific writing.

Discourse in LLMs Research

LLMs generate locally coherent text remarkably well but struggle with global discourse structure in long documents. Key failure modes:

Entity tracking errors over long spans — losing track of what pronoun refers to whom across many paragraphs
Inconsistent discourse relations — claiming CAUSE in one section and CONTRAST in another for the same relationship
Missing BACKGROUND information — assuming the reader knows context not yet established
Failures in RST-based summarization — missing the nucleus-satellite structure that identifies what is most vs. least important

Module 8 · Lesson 8.2

Grice's Maxims & Conversational AI

The cooperative principle and how pragmatic competence shapes the quality of human-AI dialogue.

The Cooperative Principle Core Theory

Paul Grice (1975) proposed that human communication is governed by a Cooperative Principle: "Make your conversational contribution such as is required, at the stage at which it occurs, by the accepted purpose or direction of the talk exchange." This breaks down into four maxims:

✅

Quality

Do not say what you believe to be false. Do not say that for which you lack adequate evidence. → AI hallucination is a massive Maxim of Quality violation.

📏

Quantity

Be as informative as required. Do not be more informative than required. → Verbose AI responses and over-hedged disclaimers violate Quantity.

🎯

Relation

Be relevant. → AI responses that provide tangentially related content instead of what was actually asked violate Relation (Relevance).

💎

Manner

Be perspicuous: avoid obscurity, avoid ambiguity, be brief, be orderly. → Convoluted AI prose and unnecessary hedging violates Manner.

Implicature in AI Applied

Grice's greatest insight: speakers routinely mean more than they say, and listeners infer this via the cooperative principle. Implicature is this additional meaning.

Implicature Example for AI

User: "Can you write this in simpler terms?"
Literal meaning: A yes/no question about capability.
Implied meaning: Please actually rewrite this in simpler terms.

A pragmatically competent AI (like current LLMs) correctly reads the implicature and rewrites, rather than answering "Yes, I can." Grice explains why this is rational and cooperative.

Conversational AI failures often reduce to violations of Gricean maxims or failures to compute implicature. CL-informed evaluation of AI dialogue should explicitly test for pragmatic competence using Gricean criteria.

🎯 Check Your Understanding

An AI assistant is asked "What time is the meeting?" and responds with a 3-paragraph explanation of how meetings are generally scheduled in organizations. Which Gricean maxim does this most violate?

Module 9 · Lesson 9.1

CL-Informed Prompt Design

Synthesizing the entire course into a systematic framework for cognitively-grounded prompt engineering.

The CL Prompt Engineering Framework Applied

Every effective prompt operates simultaneously at multiple CL levels. Expert prompt engineers — whether they know it or not — are applying cognitive-linguistic principles. This framework makes those principles explicit and systematic.

🗂

Frame Level

What frame does your prompt activate? "Review this code" activates an EVALUATION frame. "Debug this code" activates a PROBLEM-SOLVING frame. Choose deliberately.

🌀

Metaphor Level

What source domain are you invoking? Judicial metaphors (judge, verdict) → formal evaluation. Surgical metaphors (cut, remove) → precise deletion tasks.

🎭

Prototype Level

Specify the prototype when precision matters: "Write a formal email (think annual report language)" vs "Write a formal email (think polite request)."

🔨

Construction Level

Use constructions that presuppose the desired output structure. Resultative: "Explain X until it is clear to a 10-year-old." Constrains the process and end state.

💬

Pragmatic Level

Maximize Gricean cooperativity. Be as specific as required (Quantity), keep it relevant (Relation), be clear and ordered (Manner), ground claims in evidence (Quality).

Prompt Analysis: Before & After Applied

// BEFORE: Cognitively vague "Tell me about this business plan." // Frame: unclear (evaluation? summary? advice?) // Metaphor: none activated // Prototype: "tell me about" is maximally vague // Construction: simple declarative, no result state specified // Pragmatics: under-specified (violates Quantity) // AFTER: CL-informed "Act as a venture capitalist who has 10 minutes before a pitch meeting. Identify the three biggest structural weaknesses in this business plan, then suggest one concrete fix for each — until each fix is specific enough to implement this week." // Frame: VC EVALUATION + TIME PRESSURE // Metaphor: STRUCTURE IS ARCHITECTURE (weaknesses, structural) // Prototype: "VC with 10 min" = specific, actionable evaluator // Construction: Resultative (until specific enough) + LIST OF THREE // Pragmatics: Relation (focused), Quantity (3 items), Manner (implement this week)

Module 9 · Lesson 9.2

Bias, Ethics & Language in AI

How cognitive-linguistic structures encode and perpetuate social biases in AI systems.

Language Encodes Ideology Critical

CL has always been attentive to how language doesn't just describe reality but actively constructs it. The frames, metaphors, and prototypes embedded in language carry ideological content — and AI trained on human language inherits this content at scale.

⚖️

Metaphor & Framing Bias

Describing crime as a "beast" (vs. a "virus") in LLM training data shifts model recommendations toward enforcement (vs. prevention). Lakoff's framing research shows how metaphor choice constrains reasoning.

👤

Prototype Bias

If the prototype "doctor" in training data is male, models exhibit gender bias in medical contexts. Prototype biases are more insidious than lexical biases because they structure entire inference patterns.

🗂

Frame Asymmetry

Historical texts frame different social groups in systematically different frames (AGENT vs. PATIENT, ACTOR vs. OBJECT). LLMs trained on this data inherit asymmetric role assignments.

📏

Semantic Change Bias

Words change meaning over time — often reflecting social change. Training data that spans time periods without temporal weighting may mix outdated conceptual structures with current ones.

CL-Based Bias Detection Methods Applied

Metaphor auditing: Systematically identifying which source domains are used to describe which social groups — revealing conceptual hierarchies
Frame asymmetry analysis: Using SRL to measure whether members of different groups are disproportionately assigned Agent vs. Patient roles in generated text
Prototype probing: Testing model assumptions about "typical" members of social categories using cloze tests and embedding proximity
WEAT testing: Word Embedding Association Test (Caliskan et al. 2017) measures associations between concepts in embedding spaces — directly operationalizing prototype theory
Counter-framing: Deliberately prompting alternative frames to measure how easily LLMs shift conceptual structures — revealing how deeply biases are embedded

⚠️

Ethical Imperative: CL-informed bias detection is not just about fairness metrics — it reveals the conceptual structures through which AI systems model social reality. Fixing surface-level lexical bias without addressing deeper frame and metaphor biases leads to systems that are superficially fair but cognitively discriminatory.

Module 10 · Lesson 10.1

Multimodal Cognition & AI

When language meets vision, sound, and action — and how CL frameworks extend to multimodal AI.

Multimodal Meaning Construction Advanced

Human cognition is fundamentally multimodal — we understand the word "red" not as a dictionary entry but as the integrated product of visual experience, emotional associations, cultural frames, and linguistic co-occurrence. CL's embodiment thesis predicts that richer AI cognition requires multimodal grounding.

👁️

Vision-Language Models

CLIP, GPT-4V, Gemini Vision learn cross-modal associations. The CL prediction: shared image-schema representations should emerge at the intersection — "above," "through," "expanding" should have consistent cross-modal representations.

🎧

Audio-Language Models

Thermal, tactile, and auditory schemas should be better grounded in audio-language models. "Loud," "sharp," "resonant" gain acoustic referents beyond text statistics.

🤲

Embodied Agents

RT-2, PaLM-E, and other vision-language-action models begin to approach full embodied grounding. Force, motion, and contact schemas gain genuine sensorimotor reference.

🧩

Cross-Modal Coherence

CL's insight: the same image schemas underlie all modalities. A successful multimodal AI should show unified schema representations that are consistent across vision, language, and action.

Testing CL Predictions in Multimodal Models Research

Do vision-language models show consistent VERTICALITY metaphor activation across image (visual up/down) and language (positive/negative) modalities?
Can multimodal models solve CL's "conceptual conflict" problems — cases where visual and linguistic information activate conflicting frames?
Do multimodal models show more human-like prototype effects than text-only models — specifically for perceptually grounded categories like COLOR and SHAPE?
Do embodied AI agents develop action-grounded semantic representations that differ systematically from text-only representations in cognitively predicted ways?

Module 10 · Lesson 10.2 — Final

Future: AGI & Cognitive Depth

What would it take for AI to achieve genuine cognitive-linguistic competence? And what does CL tell us about the path forward?

The Cognitive Depth Problem Frontier

Current LLMs have impressive cognitive width — they can perform adequately across an enormous range of linguistic tasks. But they may lack cognitive depth — the rich, embodied, frame-structured, prototype-organized conceptual system that underlies human language understanding.

CL gives us the vocabulary to specify precisely what this depth consists of — and therefore what tests we would need to pass to claim genuine AI language understanding.

🧬

Embodied Grounding

Full AGI will require sensorimotor grounding of abstract concepts. The field is moving toward this with embodied AI agents — but bodily experience at human richness remains distant.

🌐

Cultural Frame Knowledge

Frames are not universal — they are culturally constructed. AGI will need not one universal frame system but a multilingual, multicultural network of frames — a Pangloss of conceptual worlds.

⚡

Dynamic Categorization

Human categories are constructed on the fly in context — "Is a briefcase a piece of furniture if someone is using it as a table?" AGI needs dynamic, context-sensitive prototype construction.

🔮

Metaphorical Creativity

Novel conceptual metaphor creation — genuinely new mappings between domains — is a hallmark of human intelligence. Testing AI for this goes far beyond metaphor recognition.

CL Research Agenda for AI — 2025–2035 Roadmap

Benchmark development: Create CL-theoretically motivated benchmarks for frame activation, prototype gradience, metaphor creativity, and image-schematic reasoning
Interpretability through CL: Use frame semantics and construction grammar as theoretical lenses for mechanistic interpretability research in transformers
Cross-cultural AI: Build multilingual, frame-aware systems that respect the cultural specificity of conceptual systems rather than defaulting to Western cognitive defaults
Bias through a CL lens: Move beyond lexical bias to frame-level and metaphor-level debiasing methods
Dialogue systems: Design conversational AI using Relevance Theory and Gricean pragmatics as explicit evaluation criteria
Multimodal schema learning: Test whether shared image-schema representations emerge in multimodal models trained on diverse sensory inputs

🎓

Course Complete! You've now covered the full landscape of Cognitive Linguistics for AI — from foundational theories to frontier research questions. The field is young, the questions are profound, and the tools CL provides are uniquely suited to making AI systems that are not just statistically impressive, but cognitively principled.

Key Takeaways

LLMs are implicit cognitive systems — understanding them requires cognitive-linguistic theory

Frame semantics and prototype theory are directly operationalizable in computational systems

Conceptual metaphors shape AI reasoning — and can be deliberately engineered in prompts

The grounding problem is the central challenge — multimodal and embodied AI are the path forward

AI bias is fundamentally cognitive-structural — not just lexical

CL provides the theoretical vocabulary for AI interpretability and evaluation

Module 1 · Lesson 1.4 — NEW

The Centaur Model: AI as a Cognitive Surrogate

A new class of foundation model trained not on text, but on millions of human decisions — and what it means for AI as a simulator of cognition.

Beyond Language: Cognitive Foundation Models Cutting Edge

In 2024, researchers published the "Centaur" model in Nature — a foundation model trained on data from millions of decisions collected from psychological experiments. Unlike LLMs trained on text, Centaur is trained on the structured outputs of human cognition: choices made under uncertainty, reaction times, error patterns, and behavioral tendencies across cognitive tasks.

This represents a fundamental shift: from AI that simulates language to AI that simulates the mind itself. The model can predict what a new human participant will do in an unseen cognitive task with remarkable accuracy — a true cognitive surrogate.

CENTAUR VS. STANDARD LLM — TRAINING DATA COMPARISON

Standard LLM

Web text, books, code

Learns: linguistic patterns, world knowledge, reasoning traces in text

Centaur Model

Psych experiment decisions

Learns: human decision patterns, cognitive biases, behavioral tendencies, error profiles

Key Concepts Theory

Cognitive foundation models: Pre-trained on behavioral data rather than linguistic data — or fine-tuned LLMs adapted to predict human cognitive outputs across diverse tasks
Behavior prediction: The ability to forecast what a specific type of person (defined by their prior decision profile) will do in a novel task — a computational theory of mind at scale
Fine-tuning on psychological datasets: Adapting base models using annotated cognitive task data: memory tasks, attention experiments, decision-making under risk, learning paradigms
Limitations of instruction understanding: Centaur highlights a critical gap — a model can simulate the outputs of cognition without understanding the instructions that framed the task from a human perspective

🧪

Clinical Applications

Cognitive surrogates could model how patients with specific cognitive profiles (ADHD, early dementia, dyslexia) will respond to interventions — before clinical trials are run.

🎯

UX & Interface Design

Predict how different user profiles will behave when navigating a new interface — replacing expensive A/B testing with cognitive simulation.

⚠️

Cognitive Privacy

If AI can simulate individual cognitive profiles, it raises profound privacy concerns: prediction of mental states, vulnerabilities, and decision-making under manipulation.

🔬

Theory Testing

Cognitive models can serve as computational implementations of psychological theories — testable, falsifiable, and scalable in ways lab experiments cannot be.

⚠️

CL Implication: If Centaur-style models can simulate human cognitive behavior, they implicitly capture the conceptual structures CL describes — frame activations, prototype effects, metaphor use — not as learned linguistic patterns but as behavioral tendencies. This is a new kind of empirical validation for CL theory.

🎯 Check Your Understanding

What fundamentally distinguishes a "Centaur" cognitive foundation model from a standard large language model?

Module 2 · Lesson 2.3 — NEW

Sensorimotor Grounding in Robotics

Where CL meets the physical world — how embodied AI systems ground abstract language in motor control and affordance.

The Physical Symbol System Hypothesis vs. Connectionism Foundational Debate

Newell and Simon's Physical Symbol System Hypothesis (1976) claimed that physical symbol manipulation is both necessary and sufficient for general intelligent action. This positioned intelligence as substrate-independent: any system that can manipulate symbols can be intelligent.

Connectionism — and later CL — pushed back: symbols without grounding are meaningless. Intelligence is not substrate-independent; it is rooted in the physical interaction of a body with an environment. Robotics has become the empirical battleground for this debate.

🖐️

Affordance Learning

Gibson's concept of affordances — what an object offers for action — is being implemented in embodied AI. A robot learns "graspable," "pushable," "stackable" through physical interaction, not definitional rules.

🗣️

Grounding Commands in Motor Control

RT-2, PaLM-E and similar models translate linguistic commands ("pick up the red block") into motor control sequences. Language is grounded in the robot's kinematic space.

🔄

Feedback Loops

Sensorimotor grounding is not one-way. Physical outcomes update semantic representations: "fragile" is grounded by experiencing breakage. The body teaches language.

🌍

Sim-to-Real Transfer

Models trained in simulation then deployed in the physical world face a "grounding gap." CL's emphasis on actual bodily experience predicts this challenge and suggests why pure simulation may never fully substitute.

CL Concepts in Robotic Language Grounding Applied

CL Concept	Robotics Implementation	Example System
Image Schema: CONTAINER	3D spatial bounding volumes for object placement	SpatialVLA, RT-2
Image Schema: FORCE	Haptic feedback learning for manipulation	Dexterous manipulation models
Image Schema: PATH	Motion planning from source to goal	A* / RRT + language grounding
Affordance (Gibson)	Object interaction possibility learning	AffordanceNet, Contact-GraspNet
Prototype Theory	Category generalization across object variants	Open-vocabulary detection models
Frame Semantics	Action schema libraries for task planning	SayCan, Code as Policies

Case Study: SayCan (Google DeepMind)

SayCan grounds language instructions in robot affordances: it combines an LLM (which ranks possible actions by linguistic plausibility) with a value function (which ranks actions by physical feasibility given the current scene). The result: "Can you bring me something to clean up a spill?" → semantically plausible AND physically executable actions are ranked highest. This is CL's embodied grounding operationalized in a real system.

💡

The Key Tension: CL predicts that full linguistic competence requires full embodied grounding. But current robot systems achieve impressive language-grounded behavior with only partial grounding. Does this suggest CL's embodiment thesis is too strong? Or that these systems are exploiting text-pre-trained linguistic scaffolding while remaining fundamentally ungrounded? This is one of the most active debates in embodied AI research.

Module 7 · Lesson 7.3 — NEW

Event Semantics & Temporal Reasoning

How AI represents time, duration, and the internal structure of events — and why temporal reasoning remains a persistent challenge.

Event Structure in Cognitive Linguistics Core Theory

Every verb in human language encodes not just an action but an event structure — the internal temporal shape of the event. This is aspectual framing: the same real-world event can be described as ongoing, completed, repeated, or instantaneous depending on how the speaker "frames" it.

Zeno Vendler (1957) proposed the foundational four-way classification of event types that underlies all subsequent event semantics:

⚡

Achievements

Instantaneous, telic. "She won the race." "He recognized her." No internal duration — they happen at a point. An achievement either happened or it didn't.

🏁

Accomplishments

Extended, telic. "She ran a marathon." "He built a house." Have duration AND a natural endpoint. Cancelable: "She ran a marathon but didn't finish" is coherent.

🌊

Activities

Extended, atelic. "She ran." "He swam." Have duration but no natural endpoint. Not cancelable: "She ran but didn't finish" makes no sense.

🏔️

States

Extended, atelic, non-dynamic. "She knows French." "He loves her." Stable conditions without inherent change. Resist progressive forms: "?She is knowing French."

Situation Models & Temporal Anchoring in LLMs Research

Humans build situation models when reading text — mental representations of the described events, including their temporal location and duration. LLMs must implicitly do something similar to answer temporal questions correctly.

Temporal anchoring failures: LLMs frequently confuse the temporal order of events described in non-chronological text — they struggle to maintain a coherent timeline separate from sentence order
Aspect blindness: Models often treat achievement and accomplishment sentences the same, failing to infer that "she arrived" (achievement) implies completion while "she was arriving" implies non-completion
Duration estimation: Without grounded bodily experience of time, LLMs show systematic biases in duration estimation — underestimating geological and overestimating social time scales
Temporal knowledge cutoff: The training cutoff imposes an artificial "event horizon" — events after the cutoff do not exist in the model's situation model of the world

// Aspectual framing changes inference — critical for AI QA systems "John was crossing the street when the car hit him." // Progressive aspect → event in progress, NOT completed // Correct inference: John may NOT have finished crossing // Common LLM error: assuming crossing was completed "John crossed the street." // Simple past → accomplishment completed // Correct inference: John DID finish crossing // Temporal reasoning benchmarks (TimeML, TempEval, TORQUE) // probe exactly these aspectual distinctions in LLM outputs

Event-Based Knowledge Representation Applied

Event semantics directly informs how AI systems structure knowledge graphs and temporal databases. The TimeML annotation scheme and EventCorefBank implement linguistic event structure in computational form:

Events are typed by Vendler class and linked to temporal intervals (TimeML TIMEX3)
Temporal relations (BEFORE, AFTER, DURING, SIMULTANEOUS) are explicitly annotated between event pairs
Aspect and modality markers determine whether events are actual, hypothetical, or negated
LLMs fine-tuned on TimeML data show significantly improved temporal reasoning, demonstrating the value of CL-informed annotation schemes

🎯 Check Your Understanding

A user asks an AI: "Did Maria finish the report?" The context is: "Maria was writing the report when the power went out." What is the correct inference, and why do LLMs often get this wrong?

Module 9 · Lesson 9.3 — NEW

Cognitive Homogenization & The Future of Thought

The "homogenizing effect" of LLMs on human expression — and why cognitive diversity may be the most important AI safety issue nobody is talking about.

The Homogenization Effect Critical Issue

A 2024 analysis in Trends in Cognitive Sciences identified a disturbing pattern: as millions of people rely on the same LLMs for writing, reasoning, and communication assistance, their outputs are converging toward the statistical center of the training distribution. The diversity of human expression — the conceptual edges, minority framings, and culturally specific reasoning patterns — is being smoothed away.

This is not a future concern. It is measurable now in text corpora, in standardized writing assistance platforms, and in the outputs of AI-augmented professional communication.

🌀

Epistemic Collapse

When AI systems trained on majority-language, WEIRD (Western, Educated, Industrialized, Rich, Democratic) data become the default reasoning assistant, minority epistemologies lose their computational support infrastructure.

📝

Linguistic Standardization

LLM writing assistance pushes users toward the statistical mean of "good writing." Dialects, registers, and stylistic signatures that deviate from the mean are flagged and corrected out of existence.

🧩

Loss of Minority Reasoning Strategies

Different cultures have different default problem-solving frames — collectivist vs. individualist, cyclical vs. linear temporal reasoning, relational vs. categorical logic. LLMs privilege the majority strategy.

🌐

Collective Intelligence Risk

Cognitive diversity is to collective intelligence what genetic diversity is to ecosystem resilience. Homogenization creates brittleness — when the dominant cognitive framework fails, no alternatives remain.

De-Westernizing AI: Cross-Cultural Cognitive Frames Applied

CL has long documented that different languages encode different conceptual worlds. The Sapir-Whorf hypothesis in its moderate form — that language shapes (but doesn't fully determine) cognition — is now well-supported. AI trained predominantly on English encodes English conceptual defaults.

Multilingual frame networks: Building frame semantic databases that are culturally parallel — not translations of English frames but indigenous conceptual structures. Projects like FrameNet Brasil and JapaneseFN demonstrate this approach
Cultural-specific metaphors: Mandarin TIME IS A VERTICAL AXIS (future is below, past is above) conflicts with English TIME IS HORIZONTAL. AI systems need to maintain cultural metaphor inventories, not just translate
Diverse training curation: Actively over-representing low-resource languages, oral traditions, and non-WEIRD cultural texts in pre-training corpora — not as a fairness gesture but as a cognitive diversity imperative
Cognitive pluralism by design: Building AI systems that can deliberately shift their default reasoning frame based on cultural context — a "Pangloss" of conceptual worlds rather than a single universal cognitive architecture

Measurement: How to Detect Homogenization

Researchers measure cognitive homogenization by: (1) tracking lexical diversity metrics (type-token ratio, vocabulary richness) in AI-assisted vs. unassisted writing over time; (2) comparing frame activation distributions in AI outputs across cultural contexts; (3) measuring conceptual metaphor convergence — are users from different cultures converging on the same source domains when writing with AI assistance? Early results suggest the answer is yes, and the convergence is toward English-language defaults.

⚠️

Urgent Research Priority: The window for preserving cognitive diversity is closing quickly. As AI writing assistance becomes ubiquitous in education and professional settings, the next generation of language users may develop their cognitive frameworks through AI — making the AI's default conceptual structures their own. CL researchers have a unique responsibility to document and protect cognitive diversity before this window closes.

Module 10 · Lesson 10.3 — NEW

Neuro-Symbolic Reasoning in LLMs

The resurgence of symbolic AI — and how hybridizing logic with neural networks addresses LLMs' deepest reasoning limitations.

Why Pure Neural Networks Fall Short Motivation

LLMs exhibit remarkable linguistic fluency but systematic failures in compositional, logical, and causal reasoning — exactly the kinds of structured inference that symbolic AI excels at. The neuro-symbolic paradigm attempts to get the best of both: neural networks' flexibility and grounding + symbolic systems' logical rigor and explicit structure.

NEURAL VS. SYMBOLIC VS. NEURO-SYMBOLIC

Neural Only

✓ Flexible, robust to noise

✓ Learns from data

✗ Brittle logical reasoning

✗ Opaque, hard to verify

Symbolic Only

✓ Logically sound

✓ Transparent, verifiable

✗ Brittle to noise

✗ Requires manual rules

Neuro-Symbolic

✓ Flexible + structured

✓ Data-driven + logical

✓ Partially verifiable

~ Still maturing

Architecture Approaches Technical

Logic-infused LLMs: Constrain LLM outputs using formal logic rules. The model generates candidate inferences; a symbolic verifier accepts or rejects them. Systems like Logic-LM use this approach for math and commonsense reasoning
Knowledge graphs for frame semantics: CL frames (FrameNet, VerbNet) can be represented as knowledge graphs that LLMs query at inference time — providing explicit relational structure that the neural component lacks internally. Wikidata, ConceptNet, and FrameNet are commonly integrated this way
Grounding symbolic rules in vector space: Neural Theorem Provers (NTPs) and Differentiable Inductive Logic Programming (DILP) learn logic-like rules from data but represent them as differentiable operations in vector space — enabling gradient-based learning of symbolic structure
Chain-of-Thought as soft symbolism: CoT prompting encourages LLMs to produce explicit reasoning steps — a lightweight neuro-symbolic approach where the "symbolic" component is natural language reasoning traces rather than formal logic

🗺️

Frame + Graph Integration

Linking LLM outputs to FrameNet knowledge graphs provides structured frame element verification — checking if generated text correctly instantiates the expected frame roles.

🔢

Mathematical Reasoning

Neuro-symbolic systems like AlphaGeometry (DeepMind) combine neural intuition with formal geometric theorem proving — achieving olympiad-level performance neither component achieves alone.

⚖️

Legal & Medical AI

High-stakes domains require verifiable reasoning chains. Neuro-symbolic approaches allow LLMs to generate candidate reasoning while symbolic components verify logical consistency with domain rules.

🔬

CL Connection: CL's frame semantics and construction grammar are already symbolic structures — formal representations of meaning that are directly implementable as knowledge graph schemas. Neuro-symbolic AI that integrates FrameNet and VerbNet is, in a very real sense, implementing CL theory computationally. The hybrid architecture CL has always described (embodied grounding + structured knowledge) maps onto neuro-symbolic architectures (neural grounding + symbolic structure).

Module 10 · Lesson 10.4 — NEW · Final

CL-Based Mechanistic Interpretability

Using cognitive linguistics as a theoretical lens to open the black box — mapping construction grammar and frame semantics onto transformer weights.

The Interpretability Crisis Motivation

Mechanistic interpretability is one of the central challenges of modern AI safety: understanding why a model produces a given output in terms of its internal computations. Current approaches often lack theoretical grounding — they identify circuits and features empirically without a cognitive theory to predict what should be found.

Cognitive Linguistics offers exactly this: a theoretically grounded, empirically validated map of how human linguistic knowledge is organized. CL structures are testable hypotheses about what should exist inside a well-trained language model.

🔌

Construction Neurons

If transformers learn Construction Grammar, specific attention heads or MLP neurons should preferentially activate for specific constructions. "Construction neurons" are units that reliably fire for the ditransitive, resultative, or way-construction patterns regardless of lexical content.

🗺️

Frame Activation Mapping

Probing classifiers trained on FrameNet labels can identify which model layers encode which frame elements. Frame activation maps reveal where in the computational graph COMMERCE_BUY frames become distinct from COMMERCE_SELL frames.

📏

CL-Based Benchmarks

CL theory generates precise, falsifiable predictions about model behavior: prototype gradience, metaphor consistency, frame inheritance, aspectual inference. These predictions become evaluation benchmarks — moving beyond accuracy to cognitive validity.

🧲

Semantic Role Circuits

Attention patterns for subject-verb-object should differ systematically from agent-verb-patient assignments when passive voice shifts syntactic roles without changing semantic roles. CL-informed probing can isolate these circuits.

Methodology: CL-Informed Probing Technical

// CL-Informed Probing Pipeline Step 1: Generate CL-hypothesis // Hypothesis: Layer 8–12 of BERT encodes frame element roles // Prediction: AGENT and PATIENT representations differ in a learnable direction Step 2: Build probe dataset // FrameNet sentences annotated with frame elements // "Apple [Buyer] bought [COMMERCE_BUY] the startup [Goods]" // Extract hidden states at element positions Step 3: Train linear probe probe = LinearClassifier(hidden_states → frame_element_label) // High probe accuracy = model encodes the distinction // Low accuracy = distinction not represented at this layer Step 4: Causal intervention // Activation patching: swap AGENT vector with PATIENT vector // Does the model's output change as CL predicts? // → Tests whether the representation is causally active, not just correlated

Current Findings & Open Questions Research Frontier

Layer specialization: Consistent with CL predictions, early transformer layers encode phonological/morphological patterns, middle layers encode syntactic constructions, and upper layers encode discourse-pragmatic structures
Prototype geometry: Category centroids in embedding space do exhibit prototype structure — members cluster around central prototypes with graded distance, validating prototype theory in computational form
Metaphor circuits: Preliminary evidence suggests metaphorical source-target mappings are encoded in specific attention head clusters — "construction neurons" for metaphor are being actively researched
Frame element binding: Attention heads that track subject-verb-object also show sensitivity to semantic role distinctions, but only partially — suggesting frame elements are distributed across multiple circuits
Open question: Do models that are better aligned with CL structures (as measured by CL-based benchmarks) also perform better on downstream tasks? Establishing this link would validate the CL interpretability program empirically

🎓 Course Complete — Updated Edition

You have completed all 28 lessons across 10 modules — including the 6 new lessons added from the expert review. The field is moving fast. The concepts you now hold give you both the theoretical vocabulary and the empirical tools to navigate it.

Centaur models show behavioral CL is now empirically testable at scale

Robotic grounding is the physical test of embodied cognition theory

Event semantics reveals temporal reasoning as a persistent AI gap

Cognitive homogenization is the most urgent socio-cognitive AI risk

Neuro-symbolic AI implements CL's hybrid vision computationally

CL gives interpretability research its theoretical backbone

Module 11 · Lesson 11.1 — NEW MODULE

CMT as an LLM Prompting Paradigm

Research-backed techniques for using Conceptual Metaphor Theory to dramatically improve LLM reasoning accuracy, coherence, and creative depth — with live comparisons across ChatGPT, Claude, and Gemini.

From Theory to Prompting Practice Research-Backed

A landmark 2024 study by Kramer demonstrated that CMT-based prompting significantly enhances LLM reasoning accuracy, clarity, and metaphorical coherence across a range of complex tasks. This lesson operationalizes that finding into a concrete, replicable prompting framework applicable to any frontier model.

The core insight: LLMs have already internalized conceptual metaphor structures from training data. CMT prompting doesn't teach them new knowledge — it activates the cognitive structures they already have in a deliberate, structured way.

CMT-INSPIRED CHAIN-OF-THOUGHT (CoT) STRUCTURE

Step 1
Identify Source Domain
+ its properties

→

Step 2
Identify Target Domain
(abstract concept)

→

Step 3
Create Inference
via mapping

→

Output
Coherent, grounded
response

The Three CMT-CoT Examples (Kramer, 2024) Applied Research

These are the canonical examples from the research demonstrating how CMT CoT structures reasoning:

Example 1

TIME IS MONEY

Source: Money — a valuable, finite resource that can be spent, saved, wasted, or invested

Target: Time — an abstract flow

Inference: Time is valuable, should be spent wisely, can be wasted or invested — guiding time management reasoning

Example 2

HEART OF STONE

Source: Stone — hard, cold, unfeeling, impervious to impact

Target: A person's emotional character

Inference: The person is emotionally unresponsive, unsympathetic — maps stone's physical properties onto personality

Example 3

WORLD IS A STAGE

Source: Stage — a platform for scripted performances, roles, audiences

Target: Human life and daily activity

Inference: Life involves roles to perform — daily activities are performances, implying social scripting and audience awareness

LLM Configuration for CMT Prompting Practical

The research specifies optimal configurations for activating CMT reasoning in frontier models:

// OPTIMAL CMT SYSTEM MESSAGE (Kramer, 2024) System: "You are a cognitive agent utilizing Conceptual Metaphor Theory (CMT). When interpreting abstract concepts, always map them to concrete physical experiences. Structure your reasoning as follows: 1. Identify the Source Domain and its concrete properties 2. Identify the Target Domain (the abstract concept) 3. Perform the Inference: explain the abstract via the concrete mapping Maintain metaphorical consistency throughout your response." // RECOMMENDED TEMPERATURE temperature: 0.7 // Balanced creativity + coherence // Too low (0.1–0.3): literal, loses metaphorical richness // Too high (0.9–1.0): incoherent, loses systematic mapping // FEW-SHOT PRIMING ALTERNATIVE User (shot 1): "Explain procrastination using CMT." Assistant: "Source: A heavy burden. Target: Unfinished tasks. Inference: Procrastination accumulates weight — the longer you delay, the heavier the burden becomes, until movement is nearly impossible."

ChatGPT vs. Claude vs. Gemini — CMT Performance Profile LLM Comparison

Model	CMT Strengths	CMT Weaknesses	Best Use
ChatGPT (GPT-4o)	Superior syntactic explanation of metaphors; consistent source-target mapping; strong few-shot CMT adoption	Can become formulaic with repeated CMT prompting; less creative in novel metaphor generation	Technical CMT-CoT reasoning tasks; step-by-step metaphor analysis
Claude (Sonnet/Opus)	Actively plans coherent structures; manifests "thinking" process aligned with cognitive schemas; strong metaphor consistency across long outputs	May over-elaborate inferences; sometimes adds meta-commentary about the metaphor rather than using it	Long-form CMT-structured writing; complex domain mapping; nuanced emotional metaphors
Gemini (Pro/Ultra)	Academic precision in metaphor identification; complex grammatical constructions; strong cross-lingual CMT	Tends toward academic lexis — less accessible; sometimes prioritizes technical accuracy over metaphorical richness	Cross-cultural metaphor analysis; academic and research writing; multilingual CMT tasks
DeepSeek	Strong mathematical/logical metaphor domains; efficient token use in CMT prompts	Less training on creative metaphorical corpora; weaker emotional domain mapping	Technical/scientific domains where CMT is applied to formal concepts

📊

Research Finding (Kramer, 2024): Models preconditioned with CMT system messages showed measurable improvements in reasoning accuracy, clarity of explanation, and metaphorical coherence versus standard prompting baselines. The effect was consistent across GPT-4, Claude, and Gemini — suggesting CMT activates cross-model cognitive structures, not model-specific quirks.

🎯 Check Your Understanding

According to the Kramer (2024) research, what temperature setting is recommended for CMT-based LLM prompting, and why?

Module 11 · Lesson 11.2 — NEW MODULE

Frame Semantics & Knowledge Representation in Modern LLMs

How ChatGPT, Claude, and Gemini encode, identify, and leverage semantic frames — from latent implicit knowledge to In-Context Learning for structured extraction.

Latent Frame Knowledge in LLMs Research Finding

Recent studies (2024) confirm that major LLMs — ChatGPT, Claude, and Gemini — encode latent knowledge of Frame Semantics without explicit training on FrameNet. This knowledge is implicit in the statistical patterns of their training corpora: because humans consistently use frame-evoking language in consistent ways, the models learn frame structures as a byproduct of language modeling.

This means every frontier LLM is already a partial frame semanticist — and the question for AI engineers is how to activate and direct this latent knowledge rather than how to install it from scratch.

🔎

Frame Identification

Modern LLMs can identify semantic frames within text with high accuracy. Given "John bought the startup," they correctly identify the COMMERCE_BUY frame and its elements — Buyer: John, Goods: startup — without explicit instruction.

🧲

In-Context Learning (ICL)

LLMs can be prompted with a few FrameNet-style examples and will then extract frame-semantic arguments from new sentences with high accuracy — often outperforming traditional statistical parsers trained specifically for this task.

🔄

Backward Reasoning

Modern models use frame structures to "think backward" — given a desired output state (a frame's goal element), they can plan token sequences that consistently fill the frame toward that outcome. Planning is frame navigation.

🌐

Relational Reasoning

Frames provide the relational backbone for coherent generation. When asked to generate a story about a transaction, models activate Commerce frames and maintain consistent role assignments across the entire narrative.

ICL Frame Extraction — Practical Protocol Applied

The most powerful practical application of frame semantics in modern LLMs is using In-Context Learning to turn any frontier model into a high-accuracy semantic parser:

// IN-CONTEXT LEARNING: Frame Semantic Extraction Protocol System: "You are a semantic frame analyst. For each sentence, identify: 1. The TARGET WORD (frame-evoking element) 2. The FRAME it evokes (use FrameNet names where possible) 3. The FRAME ELEMENTS present (with their text spans) 4. Any NULL-INSTANTIATED elements (implied but absent)" // Few-shot examples (2-3 shots sufficient for high accuracy) Shot 1 Input: "Apple acquired the startup for $2B." Shot 1 Output: Target Word: "acquired" Frame: COMMERCE_BUY Buyer: "Apple" Goods: "the startup" Money: "$2B" Seller: [NULL — inferrable from context] // New input — model performs accurately without further training New Input: "She donated her library to the university." // LLM output: GIVING frame — Donor: She, Theme: library, // Recipient: university, [Purpose: null-instantiated]

Frame Semantics Across the Three Major LLMs LLM Comparison

Capability	ChatGPT	Claude	Gemini
Frame Identification Accuracy	High — strong on common frames; weaker on specialized domain frames	High — particularly strong on discourse-level frame consistency across long texts	High — strong cross-lingual frame identification; strong on technical frames
Null-Instantiation Detection	Moderate — identifies implied roles inconsistently	Strong — actively infers implied participants; consistent with "thinking" process	Moderate — better when prompted explicitly to consider missing elements
ICL Frame Extraction	Excellent — adopts FrameNet format rapidly with 2–3 shots	Excellent — maintains format across many examples; adds useful uncertainty flags	Good — slightly more formal output format; strong for structured data pipelines
Relational Reasoning via Frames	Strong — uses frames for coherent narrative generation	Very strong — plans coherent multi-frame narratives; explicit schema activation	Strong — particularly good at multi-step frame chains in academic contexts

💡

Key Practical Insight: You do not need to fine-tune LLMs on FrameNet data to get frame-semantic extraction. ICL with 2–3 well-formatted shots activates the latent frame knowledge that already exists in the model. This makes FrameNet-quality semantic parsing accessible to any team with API access — no training pipeline required.

Module 11 · Lesson 11.3 — NEW MODULE

Construction Grammar & Syntactic-Semantic Integration in Transformers

How frontier LLMs demonstrate Construction Grammar principles in practice — with a detailed comparison of how ChatGPT, Claude, and Gemini differ in constructional understanding.

CxG in Transformer Behavior — What the Research Shows Research

Researchers have found that frontier LLMs demonstrate a strong connection between syntactic form and semantic meaning that closely aligns with Construction Grammar's core thesis. Rather than treating syntax as an independent module, these models appear to have learned form-meaning pairings holistically — exactly as CxG predicts.

The evidence comes from three converging sources: grammaticality judgment tasks, constructional meaning identification, and syntactic explanation ability — where models must articulate why a construction means what it means.

The CxG Diagnostic Tasks Methodology

Constructional meaning identification: Given "She sneezed the napkin off the table," can the model identify that the CAUSED-MOTION meaning comes from the construction, not the verb "sneeze"?
Grammatical judgment: Is "She talked him into compliance" grammatical? If yes, which construction licenses it? Models must identify the CAUSED-CHANGE-OF-STATE construction
Constructional slot analysis: What fills the [Subj V Obj Adj] resultative slot and what meaning emerges? "She painted the wall red" vs. "She painted the wall quickly" — only the first is a true resultative
Cross-constructional inference: Does "He baked her a cake" (ditransitive) imply transfer? What about "He baked a cake for her" (prepositional dative)? Models should detect the subtle pragmatic difference CxG predicts

🔷

ChatGPT on CxG

Superior syntactic explanation — can clearly articulate why a given construction produces its meaning. Strongest on grammatical judgment tasks. Identifies "construction slots" explicitly when prompted. Best for pedagogical CxG explanation tasks.

🔶

Claude on CxG

Actively plans coherent structures that respect constructional constraints. Manifests a "thinking" process aligned with cognitive schemas — less likely to violate construction-level meaning even in long outputs. Best for CxG-constrained creative writing and generation tasks.

🔹

Gemini on CxG

Tends toward academic lexis with complex grammatical constructions. Strong cross-lingual CxG awareness. Performs best on formal linguistic analysis tasks. Particularly good at identifying CxG structures in non-English texts.

Implications for LLM Design — Leveraging CxG Applied

Construction Grammar's insights can be directly leveraged to improve LLM output quality in three ways:

// TECHNIQUE 1: Construction-Constrained Generation // Force a specific construction to shape the output structure "Rewrite this explanation using only resultative constructions ([Subject V Object to Adj-state]) to emphasize outcomes over processes." // TECHNIQUE 2: Constructional Diagnosis Prompting // Use CxG to probe model understanding of unusual sentences "Analyze: 'The professor lectured the class into a stupor.' Identify: (a) which construction licenses this, (b) what meaning the construction contributes that the verb alone doesn't, (c) which frame elements are present." // TECHNIQUE 3: Cross-Construction Contrastive Prompting // Exploit constructional meaning differences for precise communication "Compare: 'She gave him the keys' vs 'She gave the keys to him.' In what contexts would each be more natural? What subtle differences in perspective or presupposition does each carry?" // → Activates CxG's insight that ditransitive vs. prepositional dative // encode different speaker perspectives on the transfer event

🎯 Applied Challenge

A user prompts: "She laughed him out of the room." According to Construction Grammar — which the research confirms LLMs have internalized — where does the CAUSED-MOTION meaning of this sentence come from?

Module 11 · Lesson 11.4 — FINAL LESSON

Cognitive Engineering & Ethical AI

Designing AI systems with human-like cognitive architectures — and the ethical responsibilities that come with building machines that think like us.

What is Cognitive Engineering? Emerging Field

Cognitive Engineering is the discipline of designing AI systems that not only process information efficiently but do so through architectures that are explicitly informed by human cognitive structure. Rather than treating LLMs as black-box function approximators, cognitive engineering asks: how should we design, train, evaluate, and deploy AI so that its internal representations and reasoning processes map onto what we know about human cognition from CL and cognitive science?

This is the synthesis of everything this course has covered — it is CL applied not just as analysis but as design philosophy.

🏗️

CL-Informed Architecture

Design choices guided by CL: training on usage-based corpora (usage-based model), attention mechanisms that respect frame-element boundaries, output constraints that honor construction-level semantics.

🔍

CL-Informed Evaluation

Moving beyond perplexity and BLEU: benchmarks that test prototype gradience, metaphor consistency, frame inference, aspectual reasoning, and construction-level meaning — the dimensions CL theory tells us matter for genuine understanding.

🛡️

CL-Informed Alignment

Using frame semantics and prototype theory to detect and measure misalignment: does the model's conceptual structure match human conceptual structure? Divergence is a measurable alignment failure.

⚖️

CL-Informed Ethics

Frame asymmetry, prototype bias, metaphor ideology — CL's tools for detecting the conceptual structures embedded in language directly serve AI ethics auditing at a level deeper than surface-level content filtering.

Neuro-Symbolic Integration for Explainable AI Technical

The most practically important frontier in cognitive engineering is building AI systems whose reasoning is explainable through CL structures. Rather than post-hoc attribution methods that highlight input tokens, CL-informed explainability maps AI decisions onto human-interpretable conceptual structures:

Frame-level explanation: "This recommendation was made because the COMMERCE_BUY frame was activated with you as Buyer and Product X as Goods" — more interpretable than attention weight visualizations
Metaphor-level explanation: "This risk assessment used the JOURNEY metaphor: you are currently at a crossroads with two paths of different risk profiles" — grounds AI reasoning in human cognitive structures
Construction-level explanation: "This instruction was parsed as a CAUSED-CHANGE-OF-STATE construction, interpreting X as the intended result state" — enables disambiguation of ambiguous instructions
Prototype-level explanation: "This classification has 0.7 confidence because the input has 70% overlap with the category prototype, not 100%" — communicates uncertainty in human-interpretable terms

The Role of CL in Achieving AGI Frontier

The final question: is CL-informed AI a path toward Artificial General Intelligence — or a richer form of narrow AI that is better aligned with human cognition?

The CL-AGI Thesis

AGI, if it means AI that understands language the way humans do, requires the cognitive structures CL describes: embodied grounding, frame-organized knowledge, prototype-structured categories, metaphor-based abstract reasoning, and pragmatic competence. A system that achieves all of these at human level would, by definition, have human-level language understanding — and language understanding may be the clearest path to general intelligence. CL is not just a tool for building better NLP systems. It is a map of what general intelligence looks like.

🎓

Course Complete — Final Edition. You have now completed all 32 lessons across 11 modules. From the foundational principles of CL to the cutting edge of CMT prompting research, cognitive engineering, and LLM comparison — this course gives you both the theory and the tools to work at the intersection of human cognition and artificial intelligence.

Module 11 — Core Takeaways

The industry-ready bridge between CL theory and modern AI practice.

CMT-CoT prompting at temperature 0.7 measurably improves LLM reasoning across all major models

All frontier LLMs encode latent frame knowledge — ICL activates it without fine-tuning

Claude plans via schemas; ChatGPT explains constructions; Gemini excels cross-lingually

Cognitive engineering is the design philosophy that unifies all CL applications in AI

CL-informed explainability is more human-interpretable than attention attribution

CL is not just a tool — it is a map of what AGI must achieve