
LLMs Think in Geometry, Not Language

🦙Read original on Reddit r/LocalLLaMA

💡Breakthrough: LLMs think in a universal geometry shared across languages and models; interactive visualization included

⚡ 30-Second TL;DR

What Changed

LLMs process concepts geometrically in their mid-layers, where language-specific information largely vanishes

Why It Matters

Challenges the assumed link between language and thought in LLMs, and reveals convergent internal representations across models from different organizations. Enables better interpretability tools for practitioners probing model internals.

What To Do Next

Explore the interactive PCA widget at dnhkng.github.io/posts/sapir-whorf/ to visualize LLM concept spaces.

Who should care: Researchers & Academics

🧠 Deep Insight

AI-generated analysis for this event.

🔑 Enhanced Key Takeaways

  • The research builds upon the 'Representation Engineering' (RepE) framework, which posits that internal model states can be manipulated as geometric vectors to steer model behavior without retraining.
  • Cross-lingual alignment is achieved through a phenomenon known as 'semantic isomorphism,' where the model's high-dimensional manifold preserves relative distances between concepts regardless of the linguistic tokenization path.
  • The findings suggest that 'concept-based' interpretability tools can be developed to perform model editing by directly modifying these geometric coordinates, effectively bypassing the need for natural language prompts.
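The RepE-style manipulation described above can be sketched in a few lines. This is an illustrative toy (not the authors' code): a "concept direction" is estimated as the mean difference between hidden states from contrastive prompts, then added to an activation at inference time to steer behavior without retraining. The arrays here are random stand-ins for real model activations.

```python
import numpy as np

def concept_direction(pos_states: np.ndarray, neg_states: np.ndarray) -> np.ndarray:
    """Unit vector from negative-prompt activations toward positive ones."""
    direction = pos_states.mean(axis=0) - neg_states.mean(axis=0)
    return direction / np.linalg.norm(direction)

def steer(hidden: np.ndarray, direction: np.ndarray, alpha: float = 4.0) -> np.ndarray:
    """Shift a hidden state along the concept direction (no retraining)."""
    return hidden + alpha * direction

# Stand-in activation sets from contrastive prompt pairs (hypothetical dims).
rng = np.random.default_rng(0)
pos = rng.normal(0.5, 1.0, size=(32, 64))
neg = rng.normal(-0.5, 1.0, size=(32, 64))

d = concept_direction(pos, neg)
h = steer(rng.normal(size=64), d)
print(h.shape)
```

The steering coefficient `alpha` and the layer at which the shift is applied are hyperparameters in practice; both names here are illustrative, not from the original post.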

🛠️ Technical Deep Dive

  • The study utilizes Principal Component Analysis (PCA) and t-SNE to project high-dimensional hidden states (typically from layers 15-25 in 30B+ parameter models) into a 3D manifold.
  • The convergence is measured using Procrustes analysis, which aligns the vector spaces of different languages to demonstrate that the geometric 'shape' of a concept remains invariant.
  • The implementation involves extracting activations from the residual stream at specific mid-layer indices, which are then normalized to account for variance in token frequency across different languages.
  • The research demonstrates that even with obfuscated variable names in Python code, the model maps the underlying logic to the same geometric region as the corresponding mathematical LaTeX representation.
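The Procrustes comparison mentioned above can be illustrated with a small sketch (assumptions: random stand-in data, `scipy.spatial.procrustes` as the alignment routine; this is not the study's actual pipeline). Rows represent concepts, columns are already-reduced activation dimensions; the second "language" is simulated as a rotated and scaled copy of the first, the scenario in which the manifold's shape is preserved.

```python
import numpy as np
from scipy.spatial import procrustes

rng = np.random.default_rng(1)
concepts_en = rng.normal(size=(20, 3))  # 20 concepts in 3 reduced dims

# Simulate a second language as a rotated + scaled copy plus small noise.
theta = np.pi / 5
rot = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                [np.sin(theta),  np.cos(theta), 0.0],
                [0.0,            0.0,           1.0]])
concepts_de = 2.0 * concepts_en @ rot.T + rng.normal(scale=0.01, size=(20, 3))

# Procrustes factors out translation, scaling, and rotation; the residual
# disparity measures how much the two shapes truly differ.
_, _, disparity = procrustes(concepts_en, concepts_de)
print(f"disparity: {disparity:.4f}")
```

A disparity near zero indicates the concept geometry is invariant up to a rigid transform, which is the claim the Procrustes step is used to test.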

🔮 Future Implications

AI analysis grounded in cited sources.

Language-agnostic model steering will become the standard for safety alignment.
If concepts are stored geometrically, safety filters can be applied at the vector level to block harmful concepts regardless of the language used to express them.
Translation models will shift from token-to-token mapping to vector-space projection.
Directly mapping the geometric coordinates of a concept from one language's manifold to another's eliminates the need for intermediate natural language processing.
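A vector-level safety filter of the kind implied above could, in principle, compare activations against a precomputed "blocked concept" direction rather than matching strings in any particular language. The following is a hypothetical sketch with random stand-in vectors; the threshold and all names are illustrative assumptions.

```python
import numpy as np

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def blocked(hidden: np.ndarray, harmful_dir: np.ndarray,
            threshold: float = 0.6) -> bool:
    """True if the activation points strongly toward the blocked concept."""
    return cosine(hidden, harmful_dir) > threshold

rng = np.random.default_rng(2)
harmful_dir = rng.normal(size=128)                       # blocked concept axis
benign_state = rng.normal(size=128)                      # unrelated activation
harmful_state = harmful_dir + 0.1 * rng.normal(size=128) # near the blocked axis

print(blocked(benign_state, harmful_dir))
print(blocked(harmful_state, harmful_dir))
```

Because the check operates in activation space, the same filter would fire whether the concept was expressed in English, German, or code, which is the language-agnostic property the implication describes.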

Timeline

2024-05
Initial publication of 'Representation Engineering' framework introducing the concept of steering vectors.
2025-02
Release of Revised LLM Neuroanatomy I, establishing the baseline for cross-lingual concept mapping.
2025-11
Revised LLM Neuroanatomy II expands testing to include code-to-text semantic convergence.
2026-03
Integration of Gemma-4 and Qwen3.6 architectures into the geometric analysis framework.


AI-curated news aggregator. All content rights belong to original publishers.
Original source: Reddit r/LocalLLaMA