All Updates
April 15, 2026
Self-Monitoring Needs Structural Integration
Metacognition-style self-monitoring modules provide no benefit when trained only as auxiliary losses in continuous-time, multi-timescale RL agents across various predator-prey environments. Structurally integrating their outputs into decision pathways yields marginal improvements in non-stationary settings but still does not outperform baselines. Key lesson: self-monitoring must influence decisions directly, not run in parallel.
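The wiring distinction in this entry can be sketched with toy linear layers (all shapes and weights below are illustrative stand-ins, not the paper's architecture): in the parallel setup the monitor's output never enters the action computation, while structural integration concatenates it into the policy's input.

```python
import numpy as np

rng = np.random.default_rng(1)
obs = rng.normal(size=8)                      # toy observation
W_policy = rng.normal(size=(4, 8))            # plain policy head
W_monitor = rng.normal(size=(2, 8))           # self-monitoring head
W_integrated = rng.normal(size=(4, 8 + 2))    # policy that also sees the monitor

# Parallel wiring: the monitor's output never reaches the policy;
# it can only contribute an auxiliary training loss elsewhere.
monitor_out = W_monitor @ obs
action_parallel = W_policy @ obs

# Structural integration: the monitor's output is concatenated into the
# policy's input, so it can influence the chosen action directly.
action_integrated = W_integrated @ np.concatenate([obs, monitor_out])
```

Under the entry's finding, only the second wiring lets self-monitoring pay off, because its signal actually reaches the decision.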
Science Stuck in Local Minima Like ML
Scientific knowledge settles into local optima due to path dependence, cognitive biases, and institutional lock-in. Drawing an analogy to gradient descent in ML, the argument is that science follows tractable local gradients rather than global truths; the piece proposes meta-scientific interventions to escape these traps.
Memory Worth: Agent Memory Governance
This arXiv paper proposes Memory Worth (MW), a lightweight two-counter metric per memory that tracks co-occurrence with task successes versus failures in agent systems. It converges theoretically to the conditional success probability and shows strong empirical correlation (ρ=0.89) with true utilities. The approach enables staleness detection, retrieval suppression, and deprecation with minimal overhead.
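A minimal sketch of how such a two-counter metric could work (the class name, Laplace smoothing, and update rule here are illustrative assumptions, not the paper's exact formulation):

```python
from dataclasses import dataclass

@dataclass
class MemoryWorth:
    """Two counters per memory: co-occurrences with successes and failures."""
    successes: int = 0
    failures: int = 0

    def update(self, retrieved: bool, task_succeeded: bool) -> None:
        # Only memories actually retrieved during the task are updated.
        if not retrieved:
            return
        if task_succeeded:
            self.successes += 1
        else:
            self.failures += 1

    @property
    def worth(self) -> float:
        # Laplace-smoothed estimate of P(success | memory retrieved),
        # which the counts converge to as observations accumulate.
        return (self.successes + 1) / (self.successes + self.failures + 2)

mw = MemoryWorth()
for outcome in [True, True, False, True]:
    mw.update(retrieved=True, task_succeeded=outcome)
print(round(mw.worth, 2))  # 0.67
```

Low-worth memories could then be suppressed at retrieval time or deprecated outright, which is what keeps the overhead minimal: two integers per memory.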
Memory as Metabolism for Companion LLMs
This arXiv paper proposes 'Memory as Metabolism,' a design for companion knowledge systems that mirrors user knowledge while compensating for epistemic failures like entrenchment. It outlines five core operations (TRIAGE, DECAY, CONTEXTUALIZE, CONSOLIDATE, AUDIT), backed by memory gravity and minority-hypothesis retention. The framework addresses governance for single-user LLM wikis in a 2026 landscape of emerging agent memory systems.
Identity as Attractor in LLM Activation Space
Large language models exhibit attractor-like dynamics in which semantically related prompts map to similar representations. An experiment on Llama 3.1 8B shows that agent identity documents (cognitive_core) cause paraphrases to cluster more tightly than controls in hidden-state space. The effect replicates on Gemma 2 9B, with evidence that it is semantic: reading agent descriptions shifts hidden states toward the attractor.
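The tighter-clustering claim can be illustrated with a toy measurement (random vectors stand in for real hidden states; `mean_pairwise_cosine` is a hypothetical helper, not code from the experiment):

```python
import numpy as np

def mean_pairwise_cosine(vectors: np.ndarray) -> float:
    """Average cosine similarity over all distinct pairs of row vectors."""
    normed = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    sims = normed @ normed.T
    n = len(vectors)
    # Exclude the diagonal (self-similarity) from the average.
    return (sims.sum() - n) / (n * (n - 1))

rng = np.random.default_rng(0)
center = rng.normal(size=64)                          # toy "attractor"
paraphrases = center + 0.1 * rng.normal(size=(8, 64)) # tight cluster
controls = rng.normal(size=(8, 64))                   # unrelated prompts

assert mean_pairwise_cosine(paraphrases) > mean_pairwise_cosine(controls)
```

Applied to actual hidden states, a higher mean pairwise similarity for identity-document paraphrases than for matched controls is the kind of evidence the entry describes.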
Human-Like Selective Memory for Social Robots
This arXiv paper introduces a human-inspired context-selective multimodal memory architecture for social robots, capturing textual and visual episodic memories based on emotional salience or novelty. It outperforms human consistency in selective storage (Spearman ρ=0.506) and boosts multimodal retrieval Recall@1 by 13%. The system enables personalized, natural human-robot interactions with real-time performance.
HORIZON Diagnoses LLM Agent Long-Horizon Failures
Introduces the HORIZON benchmark to systematically diagnose long-horizon failures in LLM agents across domains. Evaluates SOTA models like GPT-5 variants and Claude on 3100+ trajectories, revealing degradation patterns. Releases a leaderboard and an LLM-as-a-Judge pipeline validated against human annotations (κ=0.84).
GoodPoint: LLM Constructive Paper Feedback
GoodPoint curates a 19K ICLR paper dataset annotated with reviewer feedback using author responses, defining effectiveness via validity and author action. It introduces a training recipe with fine-tuning and preference optimization, boosting Qwen3-8B's success rate by 83.7% and achieving SOTA among similar LLMs. Expert human studies confirm higher practical value for authors.
Framework for Longitudinal Health AI Agents
Researchers propose a multi-layer framework and agent architecture for AI supporting longitudinal health tasks like symptom management and patient support. It operationalizes adaptation, coherence, continuity, and agency across repeated interactions. Use cases show sustained engagement, goal adaptation, and safe personalized decision-making.
ArcDeck: Narrative Paper-to-Slide AI
ArcDeck is a multi-agent framework that generates slides from academic papers by modeling logical flow via discourse trees and global commitment documents. Specialized agents iteratively refine outlines before rendering visuals. It introduces ArcBench, a new benchmark showing improved narrative coherence.
Nvidia AI Models Boost Quantum Stocks
Nvidia unveiled new open-source AI models designed to accelerate quantum computing progress. This announcement triggered a surge in Asian software and IT stocks focused on quantum computing.
Alibaba, ByteDance Target Zhipu, MiniMax Pricing
Alibaba and ByteDance are aggressively pursuing Zhipu AI and MiniMax in a competitive landscape. The core issue is who controls AI token pricing: demand for tokens looks unbounded, but the logic for pricing them remains constrained.
E-Waste Memory Outvalues Gold in 100 Days
In 2026, rural China is seeing a rush of e-waste collectors targeting scrapped machines. In a 100-day market frenzy, salvaged memory chips have come to be valued more highly than gold, underscoring ongoing hardware scarcity.
Didi AV Speeds Global Push with UAE Pilot This Year
Didi Autonomous Driving is accelerating its global expansion with a pilot launch in the UAE this year, emphasizing responsible innovation via local partnerships. The aim is to deploy Chinese AV tech and services worldwide.
Curity's Runtime Auth for AI Agents
Curity launches Access Intelligence, extending its IAM platform for securing autonomous AI agents via runtime authorization. Uses OAuth tokens with purpose/intent data for ephemeral access. Addresses gaps in traditional IAM for non-deterministic agent actions.
DGX Spark Setup for vLLM Local Inference
A user unboxes NVIDIA DGX Spark for on-premises LLM inference using vLLM, PyTorch, and Hugging Face models in an education app. They seek advice on optimal models, vLLM tuning for unified memory, and real-world throughput. This marks a shift from cloud GPUs to local setups.
OpenAI Leaks Anthropic Revenue Inflation Mockery
OpenAI sent an internal letter mocking Anthropic's Claude annualized revenue as inflated by 80 billion. The letter, since leaked publicly, accuses Anthropic of padding its income figures, escalating the rivalry between the two AI giants.
llama.cpp Hot Expert Cache Speeds MoE 27%
llama.cpp introduces a dynamic expert cache that loads frequently activated MoE experts into VRAM, boosting Qwen3.5-122B-A10B token generation by 27% over layer offload (22.67 tok/s on an RTX 4090). It also outperforms all-CPU inference by 45% with similar VRAM use. A code repo is provided for testing.
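A rough sketch of the frequency-based caching idea (illustrative only: llama.cpp's actual implementation is in C++ and differs in detail; the class and sets here are stand-ins for VRAM-resident expert weights):

```python
from collections import Counter

class HotExpertCache:
    """Keep the K most frequently activated MoE experts in fast memory
    (a VRAM stand-in); all other experts stay in slow memory (CPU RAM)."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.counts = Counter()   # activation counts per expert id
        self.in_vram = set()      # experts currently cached

    def activate(self, expert_id: int) -> bool:
        """Record one activation; return True on a cache hit."""
        self.counts[expert_id] += 1
        hit = expert_id in self.in_vram
        # Re-rank: cache the top-`capacity` experts by activation count.
        self.in_vram = {e for e, _ in self.counts.most_common(self.capacity)}
        return hit

cache = HotExpertCache(capacity=2)
trace = [0, 0, 1, 0, 2, 0, 1]  # expert 0 is "hot" in this routing trace
hits = sum(cache.activate(e) for e in trace)
print(hits)
```

Because MoE routing is typically skewed toward a small set of hot experts, even a small VRAM cache captures most activations, which is the intuition behind the reported speedup over plain layer offload.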
MS Image Model Cuts Price 41% Again
Microsoft slashed the price of its self-developed image model by another 41%. CEO Satya Nadella is reframing AI model competition around gross margins, and Mustafa Suleyman's compute-cost reductions intensify the pressure on OpenAI.
Gartner: AI Mainframe Bubble to Pop
Gartner predicts that 70% of AI-powered mainframe migration projects will fail and that 75% of vendors in the space will disappear. Mainframe users relying on AI for legacy code migration face a high risk of disappointment.