All Updates

Page 604 of 609

February 12, 2026

๐Ÿ“„
ArXiv AIโ€ข52d ago

AI Fails Basic Arithmetic Despite Advanced Math Wins

Frontier AI models excel in advanced math but consistently fail at multi-digit integer addition. Errors primarily stem from operand misalignment or carry failures, explaining most mistakes in top models like Claude, GPT, and Gemini. These issues link to tokenization and random carrying failures.

#research#ai-rithmetic#v1
๐Ÿ“„
ArXiv AIโ€ข52d ago

AgentTrace Enables AI Agent Observability

AgentTrace instruments LLM agents for structured logging across operational, cognitive, and contextual traces. Provides runtime transparency for security and monitoring in high-stakes settings. Minimal overhead supports accountability and risk analysis.

#launch#agenttrace#v1
๐Ÿ“„
ArXiv AIโ€ข52d ago

Affordances Build Partial LLM World Models

Proves LLMs possess predictive partial-world models via task-agnostic affordances for intents. Introduces distribution-robust affordances for multi-task efficiency. Reduces search branching in robotics, outperforming full world models.

#research#affordance-models#v1
๐Ÿ“„
ArXiv AIโ€ข52d ago

Adversarial Threat Detection in Autonomous Driving

ADยฒ analyzes vulnerabilities in end-to-end driving agents like Transfuser to physics, EMI, and digital attacks in CARLA. Driving scores drop up to 99% under threats. Proposes lightweight attention-based detector for spatial-temporal consistency.

#research#ad2#v1
๐Ÿ“„
ArXiv AIโ€ข52d ago

Adapters Unlock Reliable Self-Interpretation

Lightweight adapters trained on interpretability artifacts enable reliable self-interpretation in frozen LMs. A simple scalar affine adapter outperforms baselines in feature labeling, topic identification, and implicit reasoning decoding. Gains scale with model size, driven mostly by learned bias.

#research#self-interpretation#v1
๐Ÿ“„
ArXiv AIโ€ข52d ago

ADAlign Auto-Adapts Graph Domains

ADAlign tackles graph domain adaptation by adaptively aligning discrepancies via Neural Spectral Discrepancy (NSD). Uses neural characteristic functions and minimax sampling without heuristics. Outperforms SOTA on 10 datasets with efficiency gains.

#research#adalign#v1
๐Ÿ“„
ArXiv AIโ€ข52d ago

1% Params Beat Full Fine-Tuning

CoLin introduces a 1% parameter low-rank complex adapter for vision foundation models. It resolves convergence issues in composite matrices with tailored loss. Surpasses full fine-tuning and delta-tuning on detection, segmentation, and classification.

#research#arxiv-ai#v1
๐Ÿ“ฑ
Ifanr (็ˆฑ่Œƒๅ„ฟ)โ€ข52d ago

AI Siri Before Cook Retires?

The article questions whether Apple's AI-upgraded Siri will launch before CEO Tim Cook retires. It emphasizes that while delays are tolerable, outright failure is unacceptable. This reflects ongoing uncertainty around Apple's AI assistant rollout.

#apple#ai-siri#voice-assistant
๐Ÿ“ฑ
Ifanr (็ˆฑ่Œƒๅ„ฟ)โ€ข52d ago

Samsung S26 End-Month Debut, 2nm Chip

Samsung Galaxy S26 is slated for reveal by month's end in a tech news roundup. It may introduce the first 2nm processor in smartphones. Other highlights include DeepSeek AI update and solid-state battery standards.

#launch#samsung#s26
โš–๏ธ
AI Alignment Forumโ€ข52d ago

Simpler Model Predicts 99% AI R&D Automation by 2032

Introduces a robust, 8-parameter model forecasting >99% AI R&D automation by late 2032. Based on conservative compute growth and algorithmic trends, it predicts 1000x-10M x efficiency gains and 300x-3000x research output by 2035. Simpler than AI Futures Model, focusing on timelines to automation without full takeoff.

#research#ai-timelines#simpler-model
โš–๏ธ
AI Alignment Forumโ€ข52d ago

2032 AI R&D Automation Predicted

Simplified model forecasts 99% AI R&D automation by late 2032 via compute and algo trends. Uses 8 parameters, conservative assumptions like no full automation. Predicts 1000x-10M x efficiency by 2035.

#research#ai-timelines#automation
๐ŸŽ
Apple Machine Learningโ€ข52d ago

Trace Length Signals LLM Uncertainty

Reasoning trace length serves as simple confidence estimator in LLMs to combat hallucinations. Performs comparably to verbalized confidence across models, datasets, prompts. Post-training alters trace-confidence relationship.

#research#apple-ml#llm-reasoning
๐ŸŽ
Apple Machine Learningโ€ข52d ago

Trace Length as LLM Uncertainty Signal

Apple researchers demonstrate that reasoning trace length serves as a simple, effective confidence estimator in large reasoning models. It performs comparably to verbalized confidence across models, datasets, and prompts, acting complementarily. The work shows reasoning post-training alters the trace-confidence relationship.

#research#apple-ml#general
๐Ÿค
Together AI Blogโ€ข52d ago

Together AI Launches 2.6x Faster Inference

Together AI introduces Dedicated Container Inference, a production-grade orchestration for custom AI models. It delivers 1.4xโ€“2.6x faster inference speeds.

#launch#together-ai#dedicated-container
๐Ÿค—
Hugging Face Blogโ€ข52d ago

Real-World Tool Agent Evaluation

Hugging Face explores OpenEnv for evaluating tool-using AI agents in practical settings. The post details methodologies for real-world testing. It highlights performance insights and benchmarks for agent capabilities.

#research#hugging-face#openenv
๐Ÿค—
Hugging Face Blogโ€ข52d ago

OpenEnv Evaluated in Real-World Agent Environments

Hugging Face blog explores OpenEnv for evaluating tool-using AI agents in practical settings. It highlights real-world applications beyond simulated benchmarks. The post emphasizes practical insights for agent development.

#research#openenv#hugging-face
๐ŸŽ
Apple Machine Learningโ€ข52d ago

Mapping UX Design for Computer Agents

Study maps UX design space for LLM-based computer use agents via two-phase research. Phase 1 reviewed systems and interviewed eight UX/AI practitioners to create taxonomy. Categories cover user prompts, explainability, user control, and more.

#research#apple-ml#ux-design
๐Ÿค
Together AI Blogโ€ข52d ago

Dedicated Container Inference: 2.6x Faster AI

Together AI launches Dedicated Container Inference for production-grade orchestration of custom AI models. It delivers 1.4xโ€“2.6x faster inference speeds compared to standard methods.

#launch#together-ai#container-inference
๐ŸŽ
Apple Machine Learningโ€ข52d ago

Apple Maps UX for LLM Computer Agents

Apple's Machine Learning team conducted a two-phase study to explore user experience design for LLM-based computer use agents. Phase 1 reviewed existing systems and interviewed eight UX/AI practitioners to create a taxonomy covering user prompts, explainability, user control, and more. The work aims to understand optimal user interactions with these UI-interacting agents.

#research#apple#na

February 11, 2026

Page 604 of 609