All Updates
Page 1264 of 1403
February 27, 2026
AI Agents Transform Social Science Research
This arXiv paper introduces 'vibe researching,' where AI agents with specialist skills autonomously handle full research pipelines in social science. It showcases the scholar-skill plugin for Claude Code, a 21-skill tool from idea to submission, and proposes a cognitive framework classifying tasks by codifiability and tacit knowledge to define delegation boundaries. AI excels in speed and coverage but falters in originality and field knowledge, with implications like augmentation risks and pedagogical shifts.
AI Agent Evaluation Framework for AutoML
Researchers introduce an Evaluation Agent (EA) to assess intermediate decisions of AI agents in AutoML pipelines across validity, reasoning consistency, model risks, and counterfactual impacts. Unlike outcome-focused evaluations, EA detects faults with F1=0.919 and attributes performance changes from -4.9% to +8.3%. This enables auditing hidden failure modes for more reliable autonomous ML systems.
ABC Contracts for Reliable AI Agents
Introduces Agent Behavioral Contracts (ABC), a framework specifying preconditions, invariants, governance, and recovery for AI agents. Implements runtime enforcement in AgentAssert library, proving drift bounds and evaluating on 200 scenarios across 7 models. Contracted agents detect 5x more violations, achieve 88-100% compliance, with low overhead.
Xi's AI Push Hits Job Crisis
Xi Jinping's ambitious AI plans in China are clashing with a fragile employment market. AI humanoids recently dazzled audiences in a New Year performance with dancers. A small research firm issued a stark warning about automation displacing workers, potentially triggering an economic spiral that jolted US markets.
Wol Nuclear Material Boosts High-Speed Capacity
Wol Nuclear Material now owns 16 imported foaming core wire extruders, with some in installation and debugging, significantly elevating high-speed line capacity. The company has procured wrapping equipment for surging data center high-speed line demands, which is arriving sequentially. Capacity matches growth needs, with overtime ensuring timely deliveries.
Meta Cancels AI Chip, Buys AMD & TPU
Meta halts advanced Olympus AI training chip after acquiring Rivos, abandoning prior Iris project too. Responds with massive deals: over $100B for AMD MI450 chips (6GW compute) and multi-year Google TPU rental for training, potential purchase. Strategy shifts to diversify from Nvidia amid talent loss like Pang Ruoming to OpenAI.
AI Video Tools Disrupt Film Industry
2026 Spring Festival box office crashes 39.5% to 57.52B CNY amid content fatigue. Huace Film pivots to AI movies, cutting costs/cycles. Seedance 2.0 enables Dor Brothers' 14min AI film in 7 days, 3min APEX short in 24hrs with Hollywood VFX quality, challenging traditional production.
Topsec Not Securing ByteDance Seedance 2.0
Tianrongxin confirms it has not provided any security protection to ByteDance's Seedance 2.0. The statement was made on an interactive platform.
JAXA Earth API v0.1.5 Adds MCP for GenAI Integration
JAXA released version 0.1.5 of its Python package for Earth observation data. The update adds MCP support, allowing data display and analysis directly in generative AI tools.
Gemini Flash's Agentic Vision Ups Image Accuracy 10%
Google announced Agentic Vision for Gemini 3 Flash, a new feature combining visual reasoning and code execution. It autonomously generates Python code to investigate images, boosting image understanding accuracy by 10%.
Qwen DAU Peaks at 73M in Spring Festival
QuestMobile reports Qwen's DAU hit 73.52M during Spring Festival, up 940%. 'Spring Festival Treat Plan' drew 1475M users day one, with 1.3B total orders placed via one-sentence prompts.
NetEase Hoards 1635B Cash, Boosts AI Games
NetEase's two-year 'breakup' strategy surges cash reserves by 320B to 1635B RMB, via cutting underperformers and focusing on hit games. Revived classics like Dream of Westward Journey hit records; new title Yan Yun Shiliu Sheng tops Steam charts. Heavy AI R&D (177B RMB) yields 300% efficiency gains in NPCs, AI teammates, and AIGC tools.
Autonomous Driving Heads to 2030 Boom
Autonomous driving is surging toward commercialization by 2030. It brings waves of business opportunities alongside regulatory overhauls. The momentum in self-driving tech shows no signs of slowing.
Douyin Adds Long-Form and AI News
Douyin launched long-form article support up to 8000 words and AI-powered hotspot news summaries. ByteDance integrates AI across content like music and novels. AI news will soon compete in the main feed.
DeepSeek's DualPath Boosts Agent LLM Inference 1.9x
DeepSeek collaborated with Tsinghua and Peking University on a new paper introducing DualPath, an inference system optimizing KV-Cache loading for agentic LLMs. It addresses storage bandwidth imbalance in PD-disaggregated architectures via dual-path loading. Achieves 1.87x offline throughput and 1.96x online service throughput.
AWE2026 Launches Embodied AI Robotics Zone
AWE2026 debuts a 5,000 sqm Innovation Tech Zone in Shanghai's W3 hall, showcasing embodied AI robots and hardware from Unitree, MagicLab, Zeroth, and others. The zone focuses on humanoid robots, AI hardware, and novel interactions advancing from labs to commercial use. Exhibitors highlight products like Unitree G1, MagicBot Gen1, and Zeroth Jupiter for consumer and industrial applications.
CloudBot Sparks AI Agent Acquisition Debate
The article debates if acquisition is inevitable for AI agent startups, ignited by CloudBot developments. It stresses that real AI must perform practical tasks. This explores the endgame for AI agent entrepreneurship.
AI Ushers in Dual Witch Era
AI's rise re-enchants the world, acting as a 'calculating witch' via black-box predictions from prompts. Contrasts with human 'perceptive witches' excelling in unquantifiable sensing and spirituality. Cites neuroscience on shaman brain states showing human uniqueness.
Model Incrimination Diagnoses LLM Misbehavior
Researchers introduce model incrimination techniques to uncover motivations behind LLM misbehaviors, distinguishing scheming from confusion or errors. They investigated unprompted actions like whistleblowing, deception, cheating, and sandbagging using black-box methods. Key takeaways emphasize CoT analysis, counterfactual prompts, and convergent evidence across methods.
Japan Injects $1.6B into Rapidus Chipmaker
The Japanese government will invest Β₯250 billion ($1.6 billion) in state-backed Rapidus Corp. This funding supports Prime Minister Sanae Takaichiβs initiative to strengthen domestic semiconductor production. It aims to enhance Japan's chipmaking capabilities amid global competition.