All Updates
February 28, 2026
LLMs Map AI Trends in LCA
This arXiv paper uses large language models (LLMs) to review how AI is being integrated into life cycle assessment (LCA) and to uncover trends and themes in that literature. It highlights dramatic growth in AI adoption, especially of LLMs, and correlations between AI approaches and LCA stages. The study introduces an LLM-based text-mining framework for scalable literature reviews.
Fixing Rater Bias in AI Evals with IRT
This paper integrates psychometric rater models into AI evaluations to correct systematic errors from human raters. It employs Item Response Theory, particularly the multi-faceted Rasch model, to disentangle true output quality from rater effects like severity and centrality. Applied to OpenAI's summarization dataset, it delivers adjusted quality scores and rater diagnostics for more reliable AI assessments.
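For reference, the many-facet Rasch model is commonly written in the rating-scale form below. This is the textbook formulation, not necessarily the paper's exact parameterization, and the symbol names are assumptions chosen here for illustration: \theta_n is the latent quality of output n, \delta_i the difficulty of criterion i, \alpha_j the severity of rater j, and \tau_k the threshold between rating categories k-1 and k.

```latex
\[
\log \frac{P_{nijk}}{P_{nij(k-1)}} = \theta_n - \delta_i - \alpha_j - \tau_k
\]
```

Here P_{nijk} is the probability that rater j assigns category k to output n on criterion i. A more severe rater (larger \alpha_j) lowers the odds of every higher category, which is how the model separates rater effects from output quality.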
CourtGuard: Zero-Shot LLM Safety Framework
CourtGuard is a retrieval-augmented multi-agent framework that treats LLM safety as an evidentiary debate using external policy documents. It achieves state-of-the-art results on 7 safety benchmarks without fine-tuning, outperforming policy-following baselines. It excels in zero-shot adaptability (90% accuracy on Wikipedia Vandalism) and automated curation of 9 adversarial datasets.
ConstraintBench: LLM Optimization Benchmark
ConstraintBench is a benchmark that tests whether LLMs can solve constrained optimization problems directly, without solvers, across 10 operations research domains. Evaluations of six frontier models on 200 tasks show feasibility as the key bottleneck: the best model achieves 65% constraint satisfaction but low joint optimality. The benchmark and evaluation tools will be publicly released.
Cognitive Models Template LLM Agent Design
This arXiv position paper argues that cognitive models and AI algorithms offer blueprints for modular language agents that combine multiple LLMs. It formalizes 'agent templates' that define LLM roles and how they are composed. It also surveys existing agents to highlight their cognitive and AI inspirations, with the aim of more interpretable designs.
AHCE Boosts LLM Agents with Human Expertise
The AHCE framework augments LLM agents in specialized domains by requesting structured expert reasoning via a learned Human Feedback Module (HFM). Experiments in Minecraft show success-rate gains of 32% on normal tasks and nearly 70% on hard tasks with minimal human input. It advances beyond basic help requests to treat humans as interactive tools.
Agentic AI Optimizes Cell-free O-RAN
This arXiv paper proposes an agentic AI framework using LLM-based agents for intent-driven optimization in cell-free O-RAN. A supervisor translates intents into objectives and rate requirements, while specialized agents handle weighting, O-RU management via DRL, and monitoring. PEFT enables LLM sharing, reducing active O-RUs by 41.93% and memory by 92%.
Schelling Goodness for Moral Coordination
This piece introduces 'Schelling goodness' as what diverse intelligent agents from successful civilizations would converge on in hypothetical moral coordination games. It emphasizes that this is not a direct moral claim but a prediction of agreement based on common knowledge and survival pressures. It uses thought experiments with binary choices to explore implications for AI alignment.
5 God-Level Nano Banana 2 AI Image Tips
Ifanr shares 5 advanced 'god-level' usage tips for Nano Banana 2, calling it the ultimate AI image generator. The post includes prompts and urges readers to bookmark it. It also features playful promotional copy, such as raising lobsters while eating bananas.
Alibaba Open-Sources CoPaw Desktop Agent
Alibaba has open-sourced CoPaw, a desktop Agent tool for one-click local or cloud deployment. It supports secondary development for custom models, skills, and messaging apps. Built-in skills enable social content summarization, news queries, and desktop organization.
China's ADAS Blue Lights Face Highway Drama
Blue lights signaling intelligent driving (NOA/NGP) proliferate on Chinese highways, with 2025 city NOA penetration at 11.6% and Huawei/XPeng logging massive usage. Other drivers avoid, follow, or provoke them, while systems exhibit conservative, aggressive, or quirky behaviors like honking or sign misreads. Car owners report mixed love-hate experiences amid rising adoption.
Nvidia Custom Chip for OpenAI Inference
Nvidia plans a new processor tailored for OpenAI and others to enable faster AI inference. It integrates Groq-designed chips and debuts at next month's GTC conference. OpenAI will be a top customer despite seeking alternatives.
QingTianZu Robot Leasing: Opportunity or Trap?
The article debates whether QingTianZu's robot leasing is a once-in-20-years wealth opportunity or a calculated harvest of investors. It concludes that robot leasing merits attention but is not yet ripe for blind optimism: it is not a scam, but profits are far from guaranteed.
One-Click OpenClaw for Windows
A one-click deployment solution brings OpenClaw to Windows users. It integrates with Feishu and is designed so that beginners can set it up and use it easily.
Micro Diffusion: 150-Line Text Diffusion
This is a minimal discrete text diffusion implementation in pure Python, inspired by MicroGPT. It comes in three versions: a 143-line NumPy core, a visualized NumPy version, and a PyTorch version with a Transformer denoiser. It trains on CPU in minutes on a dataset of 32K SSA names.
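The summary does not say which noising scheme the project uses. As a rough illustration of what the forward (corruption) half of discrete text diffusion can look like, here is a minimal sketch that assumes absorbing-state masking on characters. The names `MASK`, `corrupt`, and `noise_schedule` are hypothetical and not taken from the project.

```python
import random

MASK = "_"  # hypothetical mask symbol; assumed not to occur in the names


def corrupt(text, t, rng):
    """Replace each character with MASK independently with probability t.

    t = 0.0 leaves the text untouched; t = 1.0 masks every character.
    """
    return "".join(MASK if rng.random() < t else ch for ch in text)


def noise_schedule(num_steps):
    """Linear schedule of masking probabilities, ending at 1.0 (assumption)."""
    return [(step + 1) / num_steps for step in range(num_steps)]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    for t in noise_schedule(4):
        print(f"t={t:.2f}  {corrupt('olivia', t, rng)}")
```

A denoiser would then be trained to predict the original characters at the masked positions, and sampling would start from a fully masked string and unmask it step by step.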
Workday Founder Returns Amid AI Struggles
Workday's founder is returning to address the company's AI transformation challenges. The stock plunged 10%, fueling market skepticism. The software sector grapples with converting AI hype to revenue growth.
Meta's $60B Chip Buy Cracks Nvidia Dominance
Meta invests $60 billion to acquire backup chips, securing three major types. Google's TPU enters the commercial arena as Meta shifts allegiance. AI compute power shifts to a brutal multi-vendor 'three kingdoms' competition.
Nvidia Profits Boom, Stock Lags on AI Bets
Nvidia posts massive earnings, but its stock fails to rise. The analysis questions CEO Jensen Huang's AI 'gold mine' strategy and its shift from gold rush to alchemy.
AI's Next Step: Intelligent Agents
Intelligent agents represent a pivotal evolution in AI development. They may not be the final destination but are likely the crucial first step for AI integration into the real world.
OpenAI Inks Pentagon AI Deployment Deal
OpenAI has reached an agreement with the US Department of Defense to deploy its AI models on the department's classified network. This partnership enables military use of OpenAI's advanced models. The deal highlights growing AI integration in national security.