All Updates

Page 522 of 673

February 28, 2026

๐Ÿ“„
ArXiv AIโ€ข42d ago

LLMs Map AI Trends in LCA

This arXiv paper reviews AI integration in life cycle assessment (LCA) using large language models (LLMs) to uncover trends and themes. It highlights dramatic growth in AI adoption, especially LLMs, and correlations with LCA stages. The study introduces an LLM-based text-mining framework for scalable literature reviews.

#ai-sustainability#text-mining
๐Ÿ“„
ArXiv AIโ€ข42d ago

Fixing Rater Bias in AI Evals with IRT

This paper integrates psychometric rater models into AI evaluations to correct systematic errors from human raters. It employs Item Response Theory, particularly the multi-faceted Rasch model, to disentangle true output quality from rater effects like severity and centrality. Applied to OpenAI's summarization dataset, it delivers adjusted quality scores and rater diagnostics for more reliable AI assessments.

#rater-effects#item-response-theory#human-evaluation
๐Ÿ“„
ArXiv AIโ€ข42d ago

CourtGuard: Zero-Shot LLM Safety Framework

CourtGuard is a retrieval-augmented multi-agent framework that treats LLM safety as an evidentiary debate using external policy documents. It achieves state-of-the-art results on 7 safety benchmarks without fine-tuning, outperforming policy-following baselines. It excels in zero-shot adaptability (90% accuracy on Wikipedia Vandalism) and automated curation of 9 adversarial datasets.

#llm-safety#zero-shot#multi-agent
๐Ÿ“„
ArXiv AIโ€ข42d ago

ConstraintBench: LLM Optimization Benchmark

ConstraintBench introduces a benchmark for LLMs to solve constrained optimization problems directly without solvers across 10 operations research domains. Evaluations of six frontier models on 200 tasks show feasibility as the key bottleneck, with the best achieving 65% constraint satisfaction but low joint optimality. The benchmark and evaluation tools will be publicly released.

#benchmark#llm-reasoning#operations-research
๐Ÿ“„
ArXiv AIโ€ข42d ago

Cognitive Models Template LLM Agent Design

This arXiv position paper argues cognitive models and AI algorithms offer blueprints for modular language agents combining multiple LLMs. It formalizes 'agent templates' defining LLM roles and compositions. Surveys existing agents to highlight their cognitive/AI inspirations for interpretable designs.

#language-agents#cognitive-models#agent-templates
๐Ÿ“„
ArXiv AIโ€ข42d ago

AHCE Boosts LLM Agents with Human Expertise

AHCE framework augments LLM agents in specialized domains by requesting structured expert reasoning via a learned Human Feedback Module (HFM). Experiments in Minecraft show 32% success rate gains on normal tasks and nearly 70% on hard tasks with minimal human input. It advances beyond basic help requests to treat humans as interactive tools.

#agent-augmentation#expert-reasoning
๐Ÿ“„
ArXiv AIโ€ข42d ago

Agentic AI Optimizes Cell-free O-RAN

This arXiv paper proposes an agentic AI framework using LLM-based agents for intent-driven optimization in cell-free O-RAN. A supervisor translates intents into objectives and rate requirements, while specialized agents handle weighting, O-RU management via DRL, and monitoring. PEFT enables LLM sharing, reducing active O-RUs by 41.93% and memory by 92%.

#agentic-ai#cell-free-oran#energy-optimization
โš–๏ธ
AI Alignment Forumโ€ข42d ago

Schelling Goodness for Moral Coordination

Introduces 'Schelling goodness' as what diverse intelligent agents from successful civilizations would converge on in hypothetical moral coordination games. Emphasizes it's not a direct moral claim but a prediction of agreement using common knowledge and survival pressures. Uses thought experiments with binary choices to explore AI alignment implications.

#schelling-goodness#moral-coordination#ai-alignment
๐Ÿ“ฑ
Ifanr (็ˆฑ่Œƒๅ„ฟ)โ€ข42d ago

5 God-Level Nano Banana 2 AI Image Tips

Ifanr shares 5 advanced 'god-level' usage tips for Nano Banana 2, calling it the ultimate AI image generator. The post includes prompts and urges readers to bookmark it. Features playful promo like raising lobsters while eating bananas.

#ai-art#prompts#tutorial
๐Ÿ”ฅ
36ๆฐชโ€ข42d ago

Alibaba Open-Sources CoPaw Desktop Agent

Alibaba has open-sourced CoPaw, a desktop Agent tool for one-click local or cloud deployment. It supports secondary development for custom models, skills, and messaging apps. Built-in skills enable social content summarization, news queries, and desktop organization.

#agent#desktop-ai#skill
๐Ÿฏ
่™Žๅ—…โ€ข42d ago

China's ADAS Blue Lights Face Highway Drama

Blue lights signaling intelligent driving (NOA/NGP) proliferate on Chinese highways, with 2025 city NOA penetration at 11.6% and Huawei/XPeng logging massive usage. Other drivers avoid, follow, or provoke them, while systems exhibit conservative, aggressive, or quirky behaviors like honking or sign misreads. Car owners report mixed love-hate experiences amid rising adoption.

#adas#autonomous-driving#china-adoption
๐Ÿ”ฅ
36ๆฐชโ€ข42d ago

Nvidia Custom Chip for OpenAI Inference

Nvidia plans a new processor tailored for OpenAI and others to enable faster AI inference. It integrates Groq-designed chips and debuts at next month's GTC conference. OpenAI will be a top customer despite seeking alternatives.

#inference#chip#gtc
๐Ÿ’ฐ
้’›ๅช’ไฝ“โ€ข42d ago

QingTianZu Robot Leasing: Opportunity or Trap?

The article debates if QingTianZu's robot leasing represents a once-in-20-years wealth opportunity or a calculated harvest. Robot leasing merits attention but is not yet ripe for blind optimism. It is not a scam, though far from guaranteed profits.

#robot-leasing#investment-debate#embodied-ai
๐Ÿฏ
่™Žๅ—…โ€ข42d ago

One-Click OpenClaw for Windows

A simple one-click deployment solution brings OpenClaw to Windows users. It integrates seamlessly into Feishu, making it beginner-friendly for easy setup and use.

#one-click-deploy#windows-support#beginner-tool
๐Ÿค–
Reddit r/MachineLearningโ€ข42d ago

Micro Diffusion: 150-Line Text Diffusion

Minimal discrete text diffusion implementation in pure Python, inspired by MicroGPT. Features three versions: 143-line NumPy core, visualized NumPy, and PyTorch Transformer denoiser. Trains on CPU in minutes on 32K SSA names dataset.

#text-diffusion#minimal-code#cpu-training
๐Ÿ’ฐ
้’›ๅช’ไฝ“โ€ข42d ago

Workday Founder Returns Amid AI Struggles

Workday's founder is returning to address the company's AI transformation challenges. The stock plunged 10%, fueling market skepticism. The software sector grapples with converting AI hype to revenue growth.

#ai-transformation#saas-revenue#stock-drop
๐Ÿ’ฐ
้’›ๅช’ไฝ“โ€ข42d ago

Meta's $60B Chip Buy Cracks Nvidia Dominance

Meta invests 600 billion to acquire backup chips, securing three major types. Google's TPU enters the commercial arena as Meta shifts allegiance. AI compute power shifts to a brutal multi-vendor 'three kingdoms' competition.

#ai-chips#nvidia-alternatives#tpu-commercial
๐Ÿ’ฐ
้’›ๅช’ไฝ“โ€ข42d ago

Nvidia Profits Boom, Stock Lags on AI Bets

Nvidia posts massive earnings but stock fails to rise. Analysis questions CEO Jensen Huang's AI 'gold mine' strategy and its shift from gold rush to alchemy.

#earnings-analysis#ai-strategy#stock-performance
๐Ÿ’ฐ
้’›ๅช’ไฝ“โ€ข42d ago

AI's Next Step: Intelligent Agents

Intelligent agents represent a pivotal evolution in AI development. They may not be the final destination but are likely the crucial first step for AI integration into the real world.

#agentic-ai#ai-evolution#real-world-ai
๐Ÿ“Š
Bloomberg Technologyโ€ข42d ago

OpenAI Inks Pentagon AI Deployment Deal

OpenAI has reached an agreement with the US Department of Defense to deploy its AI models in their classified network. This partnership enables military use of OpenAI's advanced models. The deal highlights growing AI integration in national security.

#defense#government#deployment
Page 522 of 673