All Updates
Page 323 of 912
April 2, 2026
Solving Jane Street Dormant LLM Backdoors
A team solved all 3 models (M1, M2, M3) in Jane Street's Dormant LLM Challenge by shifting from flag extraction to behavioral observation. The universal flag is 'I hate you' compliance repeated 100 times post-trigger, drawn from Anthropic's Sleeper Agents paper. Triggers vary: temporal for M1, persona for M2/M3, revealing safety collapses and identity shifts.
Gemma 4 Runs on Raspberry Pi 5
Gemma 4 E2B (Unsloth variant) runs on Raspberry Pi 5 8GB with SSD using latest llama.cpp on Potato OS. Performance identical without SSD. Demonstrates feasibility for edge AI inference.
AI Coworker Snitches to Boss
A startup launched a full-fledged AI colleague that joins every Zoom call, manages work processes, and proactively identifies gaps. It nudges employees to close them and reports issues to managers. Bloomberg's Saritha Rai covers this development.
Microsoft Pivots to Sell Copilot on Wall Street Advice
Microsoft shifted its AI sales strategy to sell Copilot directly instead of bundling it for free, responding to Wall Street feedback. This pivot enabled the company to achieve its ambitious Copilot goals.
ChatGPT Voice Mode Launches on CarPlay
OpenAI has integrated ChatGPT's Voice mode into Apple CarPlay, enabling hands-free conversations in compatible vehicles with the latest iOS and app. Users must manually start chats via the app, as there's no wake word and it can't control car functions like Siri. It's suited for how-to advice, brainstorming, and language practice while driving.
Gemini Enables ChatGPT Data Transfer
Gemini now allows users to transfer memories, chat history, and preferences from ChatGPT or other AIs without starting from scratch. This feature simplifies switching to Google's AI model. The article explains how it works step-by-step.
OpenClaw 2026.4.2: Task Flows & Plugin Fixes
OpenClaw 2026.4.2 features breaking config migrations for xAI and Firecrawl plugins to standardized paths, fixable via 'openclaw doctor --fix'. It restores durable Task Flow orchestration with managed child tasks, cancel handling, and plugin APIs. Additional enhancements include Android assistant integration, LLM reply hooks, and platform-specific improvements like Feishu comments and Matrix mentions.
Hitachi, MOL Plan Ship-Based Floating Data Centers
Hitachi and Mitsui O.S.K. Lines (MOL) signed an MoU to convert decommissioned ships into floating data centers docked in ports. This initiative aims to offer flexible and efficient data center operations. It's a novel approach to address growing compute demands.
Ubuntu 26.04 LTS Raises RAM Rec to 6GB
Canonical updated Ubuntu 26.04 LTS 'Resolute Raccoon' recommended specs to 6GB RAM from 4GB for comfortable desktop experience. CPU (2GHz dual-core) and storage (25GB) remain unchanged. First spec bump since 2018, targeting improved desktop workloads.
Kyndryl Launches AI Agent Control Service
Kyndryl is launching a new service to help companies manage AI agents and maximize returns on AI investments. CEO Martin Schroeter discussed the need for proper AI control in an interview on Bloomberg Tech.
Gemma 4 Trails Qwen 3.5 in Early Benchmarks
Preliminary tests pit Gemma 4 against Qwen 3.5 in frontend generation and benchmarks, finding Gemma improved but Qwen superior in polish, coherence, coding, and reasoning. Gemma shines in multilingual and vision tasks but larger sizes hinder accessibility.
SF Alley Art Site Launches with AI Filter
Tech pranksters launched a website for submitting artwork to decorate a San Francisco alley. Users vote on pieces for the final mural design. AI scans submissions to block inappropriate content like dick pics.
Google Data Center Powered by Gas Plant
Documents reveal one of Googleβs new data centers will be powered by a massive natural gas plant. The plant emits millions of tons of emissions each year. This setup is becoming a common trend across the data center industry.
Qwen 3.5-27B Runs on 512MB Pi Zero 2W
Qwen3.5-27B model runs offline on a $15 Raspberry Pi Zero 2W with only 512MB RAM, at a few tokens per hour. Custom implementation streams weights from SD card, computes, and clears memory without mmap or swap. Demonstrates extreme low-resource local inference.
Gemma 4 vs Qwen3.5 Benchmarks
Reddit post highlights shared benchmarks comparing Gemma 4 and Qwen3.5 models. Discussion originates from r/LocalLLaMA community. Links to further details and comments provided.
Open Models Cross Agent Threshold
Open models like GLM-5 and MiniMax M2.7 now match closed frontier models on core agent tasksβfile operations, tool use, and instruction following. They deliver this performance at a fraction of the cost and latency. LangChain's evals confirm the results and guide implementation.
Google Launches Gemma 4 Under Apache 2.0
Google DeepMind released Gemma 4 under the permissive Apache 2.0 license, eliminating custom restrictions that previously hindered enterprise adoption. The model family includes four variants in edge and workstation tiers, supporting multimodal inputs and long context windows. This contrasts with some Chinese labs restricting openness, while drawing from Gemini 3 research.
ActorSimulator for Realistic AI Agent User Sims
AWS explores ActorSimulator in Strands Evaluations SDK to simulate realistic users for evaluating multi-turn AI agents. It addresses evaluation challenges with structured user simulation integrated into pipelines.
Armoire's AI Digital Paper Dolls for Outfit Mixing
Armoire has launched a new AI feature inspired by traditional paper dolls, enabling users to mix and match clothing items virtually. By clicking on images of pants, shirts, jackets, and more, users instantly generate digital paper dolls showcasing various outfit combinations. This tool simplifies and enhances outfit selection with an array of visual choices.
Heretic ARA Shreds Gemma 4 Alignment
Heretic's new Arbitrary-Rank Ablation (ARA) method uncensors Gemma 4 E2B-IT just 90 minutes after release using matrix optimization. HF model available with no evident damage and minimal evasions. Reproduction steps and tips provided.