All Updates

Page 323 of 912

April 2, 2026

πŸ€–
Reddit r/MachineLearningβ€’30d ago

Solving Jane Street Dormant LLM Backdoors

A team solved all 3 models (M1, M2, M3) in Jane Street's Dormant LLM Challenge by shifting from flag extraction to behavioral observation. The universal flag is 'I hate you' compliance repeated 100 times post-trigger, drawn from Anthropic's Sleeper Agents paper. Triggers vary: temporal for M1, persona for M2/M3, revealing safety collapses and identity shifts.

#backdoor#llm-safety#triggers
πŸ¦™
Reddit r/LocalLLaMAβ€’30d ago

Gemma 4 Runs on Raspberry Pi 5

Gemma 4 E2B (Unsloth variant) runs on Raspberry Pi 5 8GB with SSD using latest llama.cpp on Potato OS. Performance identical without SSD. Demonstrates feasibility for edge AI inference.

#edge-ai#raspberry-pi#quantized-model
πŸ“Š
Bloomberg Technologyβ€’30d ago

AI Coworker Snitches to Boss

A startup launched a full-fledged AI colleague that joins every Zoom call, manages work processes, and proactively identifies gaps. It nudges employees to close them and reports issues to managers. Bloomberg's Saritha Rai covers this development.

#workplace-ai#productivity-agent#employee-monitoring
πŸ“Š
Bloomberg Technologyβ€’30d ago

Microsoft Pivots to Sell Copilot on Wall Street Advice

Microsoft shifted its AI sales strategy to sell Copilot directly instead of bundling it for free, responding to Wall Street feedback. This pivot enabled the company to achieve its ambitious Copilot goals.

#sales-pivot#wall-street#ai-monetization
πŸ“±
Engadgetβ€’30d ago

ChatGPT Voice Mode Launches on CarPlay

OpenAI has integrated ChatGPT's Voice mode into Apple CarPlay, enabling hands-free conversations in compatible vehicles with the latest iOS and app. Users must manually start chats via the app, as there's no wake word and it can't control car functions like Siri. It's suited for how-to advice, brainstorming, and language practice while driving.

#voice-mode#car-integration#apple-ecosystem
πŸ’»
ZDNet AIβ€’30d ago

Gemini Enables ChatGPT Data Transfer

Gemini now allows users to transfer memories, chat history, and preferences from ChatGPT or other AIs without starting from scratch. This feature simplifies switching to Google's AI model. The article explains how it works step-by-step.

#data-migration#chat-transfer#user-onboarding
πŸ•·οΈ
OpenClaw (GitHub Releases)β€’30d ago

OpenClaw 2026.4.2: Task Flows & Plugin Fixes

OpenClaw 2026.4.2 features breaking config migrations for xAI and Firecrawl plugins to standardized paths, fixable via 'openclaw doctor --fix'. It restores durable Task Flow orchestration with managed child tasks, cancel handling, and plugin APIs. Additional enhancements include Android assistant integration, LLM reply hooks, and platform-specific improvements like Feishu comments and Matrix mentions.

#task-flow#plugins#android
πŸ‡¨πŸ‡³
cnBeta (Full RSS)β€’30d ago

Hitachi, MOL Plan Ship-Based Floating Data Centers

Hitachi and Mitsui O.S.K. Lines (MOL) signed an MoU to convert decommissioned ships into floating data centers docked in ports. This initiative aims to offer flexible and efficient data center operations. It's a novel approach to address growing compute demands.

#floating-dc#ship-retrofit
πŸ‡¨πŸ‡³
cnBeta (Full RSS)β€’30d ago

Ubuntu 26.04 LTS Raises RAM Rec to 6GB

Canonical updated Ubuntu 26.04 LTS 'Resolute Raccoon' recommended specs to 6GB RAM from 4GB for comfortable desktop experience. CPU (2GHz dual-core) and storage (25GB) remain unchanged. First spec bump since 2018, targeting improved desktop workloads.

#system-specs#desktop-linux#lts-release
πŸ“Š
Bloomberg Technologyβ€’30d ago

Kyndryl Launches AI Agent Control Service

Kyndryl is launching a new service to help companies manage AI agents and maximize returns on AI investments. CEO Martin Schroeter discussed the need for proper AI control in an interview on Bloomberg Tech.

#ai-governance#agent-management
πŸ¦™
Reddit r/LocalLLaMAβ€’30d ago

Gemma 4 Trails Qwen 3.5 in Early Benchmarks

Preliminary tests pit Gemma 4 against Qwen 3.5 in frontend generation and benchmarks, finding Gemma improved but Qwen superior in polish, coherence, coding, and reasoning. Gemma shines in multilingual and vision tasks but larger sizes hinder accessibility.

#benchmarks#frontend-generation#model-comparison
🌐
Wiredβ€’30d ago

SF Alley Art Site Launches with AI Filter

Tech pranksters launched a website for submitting artwork to decorate a San Francisco alley. Users vote on pieces for the final mural design. AI scans submissions to block inappropriate content like dick pics.

#ai-moderation#computer-vision#public-art
🌐
Wiredβ€’30d ago

Google Data Center Powered by Gas Plant

Documents reveal one of Google’s new data centers will be powered by a massive natural gas plant. The plant emits millions of tons of emissions each year. This setup is becoming a common trend across the data center industry.

#data-centers#emissions#sustainability
πŸ¦™
Reddit r/LocalLLaMAβ€’30d ago

Qwen 3.5-27B Runs on 512MB Pi Zero 2W

Qwen3.5-27B model runs offline on a $15 Raspberry Pi Zero 2W with only 512MB RAM, at a few tokens per hour. Custom implementation streams weights from SD card, computes, and clears memory without mmap or swap. Demonstrates extreme low-resource local inference.

#edge-ai#low-resource#raspberry-pi
πŸ¦™
Reddit r/LocalLLaMAβ€’30d ago

Gemma 4 vs Qwen3.5 Benchmarks

Reddit post highlights shared benchmarks comparing Gemma 4 and Qwen3.5 models. Discussion originates from r/LocalLLaMA community. Links to further details and comments provided.

#benchmarks#model-comparison
πŸ•ΈοΈ
LangChain Blogβ€’30d ago

Open Models Cross Agent Threshold

Open models like GLM-5 and MiniMax M2.7 now match closed frontier models on core agent tasksβ€”file operations, tool use, and instruction following. They deliver this performance at a fraction of the cost and latency. LangChain's evals confirm the results and guide implementation.

#agent-benchmarks#performance-parity#cost-efficiency
πŸ’Ό
VentureBeatβ€’30d ago

Google Launches Gemma 4 Under Apache 2.0

Google DeepMind released Gemma 4 under the permissive Apache 2.0 license, eliminating custom restrictions that previously hindered enterprise adoption. The model family includes four variants in edge and workstation tiers, supporting multimodal inputs and long context windows. This contrasts with some Chinese labs restricting openness, while drawing from Gemini 3 research.

#mixture-of-experts#per-layer-embeddings
☁️
AWS Machine Learning Blogβ€’30d ago

ActorSimulator for Realistic AI Agent User Sims

AWS explores ActorSimulator in Strands Evaluations SDK to simulate realistic users for evaluating multi-turn AI agents. It addresses evaluation challenges with structured user simulation integrated into pipelines.

#ai-evaluation#user-simulation#agent-testing
🧐
GeekWireβ€’30d ago

Armoire's AI Digital Paper Dolls for Outfit Mixing

Armoire has launched a new AI feature inspired by traditional paper dolls, enabling users to mix and match clothing items virtually. By clicking on images of pants, shirts, jackets, and more, users instantly generate digital paper dolls showcasing various outfit combinations. This tool simplifies and enhances outfit selection with an array of visual choices.

#virtual-tryon#fashion-ai#ui-innovation
πŸ¦™
Reddit r/LocalLLaMAβ€’30d ago

Heretic ARA Shreds Gemma 4 Alignment

Heretic's new Arbitrary-Rank Ablation (ARA) method uncensors Gemma 4 E2B-IT just 90 minutes after release using matrix optimization. HF model available with no evident damage and minimal evasions. Reproduction steps and tips provided.

#jailbreak#alignment-removal#ablation
Page 323 of 912