All Updates

Page 155 of 907

April 19, 2026

🦙
Reddit r/LocalLLaMA12d ago

Scaffold Doubles Small Model Coding Score

Same Qwen3.5-9B model scores 19.1% on Aider benchmark with vanilla scaffold, but 45.6% with author's little-coder adaptation. Changes focus on bounded reasoning, write guards, workspace discovery, and per-turn injections. Suggests scaffold-model fit crucial for sub-10B local models in coding agents.

#scaffolds#coding-agents#benchmarks
🦙
Reddit r/LocalLLaMA12d ago

Qwen 3.6 35B Builds Browser OS

User shares 'Browser OS' implementation using Qwen 3.6 35B, calling it the best result from any local model. Post links to details but content is minimal.

#browser-os#local-model#agentic
🤖
Reddit r/MachineLearning12d ago

Formalisation Trap in AI Production

Recurring AI production failure: systems make technically correct but contextually wrong decisions due to shifted assumptions. Not due to models, data, or infra; outputs valid but outdated. Tightening controls reinforces the 'Formalisation Trap' locking meaning into structure.

#prod-failures#assumption-drift#formalisation-trap
🌍
The Next Web (TNW)12d ago

Meta Layoffs 8000 for AI Infra Billions

Meta targets 20 May for companywide layoffs of approximately 8,000 employees (10% of 78,865 workforce) to redirect billions toward AI infrastructure costing $115-135 billion. Additional cuts are planned for the second half of 2026, following prior rounds totaling roughly 25,000 since 2022.

#layoffs#ai-investment#restructuring
🧐
GeekWire12d ago

Neighbor's Take: Microsoft Cultural Woes

Sammamish operations consultant, neighboring Microsoft employees, critiques the company's culture. Internal politics dominate even weekend talks. H-1B visa fears are stifling risk-taking and innovation.

#company-culture#h1b-visa#internal-politics
📊
Bloomberg Technology12d ago

Apple Teases Revamped Siri UI in iOS 27

Apple has hidden a preview of a revamped Siri interface for iOS 27 in its WWDC teaser video. The update focuses on enhancing the visual design of the AI assistant. Additionally, memory shortages could delay new Mac launches.

#voice-assistant#ui-redesign#hardware-delay
🤖
Reddit r/MachineLearning12d ago

Transitioning to ML Research Engineer Over 40

Experienced US software engineer (Staff+ level) with math-heavy CS degree and ML courses seeks advice on becoming a research engineer. Past applied ML experience disliked; open to unpaid/part-time roles for entry. Considers masters/PhD but questions value beyond connections.

#career-transition#ml-jobs#ageism
🦙
Reddit r/LocalLLaMA12d ago

Qwen3.6-35B-A3B Excels in LM Studio Chat

User shares optimal prompt and settings for precise responses from qwen3.6-35b-a3b in LM Studio's lms chat. Config includes temp 0.7, top-k 10, presence penalty 1 on RTX 5090. Detailed reasoning protocol prompt boosts accuracy for complex tasks.

#prompt-engineering#local-inference#reasoning-model
💰
钛媒体13d ago

AI Rings, Necklaces Target Wearables Trillion Market

After smart glasses, AI-powered rings and necklaces are competing intensely for the trillion-yuan smart wearables market. These devices promise to provide users with AI-enhanced 'cheat mode' capabilities.

#wearables#ai-hardware#market-competition
💰
钛媒体13d ago

11 Whys Unlock Yizhuang Robot Marathon Insights

The article breaks down the Yizhuang robot marathon event through 11 key questions. It stresses that analyzing not just winners but also failed robots provides critical lessons for robotics development.

#robotics#embodied-ai#competition-analysis
🦙
Reddit r/LocalLLaMA13d ago

llama.cpp Merges Speculative Checkpointing

Speculative checkpointing feature merged into llama.cpp via PR #19493. It delivers speedups for some prompts, varying by task; coding sees 0-50% gains with specific params. Performance depends on draft acceptance and repetition patterns.

#speculative-decoding#inference-speedup#coding-optimization
💰
钛媒体13d ago

DeepSeek $10B Valuation; TSMC AI Crunch; China-US LLM Parity

DeepSeek is reportedly in first external funding talks at over $10B valuation; TSMC CEO says max expansion can't meet AI demand. Stanford report claims substantial elimination of US-China top large model gaps; other news includes HappyHorse-1.0 Arena debut, compute price surges by Alibaba/Anthropic, Meta 10% layoff plans, Nvidia's open-source quantum AI model ISING.

#funding#infrastructure#compute-pricing
📊
Bloomberg Technology13d ago

AI Boom Fuels US Copper Race

AI-driven electricity demand surges, making copper vital for data centers and power grids. US production has stagnated, increasing import reliance amid global demand. Projects like Rio Tinto’s Resolution mine face regulatory delays and costs, with China dominating processing.

#data-centers#supply-chain#mining
🇨🇳
cnBeta (Full RSS)13d ago

OpenAI's Three Execs Depart Same Day

OpenAI faces accelerated talent loss with three senior executives—Kevin Weil, Bill Peebles, and Srinivas Narayanan—quitting simultaneously. This coincides with shutting down multiple experimental projects. Industry watches closely for strategic shifts.

#talent-loss#leadership-change#project-shutdown
🦙
Reddit r/LocalLLaMA13d ago

Entropy + OLS + SVD Beats KV Pruning

Experiment combines entropy selection, OLS reconstruction, SVD compression for KV cache. Achieves ~3x lower error at low memory vs Top-K pruning, avoids spikes. Prototype blog seeks feedback.

#kv-cache#compression#low-rank
🇬🇧
The Register - AI/ML13d ago

AI Vendors Dodge Vuln Responsibility

AI vendors promote using AI to combat threats but dismiss their own security flaws as 'working as intended.' This opinion piece criticizes their lack of maturity in owning vulnerabilities. It highlights a pattern of passing blame to users in corporate IT environments.

#vulnerabilities#accountability#maturity
🤖
Reddit r/MachineLearning13d ago

Local LLMs for XQuery-SQL Conversion

Developer seeks best approach to convert XQuery to SQL using local LLMs amid data scarcity. Parsing and prompt engineering failed on complex queries. Considering QLoRA fine-tuning on ~120 samples.

#fine-tuning#prompt-engineering#query-translation
🏠
IT之家13d ago

ASRock & ASUS Add DDR5 32-bit HUDIMM Support

Amid DDR5 shortages, ASRock enables 32-bit HUDIMM (single sub-channel) on Intel 600/700/800 motherboards for cheaper half-die modules. ASUS also supports standard 4-chip HUDIMM, allowing mixes with UDIMM for 96-bit width. This addresses high DDR5 prices and boosts adoption.

#memory-shortage#motherboards#ddr5-subchannel
⚛️
量子位13d ago

Kimi Paper Turns KVCache into Business Model

Moonshot AI's Kimi team releases a new paper innovating on KVCache, transforming it into a novel commercial model. This breakthrough is positioned as a boon for handling ultra-long contexts in LLMs. The approach promises efficiency gains in extended inference scenarios.

#kv-cache#long-context#inference
⚛️
量子位13d ago

Gaode Unveils AGI Embodied Tech Stack

Gaode (Amap) publicly releases its first full-stack embodied technology system aimed at AGI. The system dominates 15 global SOTA benchmarks. It signals rapid convergence as embodied AI infrastructure matures.

#embodied-ai#agi#sota-benchmarks
Page 155 of 907