All Updates

Page 4 of 607

April 4, 2026

🦙
Reddit r/LocalLLaMA20h ago

Kokoro TTS Achieves 20x Realtime on CPU

Developer optimized Kokoro TTS for iOS with CPU-only pipeline, hitting 20x realtime without thermal issues by splitting the model and using Apple's Accelerate framework. Avoids Metal for background audio support. Released as Morph Books EPUB reader app.

#tts#on-device#ios-optimization
🇬🇧
The Register - AI/ML20h ago

PrismML Launches 1-Bit Bonasi 8B LLM

PrismML, a Caltech AI startup, released Bonasi 8B, a 1-bit large language model competitive with other 8B models. It is 14x smaller and 5x more energy efficient, aiming to enable efficient AI on mobile devices and reduce cloud dependency.

#1-bit-quantization#edge-ai#energy-efficiency
🦙
Reddit r/LocalLLaMA20h ago

Qwen 3.6 Quantization Erases Benchmark Edge

Users worry about unreleased Qwen 3.6 397B, but small benchmark gaps between Qwen 3.5 and 3.6 suggest quantization like Q2_K_XL on RTX 6000 would negate advantages. Discussion anticipates smaller Qwen models competing with Gemma 4.

#quantization#benchmarks#local-deployment
🔥
36氪21h ago

Silicon Association Probes Baotou Supply Glut

China Nonferrous Metals Industry Association Silicon Division surveyed Baotou's silicon sector. Firms report acute supply-demand imbalance, prices below costs, and chain-wide losses. They seek association-government action to curb chaos, foster fair markets, and refine standards avoiding blanket policies.

#silicon-supply#market-crisis#industry-regulation
🏠
IT之家22h ago

Tesla Superchargers Hit 80K Global, 12K in China

Tesla's global Supercharger network exceeds 80,000 stalls, with over 12,000 in mainland China across 2,500+ stations. It covers 100% of provincial capitals and opens 950+ stations to non-Tesla EVs. In 2025, delivered record 6.7 TWh electricity.

#ev-charging#china-expansion#network-scale
🔥
36氪22h ago

World's First Embodied AI Hackathon

Zivariable Robotics hosted the inaugural global embodied intelligence developer conference in Shenzhen, with 20 post-00s teams hacking for 72 hours on real robotic arms backed by 100+ PFLOPs compute and open models like WALL-OSS. A/B leaderboards tested generalization in fixed vs. random real-world environments across tasks like fruit sorting and cable insertion. Pre-event, Zivariable launched the first robot cleaning service partnering with 58 Daojia.

#embodied-ai#hackathon#open-models
🦙
Reddit r/LocalLLaMA22h ago

DIY GGUF Quantization Guide Released

User shares detailed recipe for quantizing GGUF models like Gemma-4-26B-A4B, requiring 500GB storage and architecture-specific configs. Thanks quantizers like unsloth, bartowski; links full REPRODUCE.md on Hugging Face.

#quantization#gguf#tutorial
🐯
虎嗅23h ago

AI Ends Internet's Lightweight Era

Internet giants are pivoting from lightweight software to massive AI infrastructure investments, with Amazon planning $200B capex in 2026 for power and data centers. Big Tech's combined $650B annual spend rivals global semiconductor revenue, while Chinese firms like ByteDance, Alibaba, and Tencent allocate hundreds of billions RMB to AI chips and facilities. This shift reverses scale economies, as AI inference costs rise with user growth.

#capex#data-centers#power-supply
⚛️
量子位23h ago

Django Founder: AI Zeros 30-Year-Old Coders' Value

Django founder warns that AI will render the skills of 30-year-old programmers worthless. He states his former superpower of rapid prototyping is now achievable by anyone using AI tools.

#ai-disruption#programmer-jobs#prototyping
🐯
虎嗅23h ago

$1M Crowdfunded in 5 Hours for Local AI Box

Tiiny AI Pocket Lab plug-in device for 100B local LLM inference hits $2.95M Kickstarter with 2k backers at $1399; fills gap for privacy-focused, easy local AI vs. costly AI PCs or weak boards. Built on open-source PowerInfer engine from SJTU.

#local-llm#ai-hardware#open-source
🤖
Reddit r/MachineLearning23h ago

ML Vets: What Public Gets Wrong About AI

Reddit thread seeks insights from ML/AI experts with 10+ years experience on public misconceptions. Highlights gaps between public perceptions and frontier research realities. Invites discussion on underestimations or overestimations in AI.

#ai-hype#veteran-insights
🏠
IT之家Yesterday

Qwen3.6-Plus Breaks 1.4T Token Daily Record

Alibaba's Qwen3.6-Plus hit 1.4 trillion tokens on OpenRouter in one day post-launch, shattering the platform's single-model record. It tops China and ranks #2 globally in programming benchmarks. Free preview available, strongest among recent models.

#llm-benchmark#api-record#agent-model
🤖
Reddit r/MachineLearningYesterday

NeurIPS Submission: Agentic Proof Dilemma

Researcher debates submitting NeurIPS paper on novel agentic system with formal convergence proof and real-world application. Limited to few examples due to unsuitable benchmarks. Seeks advice on proceeding despite data gaps.

#neurips-submission#agentic-systems#convergence-proof
🔥
36氪Yesterday

Qwen AI Taxi Surges 1500% in 2 Weeks

Qwen AI ride-hailing launched on March 23 and saw orders surge over 1500% week-over-week on April 4 during Qingming holiday. User scale rapidly expanded in under two weeks. Users favor it for complex scenarios like multi-waypoints, appointments, and personalized request combinations.

#ai-agent#ride-hailing#user-growth
🦙
Reddit r/LocalLLaMAYesterday

GLM-5 Nearly Matches Claude Opus at 11x Lower Cost

YC-Bench benchmark simulates LLMs running a startup for a year with delayed feedback and adversarial clients. GLM-5 achieves $1.21M avg funds, close to Claude Opus's $1.27M but at 11x lower API cost. Persistent scratchpad use predicts success.

#llm-benchmark#agent#cost-efficiency
🇨🇳
cnBeta (Full RSS)Yesterday

Sony Preps PS6 and Handheld Amid Memory Crisis

Next-gen consoles slated for late 2027 or 2028 despite persistent memory shortages. Sony is alerting developers to gear up for PS6 and a new portable PS handheld. Emphasis on enhanced game scalability across devices.

#memory-crisis#next-gen-console#handheld
🔥
36氪Yesterday

Musk Ties SpaceX IPO to Grok Purchases

Elon Musk reportedly requires companies eyeing SpaceX IPO participation to purchase Grok. This bundles access to the rocket firm's public offering with adoption of xAI's AI chatbot. Reported by Caijing.

#ipo-mandate#xai-strategy#musk-ecosystem
🐯
虎嗅Yesterday

PR Evolves: Clarity Over Volume in AI Age

AI and social media have flooded content supply, diminishing traditional PR impact despite efficient production. Success now hinges on clear core messaging and translating internal narratives for external clarity. Examples like ZhuiMi show effective audience-specific framing.

#pr-strategy#ai-content#brand-clarity
🤖
Reddit r/MachineLearningYesterday

ACL 2026 Decisions Due in 24 Hours

Dedicated Reddit thread for ACL 2026 decision updates and discussions. Decisions expected to publish within 24 hours. Provides space for community venting and sharing.

#acl-2026#conference-decisions#nlp-community
🐯
虎嗅Yesterday

Colleague.Skill Turns Ex-Coworkers into AI Bots

Viral GitHub project 'colleague-skill' lets users feed ex-colleagues' messages, docs, and emails into AI to create mimicking 'skills' for work tasks. It sparks debate on distilling human expertise into AI, job losses, and 'cyber immortality' memes. Concerns rise over closing junior career paths and self-training replacement tools.

#job-automation#cyber-immortality
Page 4 of 607