All Updates
March 28, 2026
Wharton: AI Causes Cognitive Surrender
Wharton School research finds that AI reshapes human reasoning, leading to 'cognitive surrender', in which users blindly trust erroneous outputs. In experiments with 1,300 participants, 80% accepted wrong ChatGPT answers without verification, which boosted misplaced confidence. The researchers propose a new 'System 3' for AI-extended cognition.
TurboQuant Core: Random Vector Rotation
TurboQuant enhances vector quantization by applying random rotations to vectors before quantization and counter-rotations on dequantization. This counters quasi-sparse structures in LLM state vectors that cause information loss. The simple trick dramatically boosts performance without complex dependencies.
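The rotate, quantize, counter-rotate idea described above can be sketched in a few lines. This is a minimal illustration, not TurboQuant's actual code: the summary does not specify the quantizer, so the 4-bit absmax uniform quantizer, the function names, and the synthetic test vector are all assumptions made for the example.

```python
import numpy as np

def random_rotation(dim, seed=0):
    """Draw a random orthogonal matrix from the QR factorization of a Gaussian matrix."""
    rng = np.random.default_rng(seed)
    q, r = np.linalg.qr(rng.standard_normal((dim, dim)))
    # Flip column signs so the result does not depend on the QR sign convention.
    return q * np.sign(np.diag(r))

def quantize(x, bits=4):
    """Uniform symmetric quantization with a single absmax scale (an assumed scheme)."""
    levels = 2 ** (bits - 1) - 1
    scale = float(np.max(np.abs(x))) / levels
    if scale == 0.0:
        scale = 1.0
    codes = np.round(x / scale).astype(np.int8)
    return codes, scale

def dequantize(codes, scale):
    return codes.astype(np.float64) * scale

def roundtrip_error(x, rotation=None, bits=4):
    """Relative L2 error after quantizing and dequantizing x, optionally in a rotated basis."""
    y = rotation @ x if rotation is not None else x
    codes, scale = quantize(y, bits)
    y_hat = dequantize(codes, scale)
    # The counter-rotation is the transpose, because the rotation is orthogonal.
    x_hat = rotation.T @ y_hat if rotation is not None else y_hat
    return float(np.linalg.norm(x - x_hat) / np.linalg.norm(x))

# Synthetic quasi-sparse vector: unit-scale noise plus one large outlier.
rng = np.random.default_rng(0)
x = rng.standard_normal(256)
x[3] = 50.0
rotation = random_rotation(256, seed=1)
print("relative error without rotation:", roundtrip_error(x))
print("relative error with rotation:   ", roundtrip_error(x, rotation))
```

Without rotation, the single outlier sets the quantization scale and most small entries round to zero. The rotation spreads the outlier's energy across all coordinates, so the same scale wastes less resolution. With this simple quantizer the gain depends on the input and is not guaranteed for every vector.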
Llama-Server Breaking Cache Migration
The latest llama-server automatically migrates the legacy cache to the Hugging Face directory, converting .gguf models to blobs. This breaks launch scripts and model distribution workflows. The commit was added without an opt-out option, amid complaints of a Hugging Face takeover.
RTX 4080 32GB Triple Fan from China €1300
A user bought a 32GB RTX 4080 triple fan GPU from China for around €1300, deeming it reasonable for the VRAM amount. The card runs smoothly and quietly thanks to the triple fans. They seek advice on initial tests.
Grudges Shape OpenAI-Anthropic AI Paths
WSJ exposes decade-long personal conflicts among Altman, Musk, Amodei, and Brockman from OpenAI origins, sparking Anthropic's founding. Disputes over AI disclosure, power, and layoffs evolved into OpenAI's deployment speed vs. Anthropic's safety focus. Human emotions drive AI trajectories as much as tech.
Claude Paid Subscriptions Double in 2024
Anthropic's Claude is experiencing skyrocketing popularity among paying consumers. Paid subscriptions have more than doubled this year, according to a company spokesperson. Total consumer user estimates vary widely from 18 million to 30 million.
Llama.cpp Integrates TurboQuant, H2O, StreamingLLM
Developer peva3 added TurboQuant, Heavy-Hitter Oracle (H2O), and StreamingLLM to llama.cpp for major speedups. The build reportedly sustains full-speed token generation at 256k+ context on a 16GB RTX 4060 Ti with Qwen 3.5 4B. CPU and CUDA builds are fully usable, with detailed docs on GitHub.
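As background on one of the named techniques: StreamingLLM, as published, bounds the KV cache by keeping a few initial "attention sink" tokens plus a sliding window of the most recent tokens. The sketch below shows only that retention rule. The function name and default sizes are hypothetical, and the summary does not say how this particular llama.cpp integration implements it.

```python
def streaming_keep_positions(seq_len, num_sinks=4, window=1020):
    """Return the token positions a StreamingLLM-style cache would retain.

    num_sinks and window are illustrative defaults, not values from the source.
    """
    if seq_len <= num_sinks + window:
        # Everything still fits, so nothing is evicted.
        return list(range(seq_len))
    sinks = list(range(num_sinks))
    recent = list(range(seq_len - window, seq_len))
    return sinks + recent

# The cache size stays bounded no matter how long the sequence grows.
kept = streaming_keep_positions(seq_len=262144)
print(len(kept))
```

H2O follows a related idea but chooses which older tokens to keep by accumulated attention score rather than by position alone; that scoring step is not shown here.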
Anthropic Struggles vs Chinese Rivals, Safety Focus
Anthropic, maker of Claude, faces headwinds from Chinese competition and its own strict safety obsession. The company gained goodwill by resisting US Defense Department demands to soften model safeguards. It plans to go public as soon as Q4 2026.
DIY Tiled Attention for AMD GPUs
A user built a PyTorch-based tiled attention mechanism as a flash-attention alternative for AMD MI50 GPUs (gfx906), which flash-attention does not support, enabling video generation without out-of-memory (OOM) errors. Inspired by llama.cpp, it uses query chunking, softmax fallbacks, and optimizations such as BF16-to-FP16 conversion. It is pure PyTorch and needs no custom kernels.
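Query chunking, the first technique listed above, can be illustrated with a short sketch. The post's implementation is in PyTorch; this version uses NumPy only to stay self-contained, and it leaves out the softmax fallbacks and the BF16-to-FP16 conversion. The function names and the chunk size are assumptions for the example.

```python
import numpy as np

def softmax(scores):
    """Row-wise softmax, subtracting the row maximum for numerical stability."""
    shifted = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(shifted)
    return weights / weights.sum(axis=-1, keepdims=True)

def attention(q, k, v):
    """Plain scaled dot-product attention; materializes a (len(q), len(k)) score matrix."""
    scale = 1.0 / np.sqrt(q.shape[-1])
    return softmax((q @ k.T) * scale) @ v

def chunked_attention(q, k, v, chunk_size=64):
    """Process queries in chunks so only a (chunk_size, len(k)) score block exists at a time."""
    out = np.empty((q.shape[0], v.shape[-1]), dtype=np.result_type(q, k, v))
    for start in range(0, q.shape[0], chunk_size):
        stop = start + chunk_size
        # Each query row attends to all keys independently, so chunking is exact.
        out[start:stop] = attention(q[start:stop], k, v)
    return out

rng = np.random.default_rng(0)
q = rng.standard_normal((100, 16))
k = rng.standard_normal((50, 16))
v = rng.standard_normal((50, 8))
print(np.allclose(chunked_attention(q, k, v, chunk_size=32), attention(q, k, v)))
```

Because softmax is applied per query row, splitting along the query axis changes peak memory but not the result. Tiling along the key axis as well would need a running softmax, which this sketch does not attempt.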
TikTok AI Ad Labels Failing
TikTok requires disclosure labels on ads made with generative AI, but companies such as Samsung omit them from their videos. Users suspect the content is synthetic but cannot confirm it because the fine print is unclear. Enforcement of the policy is ineffective.
Meta Liable for Harming Minors
US juries in New Mexico and Los Angeles held Meta liable for harm to minors, awarding hundreds of millions of dollars. YouTube was also found liable in the Los Angeles case. The firms are appealing, putting Section 230 protections in question.
Actors Union Demands Tilly Tax on AI Characters
The Hollywood actors’ union is bargaining for a ‘Tilly Tax’ on AI film characters. The union's head says organized labor acts as a check on AI use while US adoption outpaces regulation. This highlights labor's role in shaping AI deployment.
Apple May Restart YMTC NAND for China iPhones
Apple may resume its partnership with China's YMTC amid US export curbs and soaring memory prices that are hurting iPhone margins. It is eyeing a shift to domestic NAND for China models. 12GB LPDDR5X chips now cost up to $70 each.
Qujing ATaaS Launches Trillion-Token Daily Factory
Qujing has launched the ATaaS platform, a 'Token factory' boasting a production capacity of a trillion tokens per day. Academician Zheng Weimin offers insights into emerging Token service trends.
Lag State: Citation Graphs Indexing Blind Spot
Researchers coined 'lag state' for papers recently cited but not yet indexed in major databases like Semantic Scholar, creating systematic gaps in citation graphs. This biases automated literature review tools, especially for frontier ML work using graph embeddings. Related 'cold node' modes undervalue bridging papers.
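One way to picture the gap described above is to flag every cited paper that is missing from an index. The sketch below does only that: the paper IDs, the function name, and the edge-list input format are hypothetical, and it ignores the "recently cited" time window and the related 'cold node' modes that the researchers also describe.

```python
def find_lag_state(indexed_ids, citations):
    """Map each cited-but-unindexed paper ID to the list of papers that cite it.

    citations is an iterable of (citing_id, cited_id) pairs.
    """
    indexed = set(indexed_ids)
    lagging = {}
    for citing, cited in citations:
        if cited not in indexed:
            lagging.setdefault(cited, []).append(citing)
    return lagging

# Hypothetical IDs for illustration only.
indexed = ["paper_a", "paper_b", "paper_c"]
edges = [
    ("paper_a", "paper_b"),
    ("paper_b", "paper_x"),  # paper_x is cited but not indexed
    ("paper_c", "paper_x"),
]
print(find_lag_state(indexed, edges))
```

A literature review tool that walks only indexed nodes would never reach such a paper, even though two indexed papers already point at it.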
TurboQuant VRAM Edge Over LM Studio Tested
A benchmark on dual 3090s at 16k context shows TurboQuant using 1.8GB of VRAM versus LM Studio's 5.4GB. TurboQuant nearly matches LM Studio on recall tasks (79/85 vs 85/85) but is slightly slower in tok/s. It is a good tradeoff for VRAM-limited setups.
Snow Fox Rescue AI Videos Spark Creation Frenzy
AI-generated short videos in Shaw Brothers wuxia style, featuring viral lines like 'Have you rescued a fox on snow mountain?' and 'I'm not a fox, I'm sauce plate duck!', have swept Chinese social media. This has ignited a nationwide AI content creation boom with mass participation. The article explores key insights from the phenomenon.
4x32GB vs 2x64GB RAM for AI Workloads
A user with an AMD 9950X3D who plans to buy an RTX 5080 or 5090 seeks advice on a RAM upgrade: cheaply add 2x32GB DDR5-6000 (for 4x32GB total) or buy a pricier 2x64GB kit at 5600 MT/s CL40. They ask whether a 4-DIMM configuration would slow down model offloading to RAM and gaming.
Alibaba Launches AI Digital Workforce for Taobao
Alibaba is set to launch AI agent services for millions of merchants on Taobao and Tmall by end of March. Built on Business Advisor, it provides a 24/7 autonomous digital workforce to automate operations. This capitalizes on the AI frenzy to strengthen e-commerce leadership.
AI Schism: Tech vs Labor in DC
A major schism over AI is gripping Washington as tech leaders and labor groups compete for influence. Silicon Valley executives, Trump administration officials, and Congress members gathered at a historic auditorium to praise AI's virtues. The event underscores tensions in shaping AI policy.