All Updates
Page 1257 of 1414
February 28, 2026
ML Engineers: 2026 Industry Expectations
A Reddit post in r/MachineLearning seeks discussion on industry expectations for Machine Learning Engineers in 2026. Submitted by u/AccomplishedCat4770 with link to comments. No detailed content provided in the excerpt.
Proprietary Fine-Tuning Deployment Nightmares
Fine-tuning on sensitive client data faces massive non-technical hurdles like data access logs, compliance, and contracts. Providers like Together, Fireworks, Replicate require legal reviews delaying ML work. DeepInfra chosen for quick compliance clearance.
Alaska Passes AI CSAM and Kids Social Media Bill
Alaska's House unanimously passed HB47, banning AI-generated CSAM and deepfakes of children while imposing social media limits on minors like curfews from 10:30 PM to 6:30 AM, addictive feature bans, and age verification with parental consent. The bill, amended from its original AI focus, heads to the Senate after modeling Utah's approach. Some representatives raised concerns about broad rules without platform input.
BYD Denza Z9 GT Claims World's Longest EV Range
BYD's Denza Z9 GT claims the world's longest-range EV title. It pushes electric driving beyond 1,000 km on a single charge. This challenges Tesla as its biggest rival in range.
Demand for 60-70B MoE with 8-10B Active Params
A Reddit post in r/LocalLLaMA asks if others await a 60-70B Mixture of Experts (MoE) model with 8-10B activated parameters. This size is seen as ideal for 64GB VRAM, potentially matching closed 'flash' models' performance. The poster notes the absence of mid-sized MoE models between ~30B and ~120B.
ChatGPT Misleads Health Outbreak Probe
Health officials investigating a puzzling outbreak consulted ChatGPT, which suggested cold beer and contaminated ice as culprits. The AI convincingly provided an explanation that steered the probe astray. This incident highlights risks of LLM hallucinations in critical investigations.
Qwen3.5 35B-A3B Replaces Dual-Model Agents
Qwen3.5-35B-A3B single-handedly handles reasoning, coding, and analysis on M1 64GB Mac, outperforming prior two-model setups. Runs at 27 tok/s via llama.cpp, delivering better Amazon sales analysis results. Disables thinking mode for optimal speed-quality balance.
Qwen3.5-35B Tops VLLM A100 Benchmarks
Benchmarks on dual A100 40GB GPUs with VLLM show Qwen3.5-35B-A3B-AWQ-4bit-FlashInfer as fastest at 352 tok/s output and 1357 total tok/s. Qwen3-30B variants lag behind, with FP8 FlashAttn slowest. Tests cover FP8/AWQ quant and FlashAttn/FlashInfer attentions.
Tiny Transformers Perfectly Add 10-Digit Numbers
Researchers created transformers with fewer than 100 parameters achieving 100% accuracy on adding two 10-digit numbers using digit tokens. The project highlights extreme efficiency in specialized models. Floating-point arithmetic poses greater challenges.
Distilled CLAP for Fast Text-Music Search
AudioMuse-AI-DCLAP is a distilled LAION CLAP model optimized for music, shrinking audio tower from 295MB/80M params to 23MB/7M params with 2-3x speed gain. It projects text and songs into shared 512D space for search. Open-source ONNX model available on GitHub, soon integrating into AudioMuse-AI for playlists.
OpenAI Wins Pentagon Deal After Anthropic Ethics Exit
OpenAI has struck a deal to supply AI to the Pentagon's classified US military networks. This follows Trump's order to stop using rival Anthropic, which backed out over concerns about mass surveillance and autonomous weapons. CEO Sam Altman assured the military won't use it for killing systems or surveillance.
PhD Committee: Fame vs Availability?
AI/ML PhD student debates including a renowned professor with industry ties but poor availability on committee. Seeks advice on impact for industry RS or academia jobs. Prioritizing engagement may yield stronger letters despite less name recognition.
Google: Longer CoT Hurts Accuracy (-0.54)
Google's paper reveals longer chain-of-thought reasoning correlates negatively with accuracy (-0.54) across models like GPT-OSS and Qwen3. They introduce DTR to measure deep thinking fraction and Think@n strategy for efficient sampling, cutting compute by 50% with better results.
OpenAI Fires Employee for Polymarket Insider Use
OpenAI confirmed firing an employee who used confidential information for trades on prediction market Polymarket. This violated the company's code prohibiting personal gain from internal info, including prediction markets. No employee name was disclosed.
RFK Jr. Praises Carbon Robotics Weed-Zappers
Seattle startup Carbon Robotics received another endorsement from U.S. Secretary of Health Robert F. Kennedy Jr. for its weed-zapping robots. RFK Jr. highlighted the machines on a new episode of The Joe Rogan Experience podcast.
AI: Power's New Allocation Engine
AI automates high-stakes decisions, reshaping power in war and governance. US invokes Defense Production Act against Anthropic for military AI access, shifts contract to OpenAI. Laser weapons' low costs enable AI 'shoot-first' autonomy risks.
Google Gemini Tests Goal-Based Learning Hub
Google is testing a new Learning Hub in Gemini powered by goal-based actions. The 'Goal Scheduled Actions' feature was briefly revealed in the Gemini app. It hints at adaptive AI-driven goal management and personalized learning support.
SOMs Enable Multi-Dir Refusal Ablation
New PR to heretic repo uses Self-Organizing Maps to suppress refusals by mapping multi-directional clusters in latent space. Achieves 3/100 refusals on GPT-OSS-20B (KL 0.12) and strong results on Qwen3.5-27B (18/100, KL 0.028). Technique bakes changes into weights, stacking with other abliteration methods.
AI Agents Replace Courses for IP Monetization
By 2026, selling courses fails amid AI commoditizing knowledge; pivot to custom AI agents encapsulating niche expertise for interactive, real-time coaching. Clients buy proven instincts integrated into AI, not generic content. Focus on experiential learning via personalized AI clones.
Qwen3.5-35B Crashes with Claude Code on llama.cpp
User reports model unloading and crashes during full prompt reprocessing with claude code on llama.cpp build b8179. Issue occurs on 2nd/3rd prompt. OpenCode works fine and outperforms 4.7-flash.