AI Turns Shouts into Calm Voices

💡 SoftBank is developing audio AI that softens shouting callers' voices to shield service staff from harassment
⚡ 30-Second TL;DR
What Changed
SoftBank's AI converts a shouting caller's voice into a calmer one to protect staff from customer harassment.
Why It Matters
The technology could significantly reduce workplace stress in service industries by softening abusive calls before operators hear them. It could also encourage adoption of real-time audio processing AI for worker safety.
What To Do Next
Evaluate SoftBank's voice AI demo for possible integration into your customer support pipeline.
🧠 Deep Insight
Web-grounded analysis with 7 cited sources.
🔑 Enhanced Key Takeaways
- SoftBank's emotion-canceling AI was developed over three years and trained on 10,000+ voice data samples from 10 actors recording 100+ common phrases, including screams, accusations, threats, and apology demands[1][2]
- The two-stage system first identifies angry voices and extracts key speech points, then uses acoustic tools to transform intonation into natural, polite tones while preserving the original words[1][2]
- The technology was inspired by a television program highlighting verbal abuse against call center staff, with developer Toshiyuki Nakatani specifically motivated to protect workers from customer harassment[2]
📊 Competitor Analysis
| Feature | SoftBank Emotion Canceling | Voicekiller | ElevenLabs Calm Voice Changer | YourVoic |
|---|---|---|---|---|
| Primary Use Case | Call center harassment mitigation | Creative voice direction/TTS | General voice transformation | Emotional text-to-speech |
| Training Data | 10,000+ voice samples (anger-focused) | Not specified | Not specified | 1000+ voices across 93+ languages |
| Emotional Control | Anger-to-calm conversion | Ultra-precise emotion control via instructions | Calm tone customization | Emotion-driven speech synthesis |
| Word Preservation | Yes (intonation only) | No (full speech control) | Yes | Yes |
| Implementation Status | Development phase (timeline unclear)[1] | Commercial product | Commercial product | Commercial product |
🛠️ Technical Deep Dive
- Two-stage architecture: Stage 1 uses AI voice-processing to identify angry callers and analyze speech characteristics; Stage 2 incorporates acoustic features of non-threatening voices to create calmer tones[2]
- Training methodology: 10 actors recorded 100+ common phrases expressing anger and frustration (including accusations, threats, and apology demands), generating 10,000+ total voice data samples[1][2]
- Intonation modification: System significantly softens intonation while preserving original words and maintaining traces of anger so operators understand the situation[2]
- Acoustic processing: Uses acoustic tools to transform aggressive speech into natural, even polite tones[1]
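
The two-stage flow described above can be illustrated with a toy sketch. This is not SoftBank's implementation: the sources do not describe the features, thresholds, or models used, so everything here is an assumption for illustration. The sketch assumes per-frame loudness (RMS) and pitch (Hz) values have already been extracted from the audio, uses a hypothetical loudness-plus-pitch-spread heuristic for Stage 1, and models Stage 2 as pulling the pitch contour toward its median. The hypothetical `keep` parameter leaves a fraction of the original pitch swing so that a trace of anger remains, as the article describes.

```python
from statistics import mean, median, pstdev


def looks_angry(rms_levels, pitch_hz, rms_threshold=0.5, pitch_spread_hz=40.0):
    """Stage 1 (hypothetical heuristic): flag loud speech with wide pitch swings.

    Both thresholds are made-up illustrative values, not figures from the sources.
    """
    return mean(rms_levels) > rms_threshold and pstdev(pitch_hz) > pitch_spread_hz


def soften_intonation(pitch_hz, keep=0.3):
    """Stage 2 (toy model): shrink each frame's deviation from the median pitch.

    keep=1.0 leaves the contour unchanged; keep=0.0 flattens it completely.
    An intermediate value keeps a trace of the original agitation.
    """
    center = median(pitch_hz)
    return [center + keep * (f - center) for f in pitch_hz]


def process_call(rms_levels, pitch_hz, keep=0.3):
    """Soften the pitch contour only when Stage 1 flags the caller as angry."""
    if looks_angry(rms_levels, pitch_hz):
        return soften_intonation(pitch_hz, keep)
    return list(pitch_hz)


if __name__ == "__main__":
    shouted_rms = [0.80, 0.90, 0.85, 0.95, 0.80]
    shouted_pitch = [220.0, 320.0, 180.0, 360.0, 240.0]
    softened = process_call(shouted_rms, shouted_pitch)
    print("pitch spread before:", round(pstdev(shouted_pitch), 1))
    print("pitch spread after: ", round(pstdev(softened), 1))
```

Only the intonation contour changes in this sketch; the words are untouched because nothing here operates on the speech content. A production system would need to resynthesize audio from the modified contour, which is outside the scope of this example.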
📎 Sources (7)
Factual claims are grounded in the sources below. Forward-looking analysis is AI-generated interpretation.
Original source: ITmedia AI+ (Japan)

