Back to Glossary
AdvancedAI Glossary

Speech Recognition

Quick Answer

Speech recognition, also known as automatic speech recognition (ASR), converts spoken audio into text. Modern ASR handles accents, background noise, multiple speakers and domain-specific vocabulary far better than systems from even a few years ago.

In Depth

What Speech Recognition really means

Contemporary ASR systems are based on end-to-end neural networks trained on tens of thousands of hours of audio. They support real-time transcription, speaker diarisation (who said what) and word-level timestamps.

Speech recognition is a foundation for voice assistants, call-centre analytics, meeting transcription and accessibility features. Accuracy varies significantly by domain; legal, medical and regional British accents often benefit from custom adaptation.

Why It Matters

Business relevance for UK organisations

UK contact centres use speech recognition to analyse 100% of calls for compliance, quality and customer intent — previously only a small sample could be reviewed manually.

Real-world example

How this shows up in practice

A Glasgow contact centre deployed speech recognition across 120,000 monthly calls, identifying a single mis-scripted sales line responsible for 18% of post-call complaints.