AdvancedAI Glossary

Speech Recognition

Quick Answer

Speech recognition, also known as automatic speech recognition (ASR), converts spoken audio into text. Modern ASR handles accents, background noise, multiple speakers and domain-specific vocabulary far better than systems from even a few years ago.

In Depth

What Speech Recognition really means

Contemporary ASR systems are based on end-to-end neural networks trained on tens of thousands of hours of audio. They support real-time transcription, speaker diarisation (who said what) and word-level timestamps.
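To make diarisation and word-level timestamps concrete, the sketch below groups timestamped words into speaker turns. It is a minimal illustration, not any particular vendor's output format: the Word fields and the group_turns helper are hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One recognised word (hypothetical schema for illustration)."""
    text: str
    start: float   # start time in seconds
    end: float     # end time in seconds
    speaker: str   # diarisation label, e.g. "agent" or "customer"


def group_turns(words):
    """Merge consecutive words from the same speaker into one turn."""
    turns = []
    for w in words:
        if turns and turns[-1]["speaker"] == w.speaker:
            # Same speaker is still talking: extend the current turn.
            turns[-1]["text"] += " " + w.text
            turns[-1]["end"] = w.end
        else:
            # Speaker changed (or first word): start a new turn.
            turns.append({"speaker": w.speaker, "start": w.start,
                          "end": w.end, "text": w.text})
    return turns


words = [
    Word("hello", 0.0, 0.4, "agent"),
    Word("there", 0.4, 0.8, "agent"),
    Word("hi", 1.1, 1.3, "customer"),
]
turns = group_turns(words)
```

Here turns holds two entries: one for the agent covering 0.0 to 0.8 seconds, and one for the customer covering 1.1 to 1.3 seconds.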

Speech recognition is a foundation for voice assistants, call-centre analytics, meeting transcription and accessibility features. Accuracy varies significantly by domain; legal and medical vocabulary and regional British accents often benefit from custom adaptation.
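ASR accuracy is commonly reported as word error rate (WER): the number of word substitutions, deletions and insertions needed to turn the system's output into a reference transcript, divided by the number of words in the reference. The sketch below is one simple way to compute it, assuming whitespace tokenisation and case-insensitive matching; the function name and sample sentences are illustrative only.

```python
def word_error_rate(reference, hypothesis):
    """WER = word-level edit distance / number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] = edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# A misheard medical term: "angina" recognised as "an china".
wer = word_error_rate("the patient has angina", "the patient has an china")
```

In this example one substitution and one insertion against a four-word reference give a WER of 0.5. Comparing WER on general speech with WER on domain-specific recordings is one way to judge whether custom adaptation is worth pursuing.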

Why It Matters

Business relevance for UK organisations

UK contact centres use speech recognition to analyse 100% of calls for compliance, quality and customer intent; previously, only a small sample could be reviewed manually.
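Once every call has a transcript, a basic compliance check can be automated. The sketch below flags calls whose transcript lacks a required disclosure phrase. The phrases, call IDs and function names are hypothetical, and real deployments would likely need fuzzier matching to cope with recognition errors.

```python
import re

# Hypothetical disclosures an agent is expected to read out.
REQUIRED_PHRASES = ["calls are recorded", "terms and conditions"]


def normalise(text):
    """Lower-case, strip punctuation and collapse whitespace."""
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", text.lower())
    return " ".join(cleaned.split())


def missing_phrases(transcript, required=REQUIRED_PHRASES):
    """Return the required phrases that do not appear in the transcript."""
    # Pad with spaces so phrases only match on whole-word boundaries.
    padded = f" {normalise(transcript)} "
    return [p for p in required if f" {p} " not in padded]


def flag_calls(transcripts):
    """Map call ID to its missing phrases, for non-compliant calls only."""
    flagged = {}
    for call_id, text in transcripts.items():
        missing = missing_phrases(text)
        if missing:
            flagged[call_id] = missing
    return flagged


transcripts = {
    "c1": "Hello. Calls are recorded. Please see our Terms and Conditions.",
    "c2": "Hello, how can I help?",
}
flagged = flag_calls(transcripts)
```

With these sample transcripts only call c2 is flagged, because it contains neither phrase.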

Real-world example

How this shows up in practice

A Glasgow contact centre deployed speech recognition across 120,000 monthly calls, identifying a single mis-scripted sales line responsible for 18% of post-call complaints.

Related Terms

Continue exploring

Advanced

Natural Language Processing (NLP)

Natural Language Processing is the field of AI concerned with interpreting, understanding and generating human language. NLP underpins chatbots, translation, summarisation, sentiment analysis, voice assistants and much of the productivity software UK teams now rely on daily.

Deep Learning

Customer Intelligence

Multimodal AI