AdvancedAI Glossary

OCR (Optical Character Recognition)

Quick Answer

OCR is the technology that extracts machine-readable text from images and scanned documents. Modern OCR is now closely paired with layout understanding and AI extraction, turning scanned documents into structured data for downstream systems.

In Depth

What OCR (Optical Character Recognition) really means

Traditional OCR focused on character recognition. Modern document AI combines OCR with language models to understand tables, forms and complex layouts, outputting structured fields rather than a wall of text.

Accuracy depends heavily on scan quality, language, layout complexity and whether the document is a handwritten form, a printed invoice, or a digitally generated PDF.

Why It Matters

Business relevance for UK organisations

UK finance, legal and insurance teams use OCR and document AI to eliminate hours of manual data entry from invoices, contracts, claims forms and onboarding documents.

Real-world example

How this shows up in practice

A Bristol insurance broker deployed document AI across inbound claims PDFs, cutting average data-entry time per claim from 14 minutes to under 90 seconds.

Related Terms

Continue exploring

Advanced

Computer Vision

Computer vision is the branch of AI concerned with enabling machines to interpret and act on visual information. It powers applications from quality inspection and medical imaging to retail analytics, autonomous vehicles and augmented reality.

Advanced

Natural Language Processing (NLP)

Natural Language Processing is the field of AI concerned with interpreting, understanding and generating human language. NLP underpins chatbots, translation, summarisation, sentiment analysis, voice assistants and much of the productivity software UK teams now rely on daily.

Business

Automation

Automation is the use of technology to perform tasks that would otherwise require human effort. AI extends traditional automation by handling tasks that involve judgment, unstructured data or natural language — not just deterministic, rule-based steps.

Basics

Deep Learning

Deep Learning is a branch of machine learning that uses multi-layered neural networks to learn highly complex patterns directly from raw data such as images, audio and text, without the need for hand-crafted feature engineering.