easy portal contact
Language Switch

Glossary

Phonetic Search: Sound of Spelling

Phonetic Search is the technology that forgives typos. Unlike rigid full-text search, which matches exact terms, this phonetic search procedure is guided by the sound of a word.

This method is indispensable, particularly in Document Management Systems and Enterprise Content Management. It is a central component in any search solution that has to handle unclean data, different spellings, or transcribed names. Thus, one can find client names, product designations, or addresses, even with incorrect input.

What is Phonetic Search?

Phonetic Search is a specialized search procedure. It finds matches based on the phonetic sound (pronunciation) of words, rather than their exact spelling. It thus overcomes the hurdle of dialects, transcription errors, and human typos.

How Does Phonetic Searching Work?

The mechanism is based on a so-called Phonetic Code (or Sound Code). The system does not compare the search term directly with the indexed word. Instead, both the search term and all words in the index are translated into an alphanumeric code. This code represents the phonetic sound of the respective word and forms the basis for the phonetic search.

The Phonetic Search is then performed exclusively by comparing these codes.

Phonetic Codes: The Key to Fault Tolerance

Letters that sound similar receive the same phonetic code. For example, vowels often receive a zero code, as they only minimally influence pronunciation but are often the source of spelling errors. This leads to a match even if the spelling in the document differs.

Common Mechanisms: The Cologne Phonetics

Internationally, the Soundex algorithm is well-known. However, in the German-speaking area, the Cologne Phonetics method has established itself as the standard. This method is optimized for the specific characteristics of German pronunciation.

The Cologne Phonetics translates a word into a sequence of numbers, where the assignment of the number represents the sound:

Letter(s)CodeRepresented SoundExample of Assignment
Vowels (A, E, I, O, U, Y)0(No relevant consonant sound)Vowels are ignored.
C, Z, S (before A, O, U)8S- and Hissing SoundsCesare, Zeit, Sache
D, T2T- and D-SoundsDokument, Text
G, K, Q4G-, K-SoundsGut, Kunde, Quelle

Example: Searching by Code

The various spellings of the name “Maier” receive the same code through Cologne Phonetics:

NameTranslation StepsFinal Code
MeierM (6) – E(0) – I(0) – E(0) – R(7)67
MayerM (6) – A(0) – Y(0) – E(0) – R(7)67
MaierM (6) – A(0) – Y(0) – E(0) – R(7)67

The search procedure only searches for code 67 in the index. Thus, it finds all variants and ensures that the correct client file is displayed.

Advantages in the Corporate Routine

Phonetic search compensates for the natural weaknesses of human data entry:

  • Higher Hit Rate: The probability of finding a match increases significantly, as spelling errors are no longer an exclusion criterion.
  • Fault Tolerance: It reliably captures differing spellings of proper names, company names, or product names (e.g., due to poor handwriting or dialects).
  • Better User Experience: The search is more intuitive. The employee does not need to know how the name was stored in the system.

Phonetic Search ideally supplements the rigid Full-Text Search with a layer of fault tolerance. However, it is not yet Semantic Search, it recognizes the sound, not the actual meaning or context.

easyarchive

Archive data securely and compliant.

Discover easy archive

easyDMS

Mann arbeitet mit easy DMS

Manage documents easily and efficiently.

Discover easy DMS
Newsroom Media Library Glossary
Newsletter

We will keep you regularly up to date. Subscribe to our newsletter and find out everything you need to know about the digitization of business processes. The topics will be prepared for you in a tailor-made and varied way.

Newsletter subscription