A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to leir in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
leir (0) - 20 freq
neir (1) - 24 freq
leif (1) - 2 freq
leer (1) - 4 freq
ler (1) - 1 freq
lei (1) - 1 freq
geir (1) - 14 freq
heir (1) - 44 freq
leit (1) - 15 freq
feir (1) - 7 freq
teir (1) - 10 freq
keir (1) - 9 freq
lir (1) - 16 freq
yeir (1) - 253 freq
weir (1) - 139 freq
leis (1) - 1 freq
deir (1) - 7 freq
peir (1) - 3 freq
letir (1) - 2 freq
leid (1) - 1517 freq
meir (1) - 7 freq
lair (1) - 57 freq
cleir (1) - 3 freq
lear (1) - 107 freq
leim (1) - 2 freq
leir (0) - 20 freq
lair (1) - 57 freq
lear (1) - 107 freq
ler (1) - 1 freq
lir (1) - 16 freq
leer (1) - 4 freq
lire (2) - 2 freq
laar (2) - 2 freq
laer (2) - 1 freq
reir (2) - 1 freq
liar (2) - 24 freq
beir (2) - 17 freq
lour (2) - 5 freq
leirs (2) - 1 freq
leia (2) - 1 freq
elr (2) - 1 freq
lri (2) - 1 freq
loire (2) - 8 freq
lr (2) - 5 freq
loor (2) - 2 freq
laur (2) - 1 freq
leary (2) - 1 freq
eir (2) - 18 freq
lere (2) - 2 freq
leear (2) - 16 freq
SoundEx code - L600
leerie - 22 freq
lory - 42 freq
lower - 57 freq
leear - 16 freq
lure - 6 freq
lair - 57 freq
lawer - 10 freq
lere - 2 freq
larry - 105 freq
lawyer - 33 freq
'leear - 1 freq
lairie - 2 freq
laayer - 5 freq
leir - 20 freq
liar - 24 freq
lorry - 14 freq
lear - 107 freq
lour - 5 freq
lore - 12 freq
leary - 1 freq
lara - 1 freq
laura - 190 freq
laur - 1 freq
lora - 1 freq
leer - 4 freq
laar - 2 freq
lare - 7 freq
layower - 1 freq
loire - 8 freq
loor - 2 freq
layer - 13 freq
lowry - 2 freq
lowrie - 19 freq
lyrie - 1 freq
lowra - 1 freq
lyare - 1 freq
lire - 2 freq
lauri - 1 freq
laer - 1 freq
lee-er - 1 freq
l'or' - 1 freq
lyre - 1 freq
'larry - 1 freq
lawrie - 4 freq
larrie - 2 freq
lori - 1 freq
larrea - 1 freq
lir - 16 freq
lr - 5 freq
lurie - 1 freq
ler - 1 freq
lri - 1 freq
'lorry - 1 freq
'lower - 1 freq
MetaPhone code - LR
leerie - 22 freq
lory - 42 freq
leear - 16 freq
lure - 6 freq
lair - 57 freq
lere - 2 freq
larry - 105 freq
'leear - 1 freq
lairie - 2 freq
leir - 20 freq
liar - 24 freq
lorry - 14 freq
lear - 107 freq
lour - 5 freq
lore - 12 freq
leary - 1 freq
lara - 1 freq
laura - 190 freq
laur - 1 freq
lora - 1 freq
leer - 4 freq
laar - 2 freq
lare - 7 freq
loire - 8 freq
loor - 2 freq
lowry - 2 freq
lowrie - 19 freq
lyrie - 1 freq
lowra - 1 freq
lire - 2 freq
lauri - 1 freq
laer - 1 freq
lee-er - 1 freq
l'or' - 1 freq
lyre - 1 freq
'larry - 1 freq
lawrie - 4 freq
larrie - 2 freq
lori - 1 freq
larrea - 1 freq
lir - 16 freq
lr - 5 freq
lurie - 1 freq
ler - 1 freq
wler - 1 freq
lri - 1 freq
'lorry - 1 freq
LEIR
Time to execute Levenshtein function - 0.178052 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.323791 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027498 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036600 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000812 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.