A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
(0) - 1 freq
(1) - 3 freq
á (1) - 4 freq
(1) - 2 freq
lek (2) - 101 freq
leh (2) - 3 freq
(2) - 2 freq
lwe (2) - 1 freq
ld (2) - 3 freq
lw (2) - 8 freq
lea (2) - 155 freq
lze (2) - 1 freq
lis (2) - 1 freq
lxb (2) - 1 freq
lyh (2) - 1 freq
lax (2) - 2 freq
ltd (2) - 8 freq
ldl (2) - 2 freq
lle (2) - 1 freq
lya (2) - 1 freq
ldj (2) - 1 freq
(2) - 1 freq
ö (2) - 6 freq
æ (2) - 2 freq
lem (2) - 5 freq
Double Levenshtein
(0) - 1 freq
á (2) - 4 freq
(2) - 2 freq
(2) - 3 freq
lh (4) - 1 freq
lt (4) - 5 freq
(4) - 3 freq
lp (4) - 8 freq
llb (4) - 1 freq
ár (4) - 1 freq
lu (4) - 17 freq
lvu (4) - 1 freq
(4) - 3 freq
lef (4) - 7 freq
lxq (4) - 1 freq
lke (4) - 20 freq
lbk (4) - 1 freq
lms (4) - 1 freq
(4) - 1 freq
lan (4) - 138 freq
lup (4) - 1 freq
lut (4) - 59 freq
low (4) - 210 freq
lab (4) - 29 freq
lr (4) - 5 freq
SoundEx code - L000
lee - 155 freq
lay - 385 freq
lea - 155 freq
l - 178 freq
law - 291 freq
'll - 85 freq
low - 210 freq
lie - 259 freq
lowe - 110 freq
lou - 31 freq
ll - 69 freq
la - 116 freq
loo - 26 freq
li - 21 freq
lue - 5 freq
leo - 7 freq
lae - 28 freq
le - 53 freq
'la - 4 freq
lui - 2 freq
law' - 2 freq
'lea - 6 freq
'lay - 1 freq
lew - 5 freq
laa - 114 freq
lo - 32 freq
louie - 4 freq
ley - 3 freq
lhe - 1 freq
leh - 3 freq
lu - 17 freq
ly - 7 freq
lah - 1 freq
leia - 1 freq
ïll - 51 freq
ïlle - 1 freq
lee' - 1 freq
lieu - 2 freq
'loo - 1 freq
'l - 5 freq
leah - 1 freq
'loo' - 1 freq
lle - 1 freq
- 3 freq
lye - 8 freq
- 2 freq
laow - 1 freq
lll - 3 freq
lao - 3 freq
la - 2 freq
-ly - 1 freq
loe - 15 freq
ll - 1159 freq
lla - 1 freq
l - 85 freq
la - 4 freq
lo'e - 16 freq
loue - 1 freq
lei - 1 freq
le - 4 freq
law - 1 freq
l - 3 freq
'll - 1 freq
- 1 freq
ll - 1 freq
lea - 1 freq
ly - 4 freq
lea - 3 freq
lll - 1 freq
lea' - 1 freq
ll - 2 freq
lly - 2 freq
æl - 1 freq
l - 1 freq
lya - 1 freq
lw - 8 freq
lo’e - 1 freq
loui - 1 freq
lau - 1 freq
lha - 1 freq
le'e - 1 freq
lwe - 1 freq
lyh - 1 freq
lwo - 1 freq
lh - 1 freq
lyo - 2 freq
MetaPhone code - L
lee - 155 freq
lay - 385 freq
lea - 155 freq
l - 178 freq
law - 291 freq
'll - 85 freq
low - 210 freq
lie - 259 freq
lou - 31 freq
yl - 4 freq
ll - 69 freq
la - 116 freq
loo - 26 freq
li - 21 freq
lue - 5 freq
leo - 7 freq
wll - 1 freq
lae - 28 freq
wl - 6 freq
le - 53 freq
y'll - 3 freq
hle - 1 freq
'la - 4 freq
lui - 2 freq
wyllie - 5 freq
hwyl - 1 freq
law' - 2 freq
'lea - 6 freq
wylie - 4 freq
'lay - 1 freq
lew - 5 freq
laa - 114 freq
hyll - 1 freq
lo - 32 freq
louie - 4 freq
ley - 3 freq
leh - 3 freq
lu - 17 freq
ly - 7 freq
lah - 1 freq
leia - 1 freq
ïll - 51 freq
hïll - 7 freq
ïlle - 1 freq
lee' - 1 freq
lieu - 2 freq
'loo - 1 freq
hl - 1 freq
'l - 5 freq
leah - 1 freq
'loo' - 1 freq
lle - 1 freq
- 3 freq
- 2 freq
laow - 1 freq
y'ill - 1 freq
lll - 3 freq
wyle - 5 freq
lao - 3 freq
yøl - 2 freq
la - 2 freq
-ly - 1 freq
yöl - 1 freq
loe - 15 freq
yle - 2 freq
ll - 1159 freq
lla - 1 freq
l - 85 freq
la - 4 freq
hóli - 1 freq
lo'e - 16 freq
wyll - 2 freq
loue - 1 freq
lei - 1 freq
le - 4 freq
law - 1 freq
l - 3 freq
'll - 1 freq
- 1 freq
ll - 1 freq
lea - 1 freq
ly - 4 freq
lea - 3 freq
lll - 1 freq
lea' - 1 freq
y'lll - 1 freq
ll - 2 freq
lly - 2 freq
w'll - 6 freq
æl - 1 freq
l - 1 freq
lw - 8 freq
lo’e - 1 freq
loui - 1 freq
wle - 1 freq
y’il - 1 freq
y'il - 2 freq
y'all - 1 freq
lau - 1 freq
le'e - 1 freq
y-l - 1 freq
hlai - 1 freq
hly - 1 freq
wlo - 1 freq
wly - 1 freq
lyh - 1 freq
lh - 1 freq

Time to execute Levenshtein function - 0.184893 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
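As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code behind this page; the function name `levenshtein` is chosen for the example, and it counts Unicode characters, which may differ from a byte-based implementation for accented letters.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the number of single-character edits needed to turn a into b."""
    # Keep the shorter word in b so the working rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for pair in (("sonsie", "sonsy"), ("lea", "lee"), ("kitten", "sitting")):
        print(pair, levenshtein(*pair))
```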
Time to execute Double Levenshtein function - 0.345821 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
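The idea can be sketched in Python as follows. This is an illustration under stated assumptions, not the site's own code: the vowel set (a, e, i, o, u, with y treated as a consonant) and the names `strip_vowels` and `double_levenshtein` are assumptions made for the example.

```python
VOWELS = set("aeiou")  # assumption: y and accented vowels are not stripped


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one row at a time."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Remove the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    print(double_levenshtein("lea", "lee"))  # vowel change: counted once
    print(double_levenshtein("lek", "lem"))  # consonant change: counted twice
```

With this scheme a changed vowel contributes only to the first distance, while a changed consonant contributes to both, which is where the double weight comes from.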
Time to execute SoundEx function - 0.027432 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
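For illustration, a compact Python sketch of the commonly described American Soundex rules follows. It is an assumption that the page uses this variant. The function name `soundex` and the choice to ignore any character outside a to z are made for the example only.

```python
# Consonant groups that share a Soundex digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Assumption for this sketch: drop anything that is not a plain a-z letter.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets the run, so a repeated code counts again
    return result.ljust(4, "0")


if __name__ == "__main__":
    for w in ("lee", "lay", "low", "Robert", "Rupert"):
        print(w, soundex(w))
```

Words made up of an initial l followed only by vowels, h, w or y all reduce to L000 under these rules, which is the code shown for the list above.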
Time to execute MetaPhone function - 0.036787 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000821 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
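A lookup of this kind might be sketched in Python as below. The table entries, the name `LEMMA_TABLE` and the function `curated_variants` are hypothetical and invented for the example; the real lexicon's contents and format are not shown on this page.

```python
# Hypothetical hand-made table: surface form -> lemma.
LEMMA_TABLE = {
    "hoose": "house",
    "hooses": "house",
    "housse": "house",  # treated here as a typo of the same lemma
}


def curated_variants(word: str, table: dict = LEMMA_TABLE) -> list:
    """Return the other forms that share a lemma with word, or [] if uncovered."""
    key = word.lower()
    lemma = table.get(key)
    if lemma is None:
        return []  # not every word is in the table
    return sorted(form for form, lem in table.items()
                  if lem == lemma and form != key)


if __name__ == "__main__":
    print(curated_variants("hoose"))
    print(curated_variants("sonsie"))
```

A dictionary lookup like this does no string comparison at all, which fits the very short execution time reported above.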