A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to john� in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
johnne (3) - 1 freq
johnson (3) - 59 freq
johnnys (3) - 1 freq
johnmcd (3) - 2 freq
johne (3) - 1 freq
johnneh (3) - 6 freq
johni (3) - 1 freq
john's (3) - 43 freq
johnnie (3) - 28 freq
johns (3) - 9 freq
johnny (3) - 111 freq
john (3) - 805 freq
johnnyd (3) - 1 freq
johnie (3) - 9 freq
johnsmi (3) - 12 freq
johnwest (4) - 4 freq
joan's (4) - 2 freq
joined (4) - 48 freq
johnwick (4) - 1 freq
join (4) - 131 freq
johanna (4) - 2 freq
joint (4) - 52 freq
joanne (4) - 1 freq
jonas (4) - 2 freq
johnegan (4) - 1 freq
johnny (6) - 111 freq
johns (6) - 9 freq
johnnyd (6) - 1 freq
johnie (6) - 9 freq
johnne (6) - 1 freq
johnsmi (6) - 12 freq
johnnie (6) - 28 freq
john (6) - 805 freq
johnnys (6) - 1 freq
john's (6) - 43 freq
johnson (6) - 59 freq
johne (6) - 1 freq
johnmcd (6) - 2 freq
johni (6) - 1 freq
johnneh (6) - 6 freq
johnnygf (7) - 1 freq
joahny (7) - 13 freq
johnegan (7) - 1 freq
johnny's (7) - 5 freq
johnston (7) - 19 freq
johann (7) - 4 freq
joahnny (7) - 3 freq
johannes (7) - 1 freq
johnwest (7) - 4 freq
johonnie (7) - 1 freq
SoundEx code - J500
jam - 84 freq
jyne - 116 freq
jeannie - 91 freq
'jonah' - 1 freq
john - 805 freq
join - 131 freq
june - 102 freq
jean - 169 freq
jan - 27 freq
jeanie - 34 freq
johnnie - 28 freq
jona - 1 freq
juin - 25 freq
jamie - 131 freq
jim - 235 freq
jine - 37 freq
jimmy - 139 freq
'jean - 4 freq
jammy - 19 freq
johnny - 111 freq
'johnny - 1 freq
jenny - 178 freq
joana - 1 freq
jane - 50 freq
jannie - 10 freq
jonah - 30 freq
jenna - 5 freq
joan - 30 freq
'john - 6 freq
'jenny - 1 freq
jawin - 2 freq
jon - 4 freq
jeenie - 1 freq
jeyn - 1 freq
jimmie - 9 freq
jiyn - 1 freq
jaune - 1 freq
janny - 23 freq
jum - 3 freq
jn - 11 freq
jinny - 28 freq
jeemie - 61 freq
jimmy' - 1 freq
jain - 1 freq
jun - 7 freq
jennie - 7 freq
jhone - 71 freq
jshone - 1 freq
jannai - 3 freq
johanna - 2 freq
jammie - 2 freq
joan' - 1 freq
jm - 57 freq
juno - 2 freq
jhon - 2 freq
joanna - 7 freq
jön - 1 freq
jayne - 2 freq
'jeannie - 2 freq
jaanie - 1 freq
jame - 3 freq
johnie - 9 freq
jonny - 46 freq
'jam - 1 freq
jowin - 4 freq
johann - 4 freq
jeemy - 3 freq
johnne - 1 freq
johne - 1 freq
jemmy - 5 freq
jen - 6 freq
joannie - 1 freq
‘jimmy - 1 freq
jemmie - 4 freq
joanie - 1 freq
‘john - 2 freq
jonnie - 1 freq
jen- - 1 freq
joni - 3 freq
joahny - 13 freq
joahnny - 3 freq
jeamie - 9 freq
jaimie - 4 freq
'jimmy - 5 freq
‘jenny - 2 freq
jaune' - 1 freq
“jonnie - 1 freq
johonnie - 1 freq
jeemmie - 1 freq
“johnny - 1 freq
“john - 2 freq
jeanne - 1 freq
j'aime - 1 freq
jimi - 1 freq
joom - 1 freq
jenni - 1 freq
jiim - 1 freq
janey - 10 freq
jom - 1 freq
jnw - 1 freq
jcwme - 1 freq
jxn - 1 freq
jgzn - 1 freq
janie - 1 freq
johni - 1 freq
jeane - 1 freq
jamma - 1 freq
jkscwem - 1 freq
jynuuh - 1 freq
jqn - 1 freq
jcqn - 1 freq
joanne - 1 freq
jin - 1 freq
jaan - 1 freq
johnneh - 6 freq
MetaPhone code - JN
giein - 437 freq
gin - 1987 freq
gien - 1024 freq
jyne - 116 freq
jeannie - 91 freq
'jonah' - 1 freq
john - 805 freq
gein - 37 freq
join - 131 freq
'gin - 37 freq
june - 102 freq
jean - 169 freq
jan - 27 freq
'giein - 3 freq
jeanie - 34 freq
johnnie - 28 freq
geein - 42 freq
jona - 1 freq
juin - 25 freq
gean - 10 freq
jine - 37 freq
gie'in - 2 freq
gi'en - 7 freq
'jean - 4 freq
johnny - 111 freq
'johnny - 1 freq
jenny - 178 freq
joana - 1 freq
jane - 50 freq
jannie - 10 freq
jonah - 30 freq
jenna - 5 freq
genie - 7 freq
joan - 30 freq
hygiene - 8 freq
geen - 77 freq
'john - 6 freq
'jenny - 1 freq
jon - 4 freq
giein' - 4 freq
jeenie - 1 freq
jeyn - 1 freq
jiyn - 1 freq
jaune - 1 freq
genoa - 2 freq
janny - 23 freq
jn - 11 freq
jinny - 28 freq
gene - 2 freq
gen - 11 freq
ginnae - 1 freq
jain - 1 freq
jun - 7 freq
jennie - 7 freq
jannai - 3 freq
gi'n - 3 freq
joan' - 1 freq
gin' - 1 freq
juno - 2 freq
joanna - 7 freq
jön - 1 freq
giean - 4 freq
gie'n - 2 freq
jayne - 2 freq
'jeannie - 2 freq
jaanie - 1 freq
johnie - 9 freq
geian - 1 freq
jonny - 46 freq
johnne - 1 freq
johne - 1 freq
gie-in - 2 freq
jen - 6 freq
joannie - 1 freq
‘gien - 2 freq
‘gin - 4 freq
joanie - 1 freq
“gien - 1 freq
…gin - 1 freq
“gin - 25 freq
‘john - 2 freq
jonnie - 1 freq
jen- - 1 freq
gey-an - 1 freq
joni - 3 freq
joahny - 13 freq
joahnny - 3 freq
‘jenny - 2 freq
jaune' - 1 freq
“jonnie - 1 freq
gein' - 1 freq
“johnny - 1 freq
“john - 2 freq
jeanne - 1 freq
gena - 1 freq
jenni - 1 freq
janey - 10 freq
geeeeeeoannnnn - 1 freq
jnw - 1 freq
gino - 1 freq
genny - 1 freq
janie - 1 freq
johni - 1 freq
jeane - 1 freq
gien” - 2 freq
jynuuh - 1 freq
gioni - 1 freq
joanne - 1 freq
jin - 1 freq
jaan - 1 freq
johnneh - 6 freq
JOHN�
Time to execute Levenshtein function - 0.375255 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
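As a rough illustration of the definition above, the distance can be computed with the standard dynamic-programming approach. This is only a sketch: the function name `levenshtein` is chosen here for illustration, it compares Unicode characters one by one, and it is not necessarily how this site computes its figures.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # Keep the shorter word as b so each row stays as small as possible.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("ablow", "below"))     # 2
print(levenshtein("kitten", "sitting"))  # 3
```

Only two rows of the table are kept at a time, so memory use grows with the length of the shorter word rather than with the product of both lengths.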
Time to execute Double Levenshtein function - 0.690544 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
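The idea can be sketched as follows. Several details here are assumptions made for illustration and are not confirmed by the page: the vowel set is taken to be a, e, i, o, u (so y counts as a consonant), and the names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (two-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Remove every vowel, leaving only the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


# A vowel change is counted once, a consonant change twice.
print(double_levenshtein("jean", "joan"))  # 1  (1 + 0)
print(double_levenshtein("john", "join"))  # 2  (1 + 1)
```

Under these assumptions, spelling variants that differ only in their vowels score lower than variants that change a consonant.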
Time to execute SoundEx function - 0.030294 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
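For illustration, here is a minimal sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. Whether this site uses exactly these rules is an assumption, and the function name `soundex` is chosen here only for the example.

```python
# Consonant groups that share a Soundex digit; vowels, h, w and y have no digit.
CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a repeat after it counts again
    return (result + "000")[:4]


print(soundex("john"))     # J500
print(soundex("jimmy"))    # J500
print(soundex("johnson"))  # J525
```

Because m and n share the digit 5, words such as john, jam and jimmy all reduce to J500 under these rules.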
Time to execute MetaPhone function - 0.043536 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000894 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
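A lookup of this kind might be sketched as below. The entries in `LEXICON` are invented purely for illustration and are not taken from the site's actual table. The names `build_index` and `curated_neighbours` are likewise hypothetical.

```python
from collections import defaultdict

# Hypothetical hand-made entries: surface form -> lemma.
LEXICON = {
    "john": "JOHN",
    "jhon": "JOHN",
    "johne": "JOHN",
    "jean": "JEAN",
}


def build_index(lexicon: dict[str, str]) -> dict[str, list[str]]:
    """Invert the lexicon so each lemma lists its known surface forms."""
    index: dict[str, list[str]] = defaultdict(list)
    for word, lemma in lexicon.items():
        index[lemma].append(word)
    return index


def curated_neighbours(word: str, lexicon: dict[str, str],
                       index: dict[str, list[str]]) -> list[str]:
    """Return other forms sharing the word's lemma, or [] if it is not covered."""
    key = word.lower()
    lemma = lexicon.get(key)
    if lemma is None:
        return []  # not every word is in the hand-built table
    return [form for form in index[lemma] if form != key]


INDEX = build_index(LEXICON)
print(curated_neighbours("john", LEXICON, INDEX))  # ['jhon', 'johne']
print(curated_neighbours("xyz", LEXICON, INDEX))   # []
```

Each query is a single dictionary lookup, which fits this method having the shortest execution time of the five reported on this page.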