A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to il in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
il (0) - 39 freq
ir (1) - 1540 freq
ib (1) - 1 freq
ilc (1) - 1 freq
nil (1) - 7 freq
bl (1) - 2 freq
iq (1) - 4 freq
ii (1) - 69 freq
cl (1) - 5 freq
hl (1) - 1 freq
ily (1) - 2 freq
sl (1) - 7 freq
'l (1) - 5 freq
nl (1) - 6 freq
ip (1) - 14 freq
ia (1) - 5 freq
im (1) - 370 freq
ig (1) - 16 freq
if (1) - 5844 freq
iz (1) - 403 freq
vil (1) - 2 freq
ml (1) - 12 freq
ail (1) - 5 freq
i'l (1) - 1 freq
dil (1) - 4 freq
il (0) - 39 freq
ail (1) - 5 freq
eil (1) - 10 freq
al (1) - 237 freq
ul (1) - 9 freq
ily (1) - 2 freq
yil (1) - 4 freq
iel (1) - 1 freq
oil (1) - 89 freq
el (1) - 25 freq
ol (1) - 5 freq
yl (1) - 4 freq
ile (1) - 65 freq
l (1) - 178 freq
aail (2) - 1 freq
gl (2) - 2 freq
fil (2) - 2 freq
ale (2) - 51 freq
ool (2) - 8 freq
yul (2) - 1 freq
ola (2) - 1 freq
oyl (2) - 2 freq
xl (2) - 3 freq
til (2) - 1710 freq
ilie (2) - 1 freq
SoundEx code - I400
'i'll - 62 freq
ill - 405 freq
iley - 4 freq
i'll - 836 freq
ile - 65 freq
'ill - 4 freq
ilie - 1 freq
i'llll - 1 freq
'il - 9 freq
il - 39 freq
iell - 2 freq
iloo - 1 freq
ie'll - 1 freq
ilia - 1 freq
ill' - 1 freq
ile' - 1 freq
i'il - 2 freq
i'law - 1 freq
i'loo - 6 freq
i'aul - 1 freq
i'ile - 1 freq
i'hillie - 5 freq
i'wool - 1 freq
iel - 1 freq
i'l - 1 freq
ill- - 2 freq
ily - 2 freq
€˜i'll - 1 freq
€˜ill - 1 freq
i-i'll - 1 freq
€™il - 7 freq
illie - 2 freq
€œi'll - 5 freq
€œil - 9 freq
€œill - 1 freq
iÂ’ll - 32 freq
iulii - 1 freq
“il - 1 freq
“i’ll - 1 freq
iyil - 9 freq
MetaPhone code - IL
'i'll - 62 freq
ill - 405 freq
iley - 4 freq
i'll - 836 freq
ile - 65 freq
'ill - 4 freq
ilie - 1 freq
i'llll - 1 freq
'il - 9 freq
il - 39 freq
iell - 2 freq
iloo - 1 freq
ie'll - 1 freq
ilia - 1 freq
ill' - 1 freq
ile' - 1 freq
i'il - 2 freq
i'law - 1 freq
i'loo - 6 freq
i'aul - 1 freq
i'ile - 1 freq
iel - 1 freq
i'l - 1 freq
ill- - 2 freq
ily - 2 freq
€˜i'll - 1 freq
€˜ill - 1 freq
i-i'll - 1 freq
€™il - 7 freq
illie - 2 freq
€œi'll - 5 freq
€œil - 9 freq
€œill - 1 freq
iÂ’ll - 32 freq
iulii - 1 freq
“il - 1 freq
“i’ll - 1 freq
IL
Time to execute Levenshtein function - 0.259067 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.666154 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.078141 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040921 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000784 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.