A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to yid� in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
yids (3) - 2 freq
yiddie (3) - 1 freq
yid (3) - 87 freq
widd (4) - 69 freq
tidins (4) - 8 freq
widlan (4) - 2 freq
kid (4) - 104 freq
kiddin (4) - 70 freq
ydm (4) - 1 freq
yince- (4) - 1 freq
idle (4) - 27 freq
ideqv (4) - 1 freq
midse (4) - 1 freq
idiot (4) - 24 freq
tidy's (4) - 1 freq
yijwp (4) - 1 freq
wid-be (4) - 1 freq
yil (4) - 4 freq
yirs (4) - 4 freq
midges (4) - 26 freq
yirdit (4) - 29 freq
bidit (4) - 30 freq
sidey (4) - 2 freq
yiss (4) - 22 freq
side's (4) - 3 freq
yid (6) - 87 freq
yids (6) - 2 freq
yiddie (6) - 1 freq
idda (7) - 89 freq
iddir (7) - 11 freq
idjz (7) - 1 freq
idea (7) - 563 freq
idunn (7) - 1 freq
idoot (7) - 1 freq
aidam (7) - 1 freq
iday (7) - 19 freq
aider (7) - 2 freq
aidso (7) - 1 freq
idol (7) - 12 freq
didni (7) - 21 freq
uid (7) - 1 freq
id (7) - 597 freq
aided (7) - 2 freq
didmae (7) - 1 freq
didny (7) - 16 freq
didna (7) - 1636 freq
aidter (7) - 1 freq
aidgin (7) - 1 freq
idil (7) - 1 freq
eidduc (7) - 1 freq
SoundEx code - Y300
yett - 116 freq
ye'd - 445 freq
yet - 1020 freq
yit - 500 freq
you-yit - 1 freq
youth - 130 freq
ye-it - 1 freq
you'd - 58 freq
yed - 7 freq
yaud - 2 freq
yt - 12 freq
'yt - 1 freq
yuith - 1 freq
yout - 1 freq
yeat - 2 freq
'yit - 1 freq
yte - 1 freq
yeed - 1 freq
yitey - 3 freq
yite - 2 freq
'youth - 2 freq
yi'd - 17 freq
'ye'd - 7 freq
yad - 1 freq
you'da - 1 freq
yae-dae - 1 freq
yae'd - 2 freq
yi''d - 1 freq
yit' - 1 freq
'you'd - 2 freq
yat - 1 freq
yid - 87 freq
yud - 1 freq
yeht - 1 freq
yit-oh - 1 freq
ytt - 1 freq
yooth - 6 freq
yitt - 1 freq
yowt - 1 freq
yatt - 2 freq
yaethe - 1 freq
yoda - 3 freq
€œyou'd - 1 freq
€™ye-do - 1 freq
yiddie - 1 freq
youthie - 1 freq
yeti - 1 freq
€œyat - 1 freq
yytde - 1 freq
yeÂ’d - 10 freq
ytuo - 1 freq
youÂ’d - 4 freq
ydo - 1 freq
yet” - 1 freq
yto - 1 freq
yaud' - 1 freq
yeet - 1 freq
youd - 4 freq
yoit - 1 freq
MetaPhone code - YT
yett - 116 freq
ye'd - 445 freq
yet - 1020 freq
yit - 500 freq
ye-it - 1 freq
you'd - 58 freq
yed - 7 freq
yaud - 2 freq
yout - 1 freq
yeat - 2 freq
'yit - 1 freq
hyed - 1 freq
yeed - 1 freq
yitey - 3 freq
yite - 2 freq
yi'd - 17 freq
'ye'd - 7 freq
yad - 1 freq
you'da - 1 freq
yae-dae - 1 freq
yae'd - 2 freq
yi''d - 1 freq
yit' - 1 freq
'you'd - 2 freq
yat - 1 freq
yid - 87 freq
yud - 1 freq
yeht - 1 freq
yit-oh - 1 freq
yitt - 1 freq
wyed - 2 freq
yowt - 1 freq
yatt - 2 freq
yoda - 3 freq
€œyou'd - 1 freq
€™ye-do - 1 freq
yiddie - 1 freq
yeti - 1 freq
€œyat - 1 freq
yeÂ’d - 10 freq
youÂ’d - 4 freq
yet” - 1 freq
yaud' - 1 freq
yeet - 1 freq
youd - 4 freq
yoit - 1 freq
YID�
Time to execute Levenshtein function - 0.229117 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.345044 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029043 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038766 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000924 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.