A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to yat in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
yat (0) - 1 freq
yah (1) - 3 freq
zat (1) - 11 freq
jat (1) - 1 freq
eat (1) - 460 freq
yas (1) - 28 freq
vat (1) - 12 freq
dat (1) - 1391 freq
tat (1) - 16 freq
yar (1) - 3 freq
cat (1) - 557 freq
yit (1) - 500 freq
rat (1) - 58 freq
gat (1) - 367 freq
ytt (1) - 1 freq
yam (1) - 2 freq
yaz (1) - 1 freq
pat (1) - 259 freq
bat (1) - 50 freq
at (1) - 20079 freq
kat (1) - 156 freq
yaw (1) - 1 freq
oat (1) - 12 freq
yeat (1) - 2 freq
fat (1) - 295 freq
yat (0) - 1 freq
yet (1) - 987 freq
yit (1) - 500 freq
aat (1) - 852 freq
yt (1) - 12 freq
at (1) - 20079 freq
yeat (1) - 2 freq
oat (1) - 12 freq
eat (1) - 460 freq
ut (2) - 9 freq
oot (2) - 13735 freq
ta (2) - 2534 freq
eet (2) - 581 freq
ati (2) - 16 freq
yout (2) - 2 freq
ate (2) - 115 freq
ygt (2) - 1 freq
'at (2) - 357 freq
yak (2) - 15 freq
hat (2) - 176 freq
yav (2) - 1 freq
yte (2) - 1 freq
eit (2) - 644 freq
yeti (2) - 1 freq
it (2) - 32760 freq
SoundEx code - Y300
yett - 116 freq
ye'd - 435 freq
yet - 987 freq
yit - 500 freq
you-yit - 1 freq
youth - 130 freq
ye-it - 1 freq
you'd - 58 freq
yed - 7 freq
yaud - 2 freq
yt - 12 freq
'yt - 1 freq
yuith - 1 freq
yout - 2 freq
yeat - 2 freq
'yit - 1 freq
yte - 1 freq
yeed - 1 freq
yitey - 3 freq
yite - 2 freq
'ye'd - 7 freq
yad - 1 freq
you'da - 1 freq
yae-dae - 1 freq
yae'd - 2 freq
yi'd - 16 freq
yi''d - 1 freq
yit' - 1 freq
'you'd - 2 freq
'youth - 1 freq
yat - 1 freq
yid - 87 freq
yud - 1 freq
yeht - 1 freq
yit-oh - 1 freq
ytt - 1 freq
yooth - 6 freq
yitt - 1 freq
yowt - 1 freq
yatt - 2 freq
yaethe - 1 freq
yoda - 3 freq
€œyou'd - 1 freq
€™ye-do - 1 freq
yiddie - 1 freq
youthie - 1 freq
yeti - 1 freq
€œyat - 1 freq
yytde - 1 freq
yeÂ’d - 10 freq
ytuo - 1 freq
youÂ’d - 4 freq
ydo - 1 freq
yet” - 1 freq
yto - 1 freq
yaud' - 1 freq
yeet - 1 freq
youd - 4 freq
yoit - 1 freq
MetaPhone code - YT
yett - 116 freq
ye'd - 435 freq
yet - 987 freq
yit - 500 freq
ye-it - 1 freq
you'd - 58 freq
yed - 7 freq
yaud - 2 freq
yout - 2 freq
yeat - 2 freq
'yit - 1 freq
hyed - 1 freq
yeed - 1 freq
yitey - 3 freq
yite - 2 freq
'ye'd - 7 freq
yad - 1 freq
you'da - 1 freq
yae-dae - 1 freq
yae'd - 2 freq
yi'd - 16 freq
yi''d - 1 freq
yit' - 1 freq
'you'd - 2 freq
yat - 1 freq
yid - 87 freq
yud - 1 freq
yeht - 1 freq
yit-oh - 1 freq
yitt - 1 freq
wyed - 2 freq
yowt - 1 freq
yatt - 2 freq
yoda - 3 freq
€œyou'd - 1 freq
€™ye-do - 1 freq
yiddie - 1 freq
yeti - 1 freq
€œyat - 1 freq
yeÂ’d - 10 freq
youÂ’d - 4 freq
yet” - 1 freq
yaud' - 1 freq
yeet - 1 freq
youd - 4 freq
yoit - 1 freq
YAT
Time to execute Levenshtein function - 0.470042 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.641066 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.070809 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.080376 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000782 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.