A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gyah in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gyah (0) - 1 freq
gyan (1) - 104 freq
gyat (1) - 1 freq
gaah (1) - 1 freq
uyah (1) - 2 freq
gya (1) - 1 freq
gyad (1) - 3 freq
yah (1) - 3 freq
gaan (2) - 244 freq
lyh (2) - 1 freq
yap (2) - 17 freq
€˜ah (2) - 123 freq
goaf (2) - 1 freq
yae (2) - 1059 freq
gyt (2) - 1 freq
gyde (2) - 3 freq
snah (2) - 1 freq
gay (2) - 66 freq
gva (2) - 1 freq
ahah (2) - 1 freq
dah (2) - 3 freq
fyth (2) - 1 freq
yar (2) - 3 freq
gram (2) - 10 freq
'mah (2) - 2 freq
gyah (0) - 1 freq
gaah (1) - 1 freq
yah (2) - 3 freq
gyad (2) - 3 freq
guh (2) - 1 freq
gh (2) - 5 freq
gyat (2) - 1 freq
gyan (2) - 104 freq
gya (2) - 1 freq
uyah (2) - 2 freq
ych (3) - 1 freq
uyh (3) - 1 freq
goa (3) - 2 freq
sah (3) - 2 freq
gluh (3) - 1 freq
yh (3) - 2 freq
goach (3) - 1 freq
goal (3) - 89 freq
gaar (3) - 1 freq
hah (3) - 4 freq
woah (3) - 3 freq
jah (3) - 2 freq
yogh (3) - 3 freq
gead (3) - 2 freq
gym (3) - 40 freq
SoundEx code - G000
gie - 2520 freq
gey - 1312 freq
go - 1915 freq
gae - 501 freq
gay - 66 freq
'gie - 26 freq
guy - 210 freq
gooooo - 1 freq
g - 275 freq
gu - 8 freq
gee - 178 freq
'go - 11 freq
gi'e - 2 freq
'gae - 3 freq
gq - 3 freq
ge - 13 freq
gye - 194 freq
gaw - 5 freq
ga - 29 freq
goa - 2 freq
gow - 7 freq
gce - 4 freq
geiy - 16 freq
'gee - 1 freq
goo - 5 freq
guiy - 1 freq
gwee - 1 freq
giy - 16 freq
g'wa - 17 freq
g'awa - 7 freq
gsoh - 2 freq
'gsoh' - 1 freq
gsoh' - 1 freq
gie-awa - 1 freq
gea - 1 freq
gooey - 1 freq
goe - 1 freq
'gey - 2 freq
gaah - 1 freq
gui - 1 freq
g' - 1 freq
gös - 2 freq
go' - 1 freq
'gh' - 1 freq
gaye - 2 freq
gaia - 3 freq
'go' - 2 freq
gcse - 5 freq
gy - 11 freq
geo - 19 freq
gyo - 6 freq
gaey - 1 freq
gei - 15 freq
gau' - 4 freq
gou - 1 freq
-go - 1 freq
gjo - 2 freq
ªg - 1 freq
gøs - 1 freq
gi - 27 freq
gh - 5 freq
gaea - 1 freq
gio - 1 freq
€˜g - 5 freq
€œg - 2 freq
€œgo - 6 freq
gí - 1 freq
€œgie - 8 freq
gau - 2 freq
€˜go - 2 freq
€˜gie - 3 freq
€œgey - 1 freq
€œgae - 1 freq
€”go - 1 freq
€˜-gh - 1 freq
€œguy - 1 freq
gz - 3 freq
gg - 4 freq
guh - 1 freq
gk - 2 freq
g'wa' - 1 freq
gx - 4 freq
gcsw - 1 freq
gs - 3 freq
‘gie - 1 freq
gawa - 5 freq
gca - 1 freq
gki - 1 freq
ggks - 1 freq
gkk - 1 freq
gxx - 1 freq
gya - 1 freq
gai - 1 freq
gco - 1 freq
gj - 5 freq
gsy - 1 freq
gie' - 5 freq
goooooo - 1 freq
gc - 3 freq
gyy - 1 freq
gcc - 2 freq
“gie - 2 freq
“gey - 1 freq
gxh - 1 freq
gcs - 1 freq
gcwih - 1 freq
güs - 1 freq
gkg - 1 freq
ggc - 1 freq
gyah - 1 freq
gquo - 1 freq
gcuo - 1 freq
gqkzs - 1 freq
gxz - 1 freq
gqe - 1 freq
'gey' - 1 freq
guw - 1 freq
gjxjjc - 1 freq
gkh - 1 freq
gccc - 1 freq
gqo - 1 freq
gw - 1 freq
MetaPhone code - JY
gye - 194 freq
gyo - 6 freq
gya - 1 freq
gyah - 1 freq
jyi - 1 freq
GYAH
Time to execute Levenshtein function - 0.385663 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.615463 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.058535 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041269 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000928 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.