A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to gyah in Corpus

Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
gyah (0) - 1 freq
gya (1) - 1 freq
gyad (1) - 3 freq
gyan (1) - 104 freq
gyat (1) - 1 freq
uyah (1) - 2 freq
gaah (1) - 1 freq
yah (1) - 3 freq
goan (2) - 64 freq
awah (2) - 14 freq
gyran (2) - 1 freq
gyaan (2) - 83 freq
gras (2) - 5 freq
eyan (2) - 1 freq
'ah (2) - 382 freq
gyz (2) - 2 freq
dyan (2) - 2 freq
yvh (2) - 1 freq
gyd (2) - 1 freq
graph (2) - 4 freq
guh (2) - 1 freq
glad (2) - 105 freq
gyo (2) - 6 freq
gean (2) - 10 freq
gogh (2) - 1 freq

Double Levenshtein
gyah (0) - 1 freq
gaah (1) - 1 freq
yah (2) - 3 freq
guh (2) - 1 freq
uyah (2) - 2 freq
gh (2) - 5 freq
gyat (2) - 1 freq
gyad (2) - 3 freq
gya (2) - 1 freq
gyan (2) - 104 freq
gear (3) - 237 freq
yaho (3) - 1 freq
gyaun (3) - 10 freq
eah (3) - 1 freq
gam (3) - 2 freq
dah (3) - 3 freq
gead (3) - 2 freq
gab (3) - 32 freq
gype (3) - 47 freq
yih (3) - 4 freq
yach (3) - 1 freq
gaan (3) - 244 freq
naah (3) - 3 freq
gaz (3) - 1 freq
gyit (3) - 2 freq
SoundEx code - G000
gie - 2567 freq
gey - 1316 freq
go - 1980 freq
gae - 503 freq
gay - 69 freq
'gie - 26 freq
guy - 222 freq
gooooo - 1 freq
g - 278 freq
gu - 8 freq
gee - 178 freq
'go - 11 freq
gi'e - 2 freq
'gae - 3 freq
gq - 3 freq
ge - 13 freq
gye - 194 freq
gaw - 5 freq
ga - 29 freq
goa - 2 freq
gow - 7 freq
gce - 4 freq
geiy - 16 freq
'gee - 1 freq
goo - 5 freq
guiy - 1 freq
gwee - 1 freq
giy - 16 freq
g'wa - 17 freq
g'awa - 7 freq
gsoh - 2 freq
'gsoh' - 1 freq
gsoh' - 1 freq
gie-awa - 1 freq
gea - 1 freq
gooey - 1 freq
goe - 1 freq
'gey - 2 freq
gaah - 1 freq
gui - 1 freq
g' - 1 freq
gös - 2 freq
go' - 1 freq
'gh' - 1 freq
gaye - 2 freq
gaia - 3 freq
'go' - 2 freq
gcse - 5 freq
gy - 11 freq
geo - 19 freq
gyo - 6 freq
gaey - 1 freq
gei - 15 freq
gau' - 4 freq
gou - 1 freq
-go - 1 freq
gjo - 2 freq
ªg - 1 freq
gøs - 1 freq
gi - 27 freq
gh - 5 freq
gaea - 1 freq
gio - 1 freq
€˜g - 5 freq
€œg - 2 freq
€œgo - 6 freq
gí - 1 freq
€œgie - 8 freq
gau - 2 freq
€˜go - 2 freq
€˜gie - 3 freq
€œgey - 1 freq
€œgae - 1 freq
€”go - 1 freq
€˜-gh - 1 freq
€œguy - 1 freq
gz - 3 freq
gg - 4 freq
guh - 1 freq
gk - 2 freq
g'wa' - 1 freq
gx - 4 freq
gcsw - 1 freq
gs - 3 freq
‘gie - 1 freq
gawa - 5 freq
gca - 1 freq
gki - 1 freq
ggks - 1 freq
gkk - 1 freq
gxx - 1 freq
gya - 1 freq
gai - 1 freq
gco - 1 freq
gj - 5 freq
gsy - 1 freq
gie' - 5 freq
goooooo - 1 freq
gc - 3 freq
gyy - 1 freq
gcc - 2 freq
“gie - 2 freq
“gey - 1 freq
gxh - 1 freq
gcs - 1 freq
gcwih - 1 freq
güs - 1 freq
gkg - 1 freq
ggc - 1 freq
gyah - 1 freq
gquo - 1 freq
gcuo - 1 freq
gqkzs - 1 freq
gxz - 1 freq
gqe - 1 freq
'gey' - 1 freq
guw - 1 freq
gjxjjc - 1 freq
gkh - 1 freq
gccc - 1 freq
gqo - 1 freq
gw - 1 freq
MetaPhone code - JY
gye - 194 freq
gyo - 6 freq
gya - 1 freq
gyah - 1 freq
jyi - 1 freq

Manually curated
GYAH
Time to execute Levenshtein function - 0.227216 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
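The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the site's own code; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b, or keep it
            ))
        previous = current
    return previous[-1]


# Distances taken from the Levenshtein results listed above.
print(levenshtein("gyah", "gyan"))  # 1
print(levenshtein("gyah", "goan"))  # 2
```

Only two rows of the table are kept in memory at a time, which is enough when just the distance is needed rather than the edit sequence.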
Time to execute Double Levenshtein function - 0.364490 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
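A rough sketch of that idea in Python follows. The names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set is an assumption: treating `y` as a vowel alongside `a e i o u` reproduces the Double Levenshtein scores listed above (for example gaah at 1 and gear at 3), but the site's actual vowel list is not stated.

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel, see the note above


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character that is in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("gyah", "gaah"))  # 1: skeletons "gh" and "gh" match
print(double_levenshtein("gyah", "gyan"))  # 2: skeletons "gh" and "gn" differ
```

Because a vowel change only affects the first pass while a consonant change affects both, words that share a consonant skeleton rank closer than they do under plain Levenshtein.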
Time to execute SoundEx function - 0.028848 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
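As an illustration, the standard American Soundex rules can be sketched as below. This is an assumption about the variant in use; the site's own implementation is not shown, and the name `soundex` and the handling of non-ASCII letters (they are simply skipped) are choices made for this example.

```python
# Map each consonant to its Soundex digit; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last_digit = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated digits
        digit = CODES.get(ch, "")
        if digit and digit != last_digit:
            code += digit
        last_digit = digit  # a vowel resets this, so a repeat after it counts
        if len(code) == 4:
            break
    return code.ljust(4, "0")  # pad short codes with zeros


print(soundex("gyah"))  # G000
print(soundex("gsoh"))  # G000
```

Every letter after the initial g in gyah is a vowel or h, so nothing is encoded beyond the first letter, which is why so many short g- words share the code G000.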
Time to execute MetaPhone function - 0.038063 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000943 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
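A lookup of this kind might look like the sketch below. The entries in `LEXICON` and the name `lookup_lemma` are invented for illustration and are not taken from the corpus lexicon; the point is only that a hit is a single dictionary access and a miss returns nothing.

```python
from typing import Optional

# Hypothetical entries, invented for illustration only.
LEXICON = {
    "gie": "GIE",
    "gied": "GIE",
    "gien": "GIE",
    "gi'e": "GIE",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.strip().lower())


print(lookup_lemma("Gied"))  # GIE
print(lookup_lemma("zzz"))   # None
```

A plain dictionary lookup does no distance computation at all, which fits with this method having the shortest execution time reported above.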