A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gi� in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
git (3) - 1232 freq
giff (3) - 1 freq
gicin (3) - 1 freq
girly (3) - 1 freq
gind (3) - 1 freq
girst (3) - 2 freq
gigas (3) - 1 freq
gied (3) - 1350 freq
gimps (3) - 4 freq
girse (3) - 90 freq
ginin (3) - 1 freq
gite (3) - 2 freq
gieit (3) - 1 freq
giza (3) - 1 freq
gizmo (3) - 2 freq
giean (3) - 4 freq
girs (3) - 4 freq
ging (3) - 343 freq
ginos (3) - 1 freq
givin (3) - 8 freq
giaur (3) - 1 freq
gings (3) - 75 freq
gin' (3) - 1 freq
giy's (3) - 1 freq
gigs (3) - 5 freq
girl' (6) - 1 freq
gilt (6) - 2 freq
gie't (6) - 12 freq
gites (6) - 1 freq
gid (6) - 295 freq
giled (6) - 1 freq
gifts (6) - 32 freq
giess (6) - 4 freq
gift (6) - 116 freq
giy (6) - 16 freq
girss (6) - 16 freq
gight (6) - 3 freq
git' (6) - 3 freq
gibb (6) - 12 freq
gimin (6) - 2 freq
giez (6) - 1 freq
gig (6) - 36 freq
gibt (6) - 3 freq
gim (6) - 1 freq
gid' (6) - 1 freq
gicd (6) - 1 freq
gink (6) - 1 freq
giric (6) - 1 freq
girnt (6) - 4 freq
giekt (6) - 1 freq
SoundEx code - G000
gie - 2520 freq
gey - 1312 freq
go - 1915 freq
gae - 501 freq
gay - 66 freq
'gie - 26 freq
guy - 210 freq
gooooo - 1 freq
g - 275 freq
gu - 8 freq
gee - 178 freq
'go - 11 freq
gi'e - 2 freq
'gae - 3 freq
gq - 3 freq
ge - 13 freq
gye - 194 freq
gaw - 5 freq
ga - 29 freq
goa - 2 freq
gow - 7 freq
gce - 4 freq
geiy - 16 freq
'gee - 1 freq
goo - 5 freq
guiy - 1 freq
gwee - 1 freq
giy - 16 freq
g'wa - 17 freq
g'awa - 7 freq
gsoh - 2 freq
'gsoh' - 1 freq
gsoh' - 1 freq
gie-awa - 1 freq
gea - 1 freq
gooey - 1 freq
goe - 1 freq
'gey - 2 freq
gaah - 1 freq
gui - 1 freq
g' - 1 freq
gös - 2 freq
go' - 1 freq
'gh' - 1 freq
gaye - 2 freq
gaia - 3 freq
'go' - 2 freq
gcse - 5 freq
gy - 11 freq
geo - 19 freq
gyo - 6 freq
gaey - 1 freq
gei - 15 freq
gau' - 4 freq
gou - 1 freq
-go - 1 freq
gjo - 2 freq
ªg - 1 freq
gøs - 1 freq
gi - 27 freq
gh - 5 freq
gaea - 1 freq
gio - 1 freq
€˜g - 5 freq
€œg - 2 freq
€œgo - 6 freq
gí - 1 freq
€œgie - 8 freq
gau - 2 freq
€˜go - 2 freq
€˜gie - 3 freq
€œgey - 1 freq
€œgae - 1 freq
€”go - 1 freq
€˜-gh - 1 freq
€œguy - 1 freq
gz - 3 freq
gg - 4 freq
guh - 1 freq
gk - 2 freq
g'wa' - 1 freq
gx - 4 freq
gcsw - 1 freq
gs - 3 freq
‘gie - 1 freq
gawa - 5 freq
gca - 1 freq
gki - 1 freq
ggks - 1 freq
gkk - 1 freq
gxx - 1 freq
gya - 1 freq
gai - 1 freq
gco - 1 freq
gj - 5 freq
gsy - 1 freq
gie' - 5 freq
goooooo - 1 freq
gc - 3 freq
gyy - 1 freq
gcc - 2 freq
“gie - 2 freq
“gey - 1 freq
gxh - 1 freq
gcs - 1 freq
gcwih - 1 freq
güs - 1 freq
gkg - 1 freq
ggc - 1 freq
gyah - 1 freq
gquo - 1 freq
gcuo - 1 freq
gqkzs - 1 freq
gxz - 1 freq
gqe - 1 freq
'gey' - 1 freq
guw - 1 freq
gjxjjc - 1 freq
gkh - 1 freq
gccc - 1 freq
gqo - 1 freq
gw - 1 freq
MetaPhone code - J
gie - 2520 freq
gey - 1312 freq
j - 186 freq
joy - 244 freq
jaw - 57 freq
jaw- - 1 freq
jow - 6 freq
'gie - 26 freq
jo - 28 freq
gee - 178 freq
joe - 105 freq
gi'e - 2 freq
jee - 5 freq
jaa - 31 freq
je - 10 freq
ge - 13 freq
jey - 6 freq
wj - 4 freq
ji - 7 freq
geiy - 16 freq
'gee - 1 freq
joo - 3 freq
giy - 16 freq
jae - 2 freq
j- - 1 freq
j' - 6 freq
yj - 5 freq
jah - 2 freq
'j - 3 freq
ja - 7 freq
gea - 1 freq
'gey - 2 freq
'joy - 1 freq
‹j - 1 freq
joey - 21 freq
gy - 11 freq
dge - 1 freq
geo - 19 freq
ju - 6 freq
gei - 15 freq
jo' - 1 freq
'ja' - 1 freq
'ja - 1 freq
j'ai - 2 freq
jew - 6 freq
gi - 27 freq
hygiea - 1 freq
gio - 1 freq
€œgie - 8 freq
€˜j - 1 freq
€˜gie - 3 freq
€˜joe - 1 freq
€œgey - 1 freq
€œje - 1 freq
jai - 1 freq
©j - 1 freq
hj - 2 freq
jeyy - 1 freq
ygi - 1 freq
hjo - 1 freq
jj - 7 freq
‘gie - 1 freq
hgi - 1 freq
joh - 1 freq
jay - 7 freq
joeÂ’ - 1 freq
joao - 38 freq
jww - 1 freq
gie' - 5 freq
wjy - 1 freq
yyj - 1 freq
jeh - 1 freq
jei - 1 freq
gyy - 1 freq
“gie - 2 freq
jw - 4 freq
“gey - 1 freq
jy - 2 freq
juh - 1 freq
yjh - 1 freq
'gey' - 1 freq
jie - 1 freq
jyy - 1 freq
hyj - 1 freq
GI�
Time to execute Levenshtein function - 0.166980 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.323477 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028316 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037755 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000904 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.