A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gawa in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gawa (0) - 5 freq
awa (1) - 4197 freq
g'wa (1) - 17 freq
gaw (1) - 5 freq
g'awa (1) - 7 freq
gaia (1) - 3 freq
gaga (1) - 1 freq
gaea (1) - 1 freq
'awa (1) - 22 freq
gawk (1) - 3 freq
gala (1) - 15 freq
gaws (1) - 1 freq
gaza (1) - 1 freq
gawp (1) - 9 freq
gawn (1) - 34 freq
qawa (1) - 1 freq
gawd (1) - 2 freq
geta (2) - 1 freq
daia (2) - 2 freq
gota (2) - 3 freq
baa (2) - 73 freq
paws (2) - 46 freq
gawds (2) - 3 freq
garm (2) - 1 freq
baba (2) - 2 freq
gawa (0) - 5 freq
gaw (1) - 5 freq
gawn (2) - 34 freq
gawp (2) - 9 freq
gw (2) - 1 freq
gaza (2) - 1 freq
gawd (2) - 2 freq
gow (2) - 7 freq
gaws (2) - 1 freq
guw (2) - 1 freq
qawa (2) - 1 freq
awa (2) - 4197 freq
g'wa (2) - 17 freq
gala (2) - 15 freq
gaia (2) - 3 freq
g'awa (2) - 7 freq
gaga (2) - 1 freq
gawk (2) - 3 freq
gaea (2) - 1 freq
'awa (2) - 22 freq
gaan (3) - 244 freq
gaap (3) - 1 freq
vaw (3) - 1 freq
'away (3) - 3 freq
gowp (3) - 65 freq
SoundEx code - G000
gie - 2520 freq
gey - 1312 freq
go - 1915 freq
gae - 501 freq
gay - 66 freq
'gie - 26 freq
guy - 210 freq
gooooo - 1 freq
g - 275 freq
gu - 8 freq
gee - 178 freq
'go - 11 freq
gi'e - 2 freq
'gae - 3 freq
gq - 3 freq
ge - 13 freq
gye - 194 freq
gaw - 5 freq
ga - 29 freq
goa - 2 freq
gow - 7 freq
gce - 4 freq
geiy - 16 freq
'gee - 1 freq
goo - 5 freq
guiy - 1 freq
gwee - 1 freq
giy - 16 freq
g'wa - 17 freq
g'awa - 7 freq
gsoh - 2 freq
'gsoh' - 1 freq
gsoh' - 1 freq
gie-awa - 1 freq
gea - 1 freq
gooey - 1 freq
goe - 1 freq
'gey - 2 freq
gaah - 1 freq
gui - 1 freq
g' - 1 freq
gös - 2 freq
go' - 1 freq
'gh' - 1 freq
gaye - 2 freq
gaia - 3 freq
'go' - 2 freq
gcse - 5 freq
gy - 11 freq
geo - 19 freq
gyo - 6 freq
gaey - 1 freq
gei - 15 freq
gau' - 4 freq
gou - 1 freq
-go - 1 freq
gjo - 2 freq
ªg - 1 freq
gøs - 1 freq
gi - 27 freq
gh - 5 freq
gaea - 1 freq
gio - 1 freq
€˜g - 5 freq
€œg - 2 freq
€œgo - 6 freq
gí - 1 freq
€œgie - 8 freq
gau - 2 freq
€˜go - 2 freq
€˜gie - 3 freq
€œgey - 1 freq
€œgae - 1 freq
€”go - 1 freq
€˜-gh - 1 freq
€œguy - 1 freq
gz - 3 freq
gg - 4 freq
guh - 1 freq
gk - 2 freq
g'wa' - 1 freq
gx - 4 freq
gcsw - 1 freq
gs - 3 freq
‘gie - 1 freq
gawa - 5 freq
gca - 1 freq
gki - 1 freq
ggks - 1 freq
gkk - 1 freq
gxx - 1 freq
gya - 1 freq
gai - 1 freq
gco - 1 freq
gj - 5 freq
gsy - 1 freq
gie' - 5 freq
goooooo - 1 freq
gc - 3 freq
gyy - 1 freq
gcc - 2 freq
“gie - 2 freq
“gey - 1 freq
gxh - 1 freq
gcs - 1 freq
gcwih - 1 freq
güs - 1 freq
gkg - 1 freq
ggc - 1 freq
gyah - 1 freq
gquo - 1 freq
gcuo - 1 freq
gqkzs - 1 freq
gxz - 1 freq
gqe - 1 freq
'gey' - 1 freq
guw - 1 freq
gjxjjc - 1 freq
gkh - 1 freq
gccc - 1 freq
gqo - 1 freq
gw - 1 freq
MetaPhone code - KW
gwee - 1 freq
g'wa - 17 freq
g'awa - 7 freq
kiwi - 2 freq
'cawa - 1 freq
c'wa - 6 freq
cowe - 1 freq
g'wa' - 1 freq
qawa - 1 freq
gawa - 5 freq
qwa - 1 freq
kwai - 1 freq
GAWA
Time to execute Levenshtein function - 0.312794 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.512370 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.054690 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040108 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000770 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.