A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to groustie in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
groustie (0) - 2 freq
roustie (1) - 4 freq
goustie (1) - 4 freq
broustie (1) - 1 freq
grushie (2) - 3 freq
ghostie (2) - 1 freq
crouslie (2) - 1 freq
groatie (2) - 3 freq
grottie (2) - 1 freq
groutin (2) - 1 freq
gustie (2) - 3 freq
voustie (2) - 1 freq
'goustie (2) - 1 freq
grouse (2) - 29 freq
roustit (2) - 5 freq
gowstie (2) - 7 freq
drousie (2) - 1 freq
roushie (2) - 17 freq
roostie (2) - 2 freq
groupie (2) - 3 freq
poustie (2) - 2 freq
gruntie (2) - 1 freq
roustin (2) - 1 freq
groattie (2) - 1 freq
frostie (2) - 1 freq
groustie (0) - 2 freq
broustie (2) - 1 freq
goustie (2) - 4 freq
roustie (2) - 4 freq
grouse (3) - 29 freq
groattie (3) - 1 freq
gowstie (3) - 7 freq
gustie (3) - 3 freq
gruntie (3) - 1 freq
grottie (3) - 1 freq
ghostie (3) - 1 freq
grushie (3) - 3 freq
frostie (3) - 1 freq
groatie (3) - 3 freq
roostie (3) - 2 freq
breastie (4) - 1 freq
rousit (4) - 6 freq
roust (4) - 5 freq
trustee (4) - 1 freq
greesie (4) - 1 freq
agrostis (4) - 1 freq
tristie (4) - 1 freq
rousty (4) - 1 freq
groose (4) - 4 freq
freistie (4) - 1 freq
SoundEx code - G623
grazed - 5 freq
grossit - 2 freq
grushed - 1 freq
grosets - 3 freq
gairsty - 2 freq
goargit - 1 freq
groustie - 2 freq
grayssed - 1 freq
grist - 36 freq
greased - 3 freq
grectin - 1 freq
graced - 4 freq
grassed - 5 freq
greased-back - 1 freq
greasy-heidit - 1 freq
gurged - 1 freq
gorged - 2 freq
gristle - 3 freq
greast - 1 freq
grogged - 1 freq
gree-get - 1 freq
greastest - 1 freq
girst - 2 freq
grossets - 1 freq
gourgaud - 1 freq
grosset - 2 freq
gorsedh - 1 freq
garscadden - 2 freq
garygatesmusic - 1 freq
georgethepoet - 1 freq
grctjk - 1 freq
grozet - 1 freq
greggwatson - 1 freq
MetaPhone code - KRST
grazed - 5 freq
cursed - 43 freq
caressed - 2 freq
grossit - 2 freq
croasst - 3 freq
corset - 2 freq
curst - 2 freq
crossit - 7 freq
crosst - 12 freq
curiosity - 34 freq
crust - 18 freq
crest - 7 freq
crusty - 5 freq
crossed - 100 freq
crusade - 4 freq
crossd - 1 freq
queerest - 6 freq
curset - 1 freq
gairsty - 2 freq
curiositie - 1 freq
groustie - 2 freq
cruist - 1 freq
grayssed - 1 freq
cressida - 1 freq
kirsty - 161 freq
curiousity - 2 freq
grist - 36 freq
cristo - 1 freq
croassed - 7 freq
greased - 3 freq
graced - 4 freq
grassed - 5 freq
keerstee - 2 freq
kirst - 6 freq
cross-eed - 1 freq
cruised - 1 freq
curse'd - 2 freq
keeriosity - 1 freq
cursit - 4 freq
cresseid - 2 freq
coorsed - 1 freq
greast - 1 freq
curriest - 1 freq
creosote - 3 freq
cross't - 1 freq
corssed - 2 freq
ker-side - 1 freq
grosset - 2 freq
crazed - 3 freq
€œkirsty - 1 freq
€œcrossed - 1 freq
creased - 2 freq
gorsedh - 1 freq
coursed - 1 freq
“kirst - 1 freq
grozet - 1 freq
krista - 4 freq
GROUSTIE
Time to execute Levenshtein function - 0.282989 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.474100 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.037578 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037493 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000992 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.