A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gourgaud in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gourgaud (0) - 1 freq
gurged (3) - 1 freq
gurgled (3) - 3 freq
dounhaud (3) - 1 freq
outgaun (3) - 2 freq
foirgaun (3) - 3 freq
doungaun (3) - 2 freq
gourd (3) - 5 freq
gorged (3) - 2 freq
gurgle (4) - 2 freq
graand (4) - 4 freq
morga (4) - 1 freq
ourlach (4) - 1 freq
norward (4) - 1 freq
dounlaid (4) - 1 freq
coughed (4) - 15 freq
forward (4) - 162 freq
forgat (4) - 5 freq
furraed (4) - 2 freq
burgled (4) - 2 freq
grauf (4) - 3 freq
nouveau (4) - 3 freq
forgave (4) - 1 freq
organ (4) - 16 freq
soured (4) - 1 freq
gourgaud (0) - 1 freq
gorged (3) - 2 freq
gurged (3) - 1 freq
gourd (4) - 5 freq
gurgled (4) - 3 freq
gurges (5) - 1 freq
gorgon (5) - 3 freq
gurred (5) - 8 freq
grudged (5) - 1 freq
gorgin (5) - 1 freq
gurled (5) - 10 freq
gerald (5) - 26 freq
grugous (5) - 2 freq
purged (5) - 1 freq
forged (5) - 3 freq
gorge (5) - 5 freq
forgied (5) - 2 freq
gorgeous (5) - 69 freq
graund (5) - 24 freq
gurned (5) - 1 freq
guard (5) - 39 freq
goargit (5) - 1 freq
gord (5) - 1 freq
surged (5) - 1 freq
gaured (5) - 2 freq
SoundEx code - G623
grazed - 5 freq
grossit - 2 freq
grushed - 1 freq
grosets - 3 freq
gairsty - 2 freq
goargit - 1 freq
groustie - 2 freq
grayssed - 1 freq
grist - 36 freq
greased - 3 freq
grectin - 1 freq
graced - 4 freq
grassed - 5 freq
greased-back - 1 freq
greasy-heidit - 1 freq
gurged - 1 freq
gorged - 2 freq
gristle - 3 freq
greast - 1 freq
grogged - 1 freq
gree-get - 1 freq
greastest - 1 freq
girst - 2 freq
grossets - 1 freq
gourgaud - 1 freq
grosset - 2 freq
gorsedh - 1 freq
garscadden - 2 freq
garygatesmusic - 1 freq
georgethepoet - 1 freq
grctjk - 1 freq
grozet - 1 freq
greggwatson - 1 freq
MetaPhone code - KRKT
crackt - 7 freq
crackit - 23 freq
croquet - 47 freq
croakit - 2 freq
crookit - 17 freq
correct - 40 freq
croqueit - 1 freq
cracked - 41 freq
cruiked - 1 freq
craiked - 4 freq
cruikit - 9 freq
crakt - 1 freq
carket - 1 freq
crakket - 2 freq
crooked - 6 freq
crack't - 1 freq
cricd - 1 freq
crockett - 4 freq
cricket - 13 freq
crockt - 1 freq
croaked - 6 freq
cairry-cot - 1 freq
'correct - 1 freq
creaked - 4 freq
grogged - 1 freq
correckit - 2 freq
cruggit - 2 freq
€˜cricket - 1 freq
correkkit - 1 freq
corkit - 2 freq
corrugat - 1 freq
gourgaud - 1 freq
creeked - 1 freq
craacked - 1 freq
creakit - 1 freq
€˜correct - 1 freq
crocket - 1 freq
corekt - 1 freq
coreect - 1 freq
quarecuttie - 1 freq
GOURGAUD
Time to execute Levenshtein function - 0.247242 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.407687 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.046854 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038834 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001069 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.