A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gordaidh in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gordaidh (0) - 3 freq
gordan (3) - 1 freq
gordie (3) - 1 freq
gowdfish (3) - 9 freq
garadh (3) - 1 freq
uraidh (3) - 1 freq
iarraidh (3) - 1 freq
donaidh (3) - 1 freq
goldfish (3) - 18 freq
ordairs (3) - 2 freq
gordin (3) - 2 freq
lochaidh (3) - 1 freq
graith (3) - 128 freq
gordonh (3) - 2 freq
gorsedh (3) - 1 freq
jordan (4) - 52 freq
geordie (4) - 373 freq
gowdd (4) - 1 freq
mermaid (4) - 3 freq
roddside (4) - 2 freq
kurdish (4) - 1 freq
goldie (4) - 3 freq
modairn (4) - 7 freq
gerald (4) - 26 freq
braid (4) - 256 freq
gordaidh (0) - 3 freq
gorsedh (4) - 1 freq
garadh (4) - 1 freq
gordonh (4) - 2 freq
graith (5) - 128 freq
gordie (5) - 1 freq
gordan (5) - 1 freq
gordin (5) - 2 freq
girded (5) - 1 freq
iarraidh (5) - 1 freq
uraidh (5) - 1 freq
geordies (6) - 4 freq
geordic (6) - 1 freq
gordons (6) - 16 freq
graand (6) - 4 freq
ruariaidh (6) - 2 freq
worded (6) - 1 freq
gordeanna (6) - 1 freq
guerded (6) - 1 freq
gorged (6) - 2 freq
gerard (6) - 2 freq
greyish (6) - 3 freq
grand (6) - 363 freq
ruaraidh (6) - 10 freq
gordon (6) - 123 freq
SoundEx code - G633
guairdit - 6 freq
greeted - 28 freq
graithed - 12 freq
gairdit - 6 freq
greetit - 10 freq
graduate - 11 freq
gairded - 3 freq
gratitude - 16 freq
guarded - 2 freq
garrottit - 1 freq
gratetul - 1 freq
gritted - 3 freq
graduated - 2 freq
gratet - 1 freq
geordietta - 17 freq
gyrated - 1 freq
graduates - 3 freq
guardit - 1 freq
graduation - 2 freq
garthdee - 7 freq
greitit - 1 freq
graduatin - 2 freq
girded - 1 freq
gradyooit - 2 freq
guairded - 3 freq
garritted - 1 freq
gradations - 1 freq
graded - 3 freq
garrotted - 1 freq
gratiteid - 2 freq
graddad - 1 freq
grey-heidit - 1 freq
graduatit - 1 freq
gratiteed - 1 freq
guerded - 1 freq
grated - 1 freq
graduating - 1 freq
gordaidh - 3 freq
gerrydotp - 1 freq
geordiedentist - 1 freq
MetaPhone code - KRTT
guairdit - 6 freq
credit - 111 freq
crowdit - 7 freq
coortit - 5 freq
croudit - 4 freq
creatit - 34 freq
greeted - 28 freq
cairtit - 11 freq
gairdit - 6 freq
creaatit - 1 freq
greetit - 10 freq
cordite - 2 freq
graduate - 11 freq
crowded - 14 freq
croodit - 17 freq
created - 27 freq
gairded - 3 freq
coorted - 2 freq
crowdid - 1 freq
guarded - 2 freq
garrottit - 1 freq
quartet - 2 freq
crooded - 9 freq
gritted - 3 freq
corroded - 4 freq
cairted - 5 freq
crowdet - 2 freq
gratet - 1 freq
creautit - 2 freq
carotid - 1 freq
guardit - 1 freq
greitit - 1 freq
kerted - 1 freq
guairded - 3 freq
garritted - 1 freq
courtit - 1 freq
graded - 3 freq
garrotted - 1 freq
graddad - 1 freq
credite - 2 freq
creedit - 1 freq
carted - 2 freq
co-airtit - 1 freq
guerded - 1 freq
grated - 1 freq
gordaidh - 3 freq
curatit - 1 freq
curated - 1 freq
GORDAIDH
Time to execute Levenshtein function - 0.223773 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.410886 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028286 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039489 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000838 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.