A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to guardian in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
guardian (0) - 13 freq
guardians (1) - 2 freq
guardin (1) - 3 freq
guairdian (1) - 5 freq
guardiantv (2) - 1 freq
guairdians (2) - 3 freq
guardit (2) - 1 freq
guairdin (2) - 1 freq
gaurdian (2) - 7 freq
suzidian (3) - 2 freq
bardin (3) - 1 freq
fardin (3) - 1 freq
cardifan (3) - 1 freq
duardna (3) - 1 freq
georgian (3) - 4 freq
jardin (3) - 2 freq
glaran (3) - 2 freq
wardin (3) - 1 freq
gurnin (3) - 6 freq
guairdit (3) - 6 freq
guarded (3) - 2 freq
gearin (3) - 2 freq
goaadin (3) - 1 freq
grampian (3) - 11 freq
guidman (3) - 54 freq
guardian (0) - 13 freq
guardin (1) - 3 freq
guairdian (1) - 5 freq
gaurdian (2) - 7 freq
guardians (2) - 2 freq
guairdin (2) - 1 freq
gordan (3) - 1 freq
gairdin (3) - 19 freq
girdin (3) - 2 freq
gerdin (3) - 2 freq
garden (3) - 67 freq
gordin (3) - 2 freq
guirden (3) - 1 freq
guardit (3) - 1 freq
guairdians (3) - 3 freq
guards (4) - 13 freq
gairdians (4) - 2 freq
gaarin (4) - 1 freq
garrin (4) - 35 freq
boardin (4) - 7 freq
wurdin (4) - 1 freq
gairden (4) - 500 freq
giordano (4) - 1 freq
gerden (4) - 4 freq
gairdun (4) - 1 freq
SoundEx code - G635
greetin - 362 freq
gairden - 500 freq
greetin-faced - 5 freq
gairdner - 17 freq
gairdens - 90 freq
groutin - 1 freq
grytness - 2 freq
guirden - 1 freq
gairdners - 18 freq
gairden' - 5 freq
gairdeners - 28 freq
greetin's - 1 freq
gairden's - 3 freq
gordon - 123 freq
garden - 67 freq
greeting - 11 freq
grittin - 3 freq
greitin - 26 freq
gairden-an - 1 freq
gaurdian - 7 freq
gordon's - 6 freq
gardener - 2 freq
gairdener - 9 freq
gratin - 4 freq
'greetin' - 1 freq
gretna - 6 freq
gordonstoun - 2 freq
garden's - 1 freq
greetan - 10 freq
guardians - 2 freq
gairdenin - 8 freq
gairdians - 2 freq
gairdin - 19 freq
greetin' - 9 freq
graithin - 20 freq
gordin - 2 freq
gardens - 10 freq
growthieness - 2 freq
great-aunt - 1 freq
guardian - 13 freq
greetinfaced - 1 freq
gordons - 16 freq
gairden-how - 1 freq
grootan - 1 freq
greetings - 2 freq
greatness - 2 freq
gerdin - 2 freq
gairtens - 2 freq
gairdener's - 1 freq
grutten - 3 freq
greetins - 6 freq
garden' - 2 freq
gairdeen - 4 freq
graithins - 3 freq
girdin - 2 freq
greittin - 2 freq
guairdin - 1 freq
guairdian - 5 freq
gaerdeen - 3 freq
gerdeen - 10 freq
greeteen - 1 freq
greetin-teenies - 1 freq
gyratin - 1 freq
gairdins - 2 freq
gratins - 1 freq
greitin's - 1 freq
gairdnin - 3 freq
gairden- - 2 freq
garden-how - 1 freq
gardeners - 10 freq
gairtmorn - 1 freq
gartmorndam - 1 freq
gordeanna - 1 freq
garten - 2 freq
greatness' - 1 freq
graithen - 2 freq
gerden - 4 freq
guairdians - 3 freq
gratna - 1 freq
giordano - 1 freq
guardin - 3 freq
gairdun - 1 freq
guairdianship - 1 freq
€˜gairden - 1 freq
groutin' - 1 freq
gradient - 1 freq
gairdenfit - 2 freq
gradients - 1 freq
great-uncle - 1 freq
€œgreetings - 2 freq
gertin - 1 freq
gairdening - 1 freq
gardening - 3 freq
grtm - 1 freq
gordonsimpson - 3 freq
gordondunsmuir - 2 freq
gordonshortsbestmate - 1 freq
garytank - 2 freq
greatnorthrun - 1 freq
gerryadamssf - 3 freq
gordonguthrie - 1 freq
gardiner - 1 freq
gordonramsay - 2 freq
gordonginoandfred - 1 freq
“garden” - 1 freq
guardianopinion - 1 freq
guardiantv - 1 freq
gardner's - 1 freq
gardnerj - 1 freq
gordonhepburn - 1 freq
gaerdenin - 1 freq
gaerden - 1 freq
gartmoreps - 1 freq
gordonghll - 2 freq
gordonh - 2 freq
gordan - 1 freq
gordonschools - 1 freq
groatnews - 1 freq
MetaPhone code - KRTN
greetin - 362 freq
gairden - 500 freq
curtain - 43 freq
groutin - 1 freq
guirden - 1 freq
gairden' - 5 freq
gordon - 123 freq
coorie-doun - 1 freq
garden - 67 freq
grittin - 3 freq
greitin - 26 freq
creatin - 25 freq
gaurdian - 7 freq
cordon - 2 freq
'coortin' - 1 freq
carton - 7 freq
gratin - 4 freq
cortina - 1 freq
croudin - 1 freq
'greetin' - 1 freq
coortin - 18 freq
gretna - 6 freq
cairdin - 2 freq
cairtin - 7 freq
greetan - 10 freq
curteen - 1 freq
gairdin - 19 freq
greetin' - 9 freq
kirtana - 1 freq
gordin - 2 freq
coortin' - 1 freq
cartoon - 13 freq
cretin - 2 freq
guardian - 13 freq
croodin - 2 freq
curtin - 6 freq
cardin - 2 freq
grootan - 1 freq
crowdin - 1 freq
courtin - 4 freq
cruden - 26 freq
cartin - 1 freq
cairton - 3 freq
coorteen - 1 freq
grutten - 3 freq
cairtoon - 1 freq
garden' - 2 freq
gairdeen - 4 freq
greittin - 2 freq
guairdin - 1 freq
guairdian - 5 freq
coortain - 2 freq
gaerdeen - 3 freq
kartoon - 1 freq
greeteen - 1 freq
kertan - 1 freq
creatin' - 1 freq
curatin' - 1 freq
gairden- - 2 freq
crouton - 2 freq
cretan - 8 freq
curdna - 1 freq
gordeanna - 1 freq
garten - 2 freq
coorie-doon - 1 freq
gratna - 1 freq
cairtoun - 1 freq
guardin - 3 freq
gairdun - 1 freq
€˜gairden - 1 freq
groutin' - 1 freq
crueton - 1 freq
cruetown - 1 freq
krátun - 1 freq
crawdoun - 1 freq
cartain - 1 freq
€œcoortin - 1 freq
crdon - 1 freq
cardoon - 1 freq
“garden” - 1 freq
cratin - 1 freq
kourtney - 1 freq
gaerden - 1 freq
“coortin” - 1 freq
gordonh - 2 freq
gordan - 1 freq
GUARDIAN
Time to execute Levenshtein function - 0.319666 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.430430 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029317 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039994 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000854 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.