A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to great-aunt in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
great-aunt (0) - 1 freq
creatioun (4) - 1 freq
greatest (4) - 42 freq
greasepaint (4) - 1 freq
greatguy (4) - 1 freq
greatfu (4) - 1 freq
greachan (4) - 5 freq
restraint (4) - 5 freq
treatment (4) - 44 freq
great-great (4) - 1 freq
greetan (4) - 10 freq
restraunts (4) - 1 freq
graunt (4) - 1 freq
gra-an (4) - 1 freq
great-uncle (4) - 1 freq
resteraunt (4) - 1 freq
breathan (4) - 4 freq
reactiouns (5) - 1 freq
treatments (5) - 3 freq
fragrant (5) - 6 freq
greesan (5) - 1 freq
grateful (5) - 33 freq
fit-dunt (5) - 1 freq
radiant (5) - 5 freq
dreadnaucht (5) - 1 freq
great-aunt (0) - 1 freq
graunt (6) - 1 freq
treatment (6) - 44 freq
greetan (6) - 10 freq
gra-an (6) - 1 freq
great-great (6) - 1 freq
great-uncle (6) - 1 freq
greatest (6) - 42 freq
greasepaint (6) - 1 freq
greetit (7) - 10 freq
gradient (7) - 1 freq
aert-kent (7) - 1 freq
greement (7) - 20 freq
gratins (7) - 1 freq
gratet (7) - 1 freq
gratna (7) - 1 freq
grettest (7) - 1 freq
greetins (7) - 7 freq
grunt (7) - 33 freq
greitt (7) - 1 freq
graint (7) - 4 freq
greitit (7) - 1 freq
gratin (7) - 4 freq
greeting (7) - 13 freq
greetin (7) - 366 freq
SoundEx code - G635
greetin - 366 freq
gairden - 503 freq
greetin-faced - 5 freq
gairdner - 17 freq
gairdens - 91 freq
groutin - 1 freq
grytness - 2 freq
guirden - 1 freq
gairdners - 18 freq
gairden' - 5 freq
gairdeners - 28 freq
greetin's - 1 freq
gairden's - 3 freq
gordon - 123 freq
garden - 67 freq
greeting - 13 freq
grittin - 3 freq
greitin - 26 freq
gairden-an - 1 freq
gaurdian - 7 freq
gordon's - 6 freq
gardener - 2 freq
gairdener - 9 freq
gratin - 4 freq
'greetin' - 1 freq
gretna - 6 freq
gordonstoun - 2 freq
garden's - 1 freq
greetan - 10 freq
guardians - 2 freq
gairdenin - 8 freq
gairdians - 2 freq
gairdin - 19 freq
greetin' - 9 freq
graithin - 20 freq
gordin - 2 freq
gardens - 10 freq
greetins - 7 freq
growthieness - 2 freq
great-aunt - 1 freq
guardian - 13 freq
greetinfaced - 1 freq
gordons - 16 freq
gairden-how - 1 freq
grootan - 1 freq
greetings - 2 freq
greatness - 2 freq
gerdin - 2 freq
gairtens - 2 freq
gairdener's - 1 freq
grutten - 3 freq
garden' - 2 freq
gairdeen - 4 freq
graithins - 3 freq
girdin - 2 freq
greittin - 2 freq
guairdin - 1 freq
guairdian - 5 freq
gaerdeen - 3 freq
gerdeen - 10 freq
greeteen - 1 freq
greetin-teenies - 1 freq
gyratin - 1 freq
gairdins - 2 freq
gratins - 1 freq
greitin's - 1 freq
gairdnin - 3 freq
gairden- - 2 freq
garden-how - 1 freq
gardeners - 10 freq
gairtmorn - 1 freq
gartmorndam - 1 freq
gordeanna - 1 freq
garten - 2 freq
greatness' - 1 freq
graithen - 2 freq
gerden - 4 freq
guairdians - 3 freq
gratna - 1 freq
giordano - 1 freq
guardin - 3 freq
gairdun - 1 freq
guairdianship - 1 freq
€˜gairden - 1 freq
groutin' - 1 freq
gradient - 1 freq
gairdenfit - 2 freq
gradients - 1 freq
great-uncle - 1 freq
€œgreetings - 2 freq
gertin - 1 freq
gairdening - 1 freq
gardening - 3 freq
grtm - 1 freq
gordonsimpson - 3 freq
gordondunsmuir - 2 freq
gordonshortsbestmate - 1 freq
garytank - 2 freq
greatnorthrun - 1 freq
gerryadamssf - 3 freq
gordonguthrie - 1 freq
gardiner - 1 freq
gordonramsay - 2 freq
gordonginoandfred - 1 freq
“garden” - 1 freq
guardianopinion - 1 freq
guardiantv - 1 freq
gardner's - 1 freq
gardnerj - 1 freq
gordonhepburn - 1 freq
gaerdenin - 1 freq
gaerden - 1 freq
gartmoreps - 1 freq
gordonghll - 2 freq
gordonh - 2 freq
gordan - 1 freq
gordonschools - 1 freq
groatnews - 1 freq
MetaPhone code - KRTNT
curtained - 1 freq
great-aunt - 1 freq
coordinate - 2 freq
co-ordinate - 6 freq
gradient - 1 freq
GREAT-AUNT
Time to execute Levenshtein function - 0.238695 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.406847 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031393 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039399 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000906 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.