A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to margaret in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
margaret (0) - 95 freq
'margaret (1) - 1 freq
margret (1) - 7 freq
magaret (1) - 2 freq
margarets (1) - 1 freq
margarete (1) - 1 freq
magret (2) - 2 freq
margaret's (2) - 17 freq
margaretsh (2) - 1 freq
marget (2) - 6 freq
marge (3) - 1 freq
aroart (3) - 1 freq
market (3) - 51 freq
target (3) - 33 freq
maugered (3) - 3 freq
magrit (3) - 1 freq
carret (3) - 3 freq
marraes (3) - 6 freq
mararyk (3) - 2 freq
martyred (3) - 3 freq
margery (3) - 1 freq
mairret (3) - 17 freq
angeret (3) - 1 freq
argyre (3) - 1 freq
garret (3) - 4 freq
margaret (0) - 95 freq
margarete (1) - 1 freq
margret (1) - 7 freq
margarets (2) - 1 freq
magaret (2) - 2 freq
'margaret (2) - 1 freq
marget (3) - 6 freq
magret (3) - 2 freq
magrit (4) - 1 freq
marrit (4) - 1 freq
margit (4) - 32 freq
mairret (4) - 17 freq
margery (4) - 1 freq
marguerite (4) - 4 freq
margaretsh (4) - 1 freq
margaret's (4) - 17 freq
marriet (4) - 15 freq
margarine (4) - 10 freq
merrit (5) - 6 freq
mertert (5) - 1 freq
mairriet (5) - 55 freq
mogert (5) - 1 freq
murderit (5) - 7 freq
marmore (5) - 2 freq
haggart (5) - 3 freq
SoundEx code - M626
merger - 1 freq
margaret - 95 freq
morcar - 4 freq
marjorie - 1 freq
markers - 7 freq
marguerite - 4 freq
muirkirk - 47 freq
margaret's - 17 freq
'margaret - 1 freq
margarine - 10 freq
mercury - 15 freq
morayshire - 1 freq
mairker - 1 freq
marker - 11 freq
mairchers - 4 freq
merry-go-roun - 2 freq
morecar - 1 freq
mairchers' - 1 freq
merker - 6 freq
mercurial - 1 freq
margret's - 3 freq
margret - 7 freq
mars-orcadia - 1 freq
mirkrife - 1 freq
margery - 1 freq
mirkier - 2 freq
mirker - 1 freq
mairkers - 1 freq
mercurius - 1 freq
margarete - 1 freq
marjoribanks - 18 freq
marjoribank - 1 freq
marjory - 1 freq
merkers - 1 freq
margarets - 1 freq
merkir - 1 freq
markrowantree - 4 freq
merrycrimbo - 1 freq
margaretsh - 1 freq
mrsurgfnlz - 1 freq
moiragreentree - 1 freq
mairgrass - 8 freq
marccorbishley - 1 freq
murrkirk - 2 freq
mrskrabapple - 1 freq
markryansmith - 1 freq
margaretdunne - 2 freq
markweir - 2 freq
muirkirk's - 1 freq
muirkirkcoop - 1 freq
mauracurrie - 1 freq
MetaPhone code - MRKRT
margaret - 95 freq
marguerite - 4 freq
'margaret - 1 freq
margret - 7 freq
margarete - 1 freq
MARGARET
Time to execute Levenshtein function - 0.203150 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.379998 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029358 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037455 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000939 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.