A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to mrskrabapple in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in parentheses)
mrskrabapple (0) - 1 freq
miserable (5) - 57 freq
thraipple (6) - 7 freq
meiserable (6) - 1 freq
desirable (6) - 4 freq
eskadale (6) - 1 freq
sermaple (6) - 1 freq
meisurable (6) - 2 freq
describable (6) - 1 freq
meeserable (6) - 6 freq
mïserable (6) - 1 freq
craw-aipple (6) - 1 freq
straachle (6) - 2 freq
scrabble (6) - 12 freq
muirbattle (6) - 1 freq
thrapple (6) - 123 freq
trapple (6) - 3 freq
mbappe (6) - 1 freq
morebattle (6) - 1 freq
strabane (6) - 3 freq
sapple (6) - 1 freq
meesrable (6) - 2 freq
memorable (6) - 10 freq
grapple (6) - 2 freq
meirbattle (6) - 1 freq
Double Levenshtein (combined distance in parentheses)
mrskrabapple (0) - 1 freq
miserable (9) - 57 freq
miserably (10) - 3 freq
meeserable (10) - 6 freq
meesrable (10) - 2 freq
meisurable (10) - 2 freq
amyspineapple (10) - 1 freq
meiserable (10) - 1 freq
meirbattle (11) - 1 freq
grapple (11) - 2 freq
meeserably (11) - 1 freq
sapple (11) - 1 freq
treisurable (11) - 2 freq
morebattle (11) - 1 freq
mermarble (11) - 1 freq
mrsfraserphs (11) - 4 freq
marketable (11) - 1 freq
memorable (11) - 10 freq
describable (11) - 1 freq
craw-aipple (11) - 1 freq
desirable (11) - 4 freq
mbappe (11) - 1 freq
sermaple (11) - 1 freq
scrabble (11) - 12 freq
mïserable (11) - 1 freq
SoundEx code - M626
merger - 1 freq
margaret - 95 freq
morcar - 4 freq
marjorie - 1 freq
markers - 7 freq
marguerite - 4 freq
muirkirk - 47 freq
margaret's - 17 freq
'margaret - 1 freq
margarine - 10 freq
mercury - 15 freq
morayshire - 1 freq
mairker - 1 freq
marker - 11 freq
mairchers - 4 freq
merry-go-roun - 2 freq
morecar - 1 freq
mairchers' - 1 freq
merker - 6 freq
mercurial - 1 freq
margret's - 3 freq
margret - 7 freq
mars-orcadia - 1 freq
mirkrife - 1 freq
margery - 1 freq
mirkier - 2 freq
mirker - 1 freq
mairkers - 1 freq
mercurius - 1 freq
margarete - 1 freq
marjoribanks - 18 freq
marjoribank - 1 freq
marjory - 1 freq
merkers - 1 freq
margarets - 1 freq
merkir - 1 freq
markrowantree - 4 freq
merrycrimbo - 1 freq
margaretsh - 1 freq
mrsurgfnlz - 1 freq
moiragreentree - 1 freq
mairgrass - 8 freq
marccorbishley - 1 freq
murrkirk - 2 freq
mrskrabapple - 1 freq
markryansmith - 1 freq
margaretdunne - 2 freq
markweir - 2 freq
muirkirk's - 1 freq
muirkirkcoop - 1 freq
mauracurrie - 1 freq
MetaPhone code - MRSKRBPL
mrskrabapple - 1 freq
Manually curated
MRSKRABAPPLE
Time to execute Levenshtein function - 0.214700 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
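The definition above can be sketched in a few lines of Python. This is a minimal illustration using the textbook dynamic-programming method with a cost of 1 per edit, not the site's own code; the function name `levenshtein` is an assumption made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters into an empty string costs i deletions
        for j, char_b in enumerate(b, start=1):
            substitution = previous[j - 1] + (0 if char_a == char_b else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


print(levenshtein("mrskrabapple", "miserable"))
```

For the query word shown on this page, the sketch should agree with the listed distance of 5 for miserable. It compares Unicode characters, so results for words with accented letters may differ from an implementation that compares bytes.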
Time to execute Double Levenshtein function - 0.406392 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
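A rough sketch of that idea in Python follows. The names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set (a, e, i, o, u) is an assumption: the page does not say which letters are treated as vowels.

```python
VOWELS = set("aeiou")  # assumption: the page does not list its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance with a cost of 1 per insert, delete or replace."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping every other character in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("mrskrabapple", "miserable"))
```

With these assumptions, miserable scores 5 on the full words plus 4 on the consonant skeletons (mrskrbppl against msrbl), matching the 9 listed above.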
Time to execute SoundEx function - 0.036285 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
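As an illustration, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad or truncate to four characters. The function name `soundex` and the handling of non-letters (they are simply dropped) are assumptions for this example; the site's own implementation may differ in edge cases.

```python
# Standard Soundex digit groups; vowels, h, w and y carry no digit.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    digits = []
    previous = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a code
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != previous:
            digits.append(code)
        previous = code  # a vowel resets this, so a repeated code counts again
    return (letters[0].upper() + "".join(digits) + "000")[:4]


for word in ("mrskrabapple", "margaret", "muirkirk"):
    print(word, soundex(word))
```

All three example words should come out as M626, the code reported for the query word above, which is why such different-looking words share one SoundEx list.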
Time to execute MetaPhone function - 0.040045 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000888 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
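A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical examples invented for illustration, not taken from the site's lexicon, and `lookup_lemma` is an assumed name.

```python
from typing import Optional

# Hypothetical hand-made entries mapping spelling variants to a lemma.
LEXICON = {
    "meeserable": "miserable",
    "meiserable": "miserable",
    "thraipple": "thrapple",
}


def lookup_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("Meeserable"))
print(lookup_lemma("mrskrabapple"))
```

Returning None for uncovered words reflects the note that not all words are in the table. A dictionary lookup takes constant time on average, which is consistent with this method having the shortest execution time listed above.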