A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to worm-eeten in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
worm-eeten (0) - 1 freq
wirm-etten (2) - 1 freq
worm-taen (3) - 1 freq
oreetin (4) - 1 freq
tormented (4) - 10 freq
wormsection (4) - 1 freq
tormentin (4) - 5 freq
moch-aeten (4) - 1 freq
moch-etten (4) - 1 freq
someeen (4) - 1 freq
moth-eaten (4) - 1 freq
wormskates (4) - 3 freq
meeten (4) - 1 freq
compeetin (4) - 1 freq
moth-etten (4) - 1 freq
somtheen (5) - 19 freq
forfochten (5) - 10 freq
written (5) - 283 freq
compleen (5) - 6 freq
formulate (5) - 2 freq
torsten (5) - 1 freq
compiete (5) - 1 freq
tormentit (5) - 4 freq
fermeen (5) - 1 freq
momeen (5) - 2 freq
worm-eeten (0) - 1 freq
worm-taen (3) - 1 freq
wirm-etten (3) - 1 freq
tormentin (6) - 5 freq
wormsection (6) - 1 freq
waementin (7) - 1 freq
writen (7) - 1 freq
wormit (7) - 7 freq
formattin (7) - 1 freq
wrouchten (7) - 1 freq
wirm-like (7) - 1 freq
warm-like (7) - 1 freq
work-in (7) - 1 freq
formulatin (7) - 1 freq
warmest (7) - 5 freq
worritin (7) - 1 freq
ormiston (7) - 1 freq
wrutten (7) - 84 freq
wi-outen (7) - 1 freq
worman (7) - 1 freq
worn-oot (7) - 2 freq
writeen (7) - 3 freq
moth-eaten (7) - 1 freq
wormskates (7) - 3 freq
owersetten (7) - 7 freq
SoundEx code - W653
wairmth - 8 freq
warmed - 18 freq
wormed - 1 freq
warned - 36 freq
waarnt - 1 freq
warmt - 2 freq
warmth - 59 freq
warrant - 19 freq
worm-eeten - 1 freq
wormit - 7 freq
wairned - 2 freq
whirwind - 1 freq
weren't - 4 freq
warrandice - 2 freq
warnt - 20 freq
wormwidd - 77 freq
wormwidd's - 4 freq
wormwidds' - 1 freq
'wormwidd - 1 freq
waarmth - 12 freq
weerin't - 1 freq
warmit - 2 freq
worned - 1 freq
waarmed - 3 freq
warn't - 1 freq
worn-oot - 2 freq
waarantie - 3 freq
where-inti - 1 freq
where-intil - 9 freq
worm-taen - 1 freq
waarmt - 3 freq
waarint - 1 freq
waar-naitered - 1 freq
warranted - 1 freq
warranty - 1 freq
warrandyces - 1 freq
warrandyce - 1 freq
warrender - 1 freq
wirmwid - 1 freq
wirmed - 2 freq
warrand - 5 freq
waarned - 2 freq
wirm-etten - 1 freq
warranties - 2 freq
wirmit - 1 freq
werent - 1 freq
weren’t - 1 freq
MetaPhone code - WRMTN
worm-eeten - 1 freq
worm-taen - 1 freq
wirm-etten - 1 freq
WORM-EETEN
Time to execute Levenshtein function - 0.230164 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.434460 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030082 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041916 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000949 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.