A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to warm in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
warm (0) - 355 freq
worm (1) - 34 freq
waurm (1) - 1 freq
wart (1) - 4 freq
farm (1) - 28 freq
warp (1) - 9 freq
harm (1) - 21 freq
warn (1) - 34 freq
wurm (1) - 1 freq
arm (1) - 50 freq
warum (1) - 5 freq
ware (1) - 26 freq
garm (1) - 1 freq
waam (1) - 1 freq
watrm (1) - 1 freq
war (1) - 1438 freq
warmt (1) - 2 freq
wairm (1) - 46 freq
swarm (1) - 2 freq
karm (1) - 2 freq
wirm (1) - 17 freq
wark (1) - 892 freq
wary (1) - 11 freq
warms (1) - 8 freq
wam (1) - 1 freq
warm (0) - 355 freq
warum (1) - 5 freq
wirm (1) - 17 freq
wairm (1) - 46 freq
waarm (1) - 81 freq
wurm (1) - 1 freq
waurm (1) - 1 freq
worm (1) - 34 freq
wark (2) - 892 freq
wary (2) - 11 freq
karm (2) - 2 freq
warms (2) - 8 freq
wam (2) - 1 freq
ward (2) - 89 freq
swarm (2) - 2 freq
wars (2) - 61 freq
wormy (2) - 1 freq
warg (2) - 1 freq
warl (2) - 164 freq
harm (2) - 21 freq
warn (2) - 34 freq
warp (2) - 9 freq
warmt (2) - 2 freq
wart (2) - 4 freq
arm (2) - 50 freq
SoundEx code - W650
werena - 45 freq
warm - 355 freq
waarm - 81 freq
worm - 34 freq
weerin - 66 freq
warnae - 15 freq
wirm - 17 freq
wirnae - 31 freq
worryin - 20 freq
werenae - 95 freq
wearin - 190 freq
wairm - 46 freq
worn - 59 freq
wurnae - 73 freq
wirin - 2 freq
wernae - 13 freq
weirin - 58 freq
wearyin - 8 freq
warna - 36 freq
whorin - 2 freq
waurm - 1 freq
werrin - 3 freq
warum - 5 freq
werna - 13 freq
wirryin - 10 freq
warren - 35 freq
warn - 34 freq
warran - 4 freq
whaur-in - 1 freq
wirna - 34 freq
wairn - 2 freq
waurna - 10 freq
wearin' - 4 freq
wermm - 1 freq
weeren - 1 freq
wooryin' - 1 freq
whaurin - 5 freq
werain - 1 freq
waurnae - 1 freq
ware-in - 1 freq
wirno - 8 freq
wurm - 1 freq
weren - 2 freq
wern - 1 freq
wir'n - 1 freq
wearan - 9 freq
wurn - 1 freq
whirran - 1 freq
waarn - 1 freq
warn-awey - 1 freq
wren - 3 freq
worryan - 1 freq
waeran - 1 freq
wierin - 4 freq
wuirn - 2 freq
wirmie - 1 freq
warrum - 1 freq
wharein - 1 freq
warrin - 1 freq
wran - 2 freq
whaur'm - 1 freq
'weirin - 1 freq
whereon - 1 freq
whorne - 2 freq
warin - 2 freq
whirrin - 4 freq
wereni - 1 freq
wurna - 2 freq
€œwarn - 1 freq
wormy - 1 freq
werenay - 1 freq
wereny - 2 freq
warrim - 1 freq
wer'nae - 1 freq
wurny - 2 freq
wairin - 1 freq
MetaPhone code - WRM
warm - 355 freq
waarm - 81 freq
worm - 34 freq
wirm - 17 freq
wairm - 46 freq
waurm - 1 freq
warum - 5 freq
wermm - 1 freq
wurm - 1 freq
wirmie - 1 freq
warrum - 1 freq
whaur'm - 1 freq
wormy - 1 freq
warrim - 1 freq
WARM
Time to execute Levenshtein function - 0.189968 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.326549 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027730 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037177 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000802 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.