A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to walsh in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
walsh (0) - 7 freq
wals (1) - 1 freq
warsh (1) - 1 freq
wash (1) - 131 freq
welsh (1) - 174 freq
waash (1) - 32 freq
walth (1) - 35 freq
wal (2) - 24 freq
waist (2) - 45 freq
wlnh (2) - 1 freq
waugh (2) - 2 freq
wesh (2) - 8 freq
wais (2) - 6 freq
wels (2) - 5 freq
waks (2) - 8 freq
ways (2) - 49 freq
balsa (2) - 1 freq
wall (2) - 76 freq
waas (2) - 155 freq
waals (2) - 7 freq
wish (2) - 374 freq
swash (2) - 1 freq
wasp (2) - 5 freq
wally (2) - 47 freq
gash (2) - 14 freq
Double Levenshtein
walsh (0) - 7 freq
welsh (1) - 174 freq
walth (2) - 35 freq
waash (2) - 32 freq
wals (2) - 1 freq
wash (2) - 131 freq
warsh (2) - 1 freq
wls (3) - 1 freq
wailth (3) - 1 freq
wish (3) - 374 freq
whash (3) - 1 freq
walthy (3) - 10 freq
weish (3) - 4 freq
waels (3) - 4 freq
welch (3) - 2 freq
welsch (3) - 1 freq
awash (3) - 2 freq
weesh (3) - 11 freq
wersh (3) - 24 freq
waals (3) - 7 freq
wales (3) - 37 freq
wesh (3) - 8 freq
wels (3) - 5 freq
gulsh (3) - 2 freq
lash (3) - 16 freq
SoundEx code - W420
walk - 464 freq
whyles - 85 freq
whiles - 478 freq
willie's - 22 freq
whilk - 193 freq
wells - 19 freq
wheels - 83 freq
walks - 97 freq
walls - 27 freq
while's - 4 freq
will's - 1 freq
waalk - 5 freq
walk' - 3 freq
willox - 1 freq
weel's - 16 freq
wheelhoose - 9 freq
wulls - 2 freq
wullie's - 19 freq
wellies - 16 freq
wheel's - 9 freq
whales - 15 freq
wallace - 117 freq
wills - 3 freq
woolies - 2 freq
weles - 1 freq
wails - 6 freq
wull's - 6 freq
waals - 7 freq
wulks - 1 freq
wyles - 2 freq
wales - 37 freq
wels - 5 freq
'whiles - 2 freq
wallies - 16 freq
welsh - 174 freq
willicks - 2 freq
well's - 1 freq
wall's - 8 freq
whelks - 14 freq
'whelks - 1 freq
weelàss - 7 freq
weelass - 2 freq
wully's - 11 freq
wiles - 11 freq
wullies - 9 freq
wullies's - 2 freq
walsh - 7 freq
whilk's - 1 freq
walays - 1 freq
waalls - 1 freq
willick - 2 freq
willick's - 1 freq
willies - 6 freq
willows - 4 freq
willock - 1 freq
whalsa - 3 freq
w-wullie's - 1 freq
waa-like - 1 freq
weill's - 2 freq
'walk - 1 freq
whaals - 4 freq
wulk - 1 freq
walls' - 1 freq
weil's - 1 freq
whillie's - 1 freq
waels - 4 freq
walkie - 5 freq
walloch - 1 freq
waulk - 1 freq
whales' - 1 freq
welch - 2 freq
wheils - 1 freq
“whiles - 4 freq
waleys - 1 freq
wallahs - 1 freq
wheelies - 2 freq
welsch - 1 freq
whaalsa - 4 freq
wal's - 1 freq
wals - 1 freq
“whyles - 1 freq
“walk - 1 freq
waelz - 1 freq
wheelhouse - 2 freq
waaalks - 1 freq
williewaugh - 1 freq
wls - 1 freq
wallsie - 1 freq
“whiles” - 1 freq
willis - 1 freq
whelks' - 1 freq
whalsay - 1 freq
willie’s - 1 freq
wlk - 1 freq
MetaPhone code - WLX
welsh - 174 freq
walsh - 7 freq
walloch - 1 freq
welch - 2 freq
Manually curated
WALSH
Time to execute Levenshtein function - 0.187521 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
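As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming method. The function name `levenshtein` is chosen for this example only, and this is not necessarily how the site computes its figures.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


for word in ("walsh", "welsh", "wash", "wish"):
    print(word, levenshtein("walsh", word))
```

The distances printed for these four words agree with the bracketed figures in the Levenshtein list above.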
Time to execute Double Levenshtein function - 0.360306 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
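A hedged sketch of that idea in Python: one distance on the full words plus one on their consonant skeletons. The helper names and the vowel set are assumptions. Counting `y` as a vowel is a guess, made only because it is consistent with the walthy (3) entry above.

```python
VOWELS = set("aeiouy")  # assumption: the site's actual vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), row-by-row DP."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Consonant skeleton of a word, e.g. 'walsh' -> 'wlsh'."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("welsh", "wash", "wish", "walthy"):
    print(word, double_levenshtein("walsh", word))
```

Under this sketch, welsh stays at 1 because only a vowel differs, while wash rises from 1 to 2 because the missing l is counted in both passes.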
Time to execute SoundEx function - 0.027513 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
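To show how a code such as W420 comes about, here is a minimal Python sketch of the classic four-character American Soundex rules. It is an illustration only: the function name is made up for this example, non-ASCII letters and punctuation are simply dropped, and the site's own implementation may differ in edge cases.

```python
# Consonant groups that share a digit in classic Soundex.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the first letter plus three digits, zero-padded."""
    # Assumption: ignore apostrophes, hyphens and accented characters.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated codes
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a repeat is coded again
        if len(result) == 4:
            break
    return result.ljust(4, "0")


for word in ("walsh", "welsh", "whilk", "wallace", "wheelhoose"):
    print(word, soundex(word))
```

All five of these words come out as W420, which is why they appear together in the SoundEx list above even though some of them are several edits apart.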
Time to execute MetaPhone function - 0.036359 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000853 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
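A lookup of this kind can be pictured as a plain dictionary from word form to lemma. The sketch below is hypothetical: the table name, the function name and the table's shape are assumptions. Its single entry mirrors the WALSH result shown above, and reading that result as the lemma is itself an assumption.

```python
from typing import Optional

# Hypothetical stand-in for the hand-built lexicon: word form -> lemma.
# The real table is not shown on this page; this one entry is illustrative.
LEMMA_TABLE = {
    "walsh": "WALSH",
}


def lookup_lemma(word: str, table: dict = LEMMA_TABLE) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return table.get(word.lower())


print(lookup_lemma("walsh"))
print(lookup_lemma("zzzz"))  # made-up word, not in the table
```

Because the lookup is a single hash-table access with no per-word comparison, it fits with this method having the shortest execution time listed above.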