A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to whilk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
whilk (0) - 193 freq
whill (1) - 13 freq
qwhilk (1) - 1 freq
while (1) - 1458 freq
whisk (1) - 2 freq
ahil (2) - 1 freq
whine (2) - 5 freq
whilst (2) - 124 freq
weil- (2) - 1 freq
will (2) - 3306 freq
hulk (2) - 4 freq
'hink (2) - 5 freq
whins (2) - 12 freq
think (2) - 3054 freq
whict (2) - 1 freq
whale (2) - 32 freq
whin' (2) - 1 freq
wrily (2) - 1 freq
wwhile (2) - 1 freq
wuill (2) - 1 freq
wilt (2) - 3 freq
wheek (2) - 8 freq
wuil (2) - 1 freq
chink (2) - 13 freq
whiles (2) - 479 freq
whilk (0) - 193 freq
whill (2) - 13 freq
while (2) - 1458 freq
whisk (2) - 2 freq
qwhilk (2) - 1 freq
while' (3) - 1 freq
whelks (3) - 14 freq
walk (3) - 483 freq
awhile (3) - 15 freq
wulk (3) - 1 freq
chalk (3) - 25 freq
whaisk (3) - 1 freq
whilom (3) - 1 freq
whitlik (3) - 35 freq
whilie (3) - 87 freq
waulk (3) - 1 freq
whiley (3) - 12 freq
whul (3) - 1 freq
wheils (3) - 1 freq
whl (3) - 1 freq
whole (3) - 191 freq
whele (3) - 1 freq
whenk (3) - 4 freq
wlk (3) - 1 freq
quhilk (3) - 4 freq
SoundEx code - W420
walk - 483 freq
whyles - 85 freq
whiles - 479 freq
willie's - 22 freq
whilk - 193 freq
wells - 19 freq
wheels - 83 freq
walks - 100 freq
walls - 29 freq
while's - 4 freq
will's - 1 freq
waalk - 5 freq
walk' - 3 freq
willox - 1 freq
weel's - 16 freq
wheelhoose - 9 freq
wulls - 2 freq
wullie's - 19 freq
wellies - 16 freq
wheel's - 9 freq
whales - 15 freq
wallace - 118 freq
wills - 3 freq
woolies - 2 freq
weles - 1 freq
wails - 6 freq
wull's - 6 freq
waals - 7 freq
wulks - 1 freq
wyles - 2 freq
wales - 39 freq
wels - 5 freq
welch - 4 freq
welsh - 175 freq
'whiles - 2 freq
wallies - 16 freq
willicks - 2 freq
well's - 1 freq
wall's - 8 freq
whelks - 14 freq
'whelks - 1 freq
weelàss - 7 freq
weelass - 2 freq
wully's - 11 freq
wiles - 11 freq
wullies - 9 freq
wullies's - 2 freq
walsh - 7 freq
whilk's - 1 freq
walays - 1 freq
waalls - 1 freq
willick - 2 freq
willick's - 1 freq
willies - 6 freq
willows - 4 freq
willock - 1 freq
whalsa - 3 freq
w-wullie's - 1 freq
waa-like - 1 freq
weill's - 2 freq
'walk - 1 freq
whaals - 4 freq
wulk - 1 freq
walls' - 1 freq
weil's - 1 freq
whillie's - 1 freq
waels - 4 freq
walkie - 5 freq
walloch - 1 freq
waulk - 1 freq
whales' - 1 freq
wheils - 1 freq
€œwhiles - 4 freq
waleys - 1 freq
wallahs - 1 freq
wheelies - 2 freq
welsch - 1 freq
whaalsa - 4 freq
wal's - 1 freq
wals - 1 freq
€œwhyles - 1 freq
€œwalk - 1 freq
waelz - 1 freq
wheelhouse - 2 freq
waaalks - 1 freq
williewaugh - 1 freq
wls - 1 freq
wallsie - 1 freq
“whiles” - 1 freq
willis - 1 freq
whelks' - 1 freq
whalsay - 1 freq
willieÂ’s - 1 freq
wlk - 1 freq
MetaPhone code - WLK
walk - 483 freq
whilk - 193 freq
waalk - 5 freq
walk' - 3 freq
willick - 2 freq
willock - 1 freq
waa-like - 1 freq
'walk - 1 freq
wulk - 1 freq
walkie - 5 freq
waulk - 1 freq
€œwalk - 1 freq
WHILK
Time to execute Levenshtein function - 0.199057 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.350538 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033573 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039178 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000866 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.