A Corpus of 21st Century Scots Texts


Levenshtein Distance



Similar words to whaups in Corpus

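Results are listed by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. In the two Levenshtein lists, the number in brackets is the distance from whaups.

Levenshtein
whaups (0) - 21 freq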
whalps (1) - 3 freq
whaup (1) - 15 freq
whaps (1) - 3 freq
whau's (1) - 7 freq
whaup's (1) - 1 freq
whaurs (1) - 9 freq
whaaps (1) - 1 freq
whuts (2) - 11 freq
awhaurs (2) - 1 freq
whuns (2) - 5 freq
wha''s (2) - 2 freq
whar's (2) - 8 freq
waps (2) - 1 freq
whisps (2) - 1 freq
whurs (2) - 1 freq
whau’s (2) - 1 freq
staups (2) - 3 freq
wauks (2) - 5 freq
haips (2) - 1 freq
whau'd (2) - 1 freq
jaups (2) - 3 freq
what's (2) - 116 freq
caups (2) - 6 freq
whap's (2) - 1 freq

Double Levenshtein
whaups (0) - 21 freq
whaaps (1) - 1 freq
whaps (1) - 3 freq
wheeps (2) - 1 freq
whips (2) - 22 freq
whaurs (2) - 9 freq
whoops (2) - 9 freq
whau's (2) - 7 freq
whaup (2) - 15 freq
whaup's (2) - 1 freq
whalps (2) - 3 freq
haeps (3) - 4 freq
whup (3) - 8 freq
whause (3) - 20 freq
whaes (3) - 27 freq
wha'es (3) - 3 freq
whauras (3) - 5 freq
wasps (3) - 4 freq
shaeps (3) - 3 freq
shups (3) - 1 freq
whats (3) - 18 freq
whaap (3) - 3 freq
waasps (3) - 1 freq
whaas (3) - 1 freq
whae's (3) - 29 freq
SoundEx code - W120
weeps - 1 freq
waves - 137 freq
wappie's - 1 freq
wouves - 1 freq
wives - 42 freq
wife's - 27 freq
whips - 22 freq
wipes - 4 freq
whoops - 9 freq
wowfs - 3 freq
wowffs - 1 freq
wifock - 1 freq
waffs - 7 freq
wifies - 57 freq
wifie's - 19 freq
whaups - 21 freq
wifies' - 2 freq
waps - 1 freq
wheeps - 1 freq
waive's - 11 freq
wave's - 1 freq
waives - 1 freq
wayv's - 1 freq
waiv's - 1 freq
wyfe's - 1 freq
wabs - 10 freq
wifes - 7 freq
weaves - 8 freq
whaps - 3 freq
whaup's - 1 freq
whuffs - 3 freq
'whips - 1 freq
webs - 2 freq
wheefs - 1 freq
wouffs - 1 freq
wives' - 1 freq
wab's - 2 freq
waffish - 2 freq
wips - 1 freq
wey-bauk - 1 freq
wyvis - 1 freq
wyffis - 1 freq
wyves - 1 freq
wyfes - 2 freq
whaaps - 1 freq
wyfis - 1 freq
waifs - 1 freq
wyiffis - 1 freq
whiffs - 1 freq
whaupshaw - 4 freq
woufs - 1 freq
wouf's - 2 freq
wiffies - 1 freq
whap's - 1 freq
wbki - 1 freq
wifeys - 1 freq
wifie’s - 2 freq
wwfxqo - 1 freq
wife’s - 3 freq
wpz - 1 freq
wbgy - 1 freq
wuvx - 1 freq
wpg - 1 freq
wbkuy - 1 freq
wipkh - 1 freq
wvix - 1 freq
weaaufbiss - 1 freq
wbaggs - 1 freq
wives’ - 1 freq
MetaPhone code - WPS
weeps - 1 freq
wappie's - 1 freq
whips - 22 freq
wipes - 4 freq
whoops - 9 freq
whaups - 21 freq
waps - 1 freq
wheeps - 1 freq
whaps - 3 freq
whaup's - 1 freq
'whips - 1 freq
wips - 1 freq
whaaps - 1 freq
whap's - 1 freq

Manually curated
WHAUPS
Time to execute Levenshtein function - 0.377986 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
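As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the site's own code; the function name `levenshtein` is an assumption made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


for word in ("whaups", "whaup", "whuts", "what's"):
    print(word, levenshtein("whaups", word))
```

With this sketch, whaup comes out at distance 1 and whuts and what's at distance 2, which agrees with the Levenshtein list above.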
Time to execute Double Levenshtein function - 0.629305 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
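The description above can be sketched in Python as follows. This is a reconstruction under stated assumptions, not the site's implementation: the names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set is assumed to be a, e, i, o, u only, with apostrophes and other characters left in place.

```python
def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and any punctuation."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("whaaps", "whips", "whaup", "haeps"):
    print(word, double_levenshtein("whaups", word))
```

Under these assumptions the sketch gives whaaps 1, whips 2, whaup 2 and haeps 3, matching the Double Levenshtein list above. Words that differ from whaups only in their vowels, such as whips, rank closer than words that lose or change a consonant.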
Time to execute SoundEx function - 0.029327 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
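To show how a word ends up with a code such as W120, here is a minimal Python sketch of the classic American Soundex rules. It is an illustration, not the function the site calls; the name `soundex` and the choice to ignore apostrophes, hyphens and non-ASCII characters are assumptions made for this example.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate same-coded consonants
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a repeat after it counts
        if len(result) == 4:
            break
    return (result + "000")[:4]  # pad short codes with zeros


for word in ("whaups", "waves", "wifock", "whaupshaw"):
    print(word, soundex(word))
```

All four sample words reduce to W120 under these rules, which is why such different-looking words share one SoundEx list: only the first letter and the first few consonant classes survive.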
Time to execute MetaPhone function - 0.063563 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000826 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
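A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical, invented purely for illustration, and are not taken from the site's lexicon; `LEXICON` and `lemma_for` are likewise assumed names.

```python
from typing import Optional

# Hypothetical entries for illustration only; not the site's real lexicon.
LEXICON = {
    "whaups": "whaup",
    "whaup's": "whaup",
    "whaaps": "whaup",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Whaups"))  # found in the table
print(lemma_for("sonsie"))  # not covered, so None
```

A dictionary lookup does no computation on the word itself, which fits the very short execution time reported for this method, and returning None makes the "not all words are covered" case explicit.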