A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to thoom-blurried in Corpus

Levenshtein - Double Levenshtein - SoundEx - MetaPhone - Manually curated
Levenshtein (distance in brackets)
thoom-blurried (0) - 1 freq
thoom-blakked (4) - 1 freq
ploom-coloured (7) - 3 freq
throu-bearin (7) - 29 freq
storm-cloured (7) - 1 freq
gooseberries (7) - 3 freq
throu-beirin (7) - 1 freq
blurred (7) - 6 freq
stoor-bliind (7) - 1 freq
shooldered (7) - 1 freq
thoomed (7) - 3 freq
hurried (7) - 57 freq
five-baurred (7) - 2 freq
flurried (7) - 1 freq
hoose-spurgie (7) - 1 freq
skooshberries (7) - 1 freq
blood-blotched (7) - 1 freq
courried (7) - 1 freq
toun-bred (7) - 3 freq
doon-graded (8) - 1 freq
timbered (8) - 1 freq
tortured (8) - 19 freq
hame-brewed (8) - 1 freq
crowberries (8) - 1 freq
thimbles (8) - 2 freq
Double Levenshtein (distance in brackets)
thoom-blurried (0) - 1 freq
thoom-blakked (6) - 1 freq
thrie-coloured (11) - 1 freq
hame-brewed (11) - 1 freq
blurred (11) - 6 freq
five-baurred (11) - 2 freq
throu-beirin (11) - 1 freq
throu-bearin (11) - 29 freq
toun-bred (11) - 3 freq
ploom-coloured (11) - 3 freq
storm-cloured (11) - 1 freq
two-bedroomed (12) - 1 freq
throsslebaird (12) - 9 freq
trembled (12) - 9 freq
tumbled (12) - 9 freq
white-breid (12) - 1 freq
home-brew (12) - 2 freq
haun-barrae (12) - 1 freq
threi-legged (12) - 2 freq
thoum-nail (12) - 1 freq
time-silvered (12) - 1 freq
three-tiered (12) - 2 freq
hauf-stirred (12) - 1 freq
shambled (12) - 2 freq
home-baked (12) - 1 freq
SoundEx code - T514
tumbling - 1 freq
temples - 10 freq
thimble - 13 freq
tumbl't - 1 freq
tounfolk - 2 freq
tumblin - 5 freq
tumblers - 4 freq
tumble - 9 freq
temple - 87 freq
toonfolk - 1 freq
temple' - 2 freq
tuinfully - 1 freq
tinfoyle - 1 freq
tumbler - 4 freq
thoom-blakked - 1 freq
thoom-blurried - 1 freq
tumbles - 3 freq
tumbled - 9 freq
templepethrick - 1 freq
tam-fools - 1 freq
tumblt - 2 freq
tin-foil - 1 freq
templates - 2 freq
tombola - 2 freq
tumblan - 2 freq
tumflre - 1 freq
template - 6 freq
toonfolk's - 1 freq
templeton - 2 freq
templeton's - 1 freq
thimbles - 2 freq
templ - 2 freq
tenable - 1 freq
tinfoil - 2 freq
‘tumble - 1 freq
templetons - 2 freq
tuneful - 1 freq
tomwapowell - 1 freq
tomfletcher - 1 freq
timeflies - 1 freq
tenpole - 1 freq
MetaPhone code - 0MBLRT
thoom-blurried - 1 freq
Manually curated - THOOM-BLURRIED
Time to execute Levenshtein function - 0.248796 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
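As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code behind this page; the function name `levenshtein` and the small candidate list are chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the prefix of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


# Rank a few candidate words by their distance from the query word.
query = "thoom-blurried"
candidates = ["blurred", "thoom-blakked", "thoom-blurried"]
ranked = sorted(candidates, key=lambda word: levenshtein(query, word))
for word in ranked:
    print(word, levenshtein(query, word))
```

With the query thoom-blurried this gives 0 for the word itself, 4 for thoom-blakked and 7 for blurred, matching the figures in the Levenshtein list.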
Time to execute Double Levenshtein function - 0.450406 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
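The two-pass idea can be sketched in Python as follows. This is an illustrative reconstruction, not the page's own code: the names `strip_vowels` and `double_levenshtein` are made up for the example, and treating only a, e, i, o and u as vowels is an assumption (the page does not say how y or accented letters are handled).

```python
VOWELS = set("aeiou")  # assumption: the exact vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, leaving the consonants and any punctuation."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


print(double_levenshtein("thoom-blurried", "thoom-blakked"))  # 4 + 2 = 6
print(double_levenshtein("thoom-blurried", "blurred"))        # 7 + 4 = 11
```

Under these assumptions the sketch reproduces the scores shown in the Double Levenshtein list for thoom-blakked (6) and blurred (11).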
Time to execute SoundEx function - 0.028501 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
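For illustration, a compact Python sketch of the classic (American) Soundex rules follows. It may differ in edge cases from whatever implementation this page calls; in particular, skipping hyphens and apostrophes before encoding is an assumption made here.

```python
# Consonant groups that share a Soundex digit.
CODES = {}
for digit, letters in enumerate(("bfpv", "cgjkqsxz", "dt", "l", "mn", "r"), start=1):
    for letter in letters:
        CODES[letter] = str(digit)


def soundex(word: str) -> str:
    """Return the first letter plus three digits, e.g. 'T514'."""
    # Assumption: anything that is not a-z (hyphens, apostrophes) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


for word in ["thoom-blurried", "temple", "tounfolk", "tenable"]:
    print(word, soundex(word))
```

All four sample words come out as T514, which is why they share one SoundEx group in the list above even though they look quite different on the page.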
Time to execute MetaPhone function - 0.051404 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000951 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
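A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical, invented only to show the shape of such a table; they are not taken from the actual lexicon, and the name `lookup_lemma` is likewise an assumption.

```python
from typing import Optional

# Hypothetical entries: spelling variant -> lemma.
LEXICON = {
    "throu-beirin": "throu-bearin",
    "toonfolk": "tounfolk",
    "tumblt": "tumble",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Toonfolk"))
print(lookup_lemma("thoom-blurried"))
```

A dictionary lookup takes roughly constant time, which fits with this method having the shortest execution time of the five. Returning None for an uncovered word is a choice made for this sketch; how the page itself reports a miss is not documented here.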