A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to dream in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
dream (0) - 259 freq
d-ream (1) - 1 freq
bream (1) - 2 freq
dreame (1) - 6 freq
dreem (1) - 3 freq
cream (1) - 139 freq
dreamy (1) - 10 freq
dram (1) - 115 freq
drem (1) - 7 freq
ream (1) - 4 freq
drear (1) - 10 freq
dream' (1) - 1 freq
dreym (1) - 9 freq
dreamt (1) - 38 freq
dreams (1) - 153 freq
dread (1) - 34 freq
drems (2) - 1 freq
reat (2) - 1 freq
deas (2) - 1 freq
reaf (2) - 1 freq
dreip (2) - 2 freq
areas (2) - 94 freq
draem (2) - 15 freq
daem (2) - 1 freq
dreams' (2) - 1 freq
Double Levenshtein
dream (0) - 259 freq
drem (1) - 7 freq
dreym (1) - 9 freq
dram (1) - 115 freq
dreamy (1) - 10 freq
dreame (1) - 6 freq
dreem (1) - 3 freq
drame (2) - 23 freq
drum (2) - 72 freq
draem (2) - 15 freq
draim (2) - 5 freq
dread (2) - 34 freq
drama (2) - 95 freq
dreme (2) - 5 freq
druim (2) - 1 freq
d-ream (2) - 1 freq
dreams (2) - 153 freq
bream (2) - 2 freq
cream (2) - 139 freq
ream (2) - 4 freq
dream' (2) - 1 freq
drear (2) - 10 freq
dreamt (2) - 38 freq
dreean (3) - 1 freq
fram (3) - 1 freq
SoundEx code - D650
dream - 259 freq
droon - 42 freq
drawn - 90 freq
daurin - 10 freq
dreamy - 10 freq
drawin - 119 freq
droun - 7 freq
dern - 21 freq
drame - 23 freq
dram - 115 freq
drum - 72 freq
drain - 30 freq
drama - 95 freq
draan - 23 freq
daurna - 24 freq
durin - 183 freq
darn - 3 freq
darnae - 5 freq
dreean - 1 freq
drama' - 1 freq
dryin - 38 freq
drewn - 2 freq
darin - 5 freq
daurnae - 4 freq
drone - 19 freq
dreame - 6 freq
dreem - 3 freq
dryen - 1 freq
darenae - 4 freq
droney - 2 freq
'dorian - 1 freq
dorian - 32 freq
'dorian' - 1 freq
dreme - 5 freq
dryin' - 1 freq
darwin - 3 freq
druim - 1 freq
darien - 8 freq
draa'in - 1 freq
durham - 11 freq
drome - 1 freq
draain - 15 freq
draim - 5 freq
doreen - 3 freq
darren - 8 freq
dreym - 9 freq
draem - 15 freq
drem - 7 freq
draaeen - 4 freq
duran - 3 freq
darena - 3 freq
dryan - 2 freq
draawn - 3 freq
draa-an - 1 freq
drawin' - 2 freq
durin' - 2 freq
drane - 1 freq
drywyn - 1 freq
draen - 1 freq
dream' - 1 freq
dauran - 1 freq
d'aran - 1 freq
daarna - 1 freq
dairyin - 1 freq
“drama - 1 freq
dreein - 1 freq
draawin - 2 freq
drown - 2 freq
'drain - 1 freq
dorine - 2 freq
drien - 1 freq
daarin - 1 freq
“druim - 1 freq
derrenm - 1 freq
driyin - 1 freq
dooron - 5 freq
dra’n - 1 freq
d-ream - 1 freq
darrn - 1 freq
drmo - 3 freq
durin’ - 1 freq
MetaPhone code - TRM
term - 99 freq
dream - 259 freq
dreamy - 10 freq
trym - 1 freq
drame - 23 freq
dram - 115 freq
tearoom - 3 freq
drum - 72 freq
drama - 95 freq
trim - 16 freq
trauma - 9 freq
tram - 23 freq
drama' - 1 freq
dreame - 6 freq
dreem - 3 freq
dreme - 5 freq
druim - 1 freq
trauma' - 1 freq
drome - 1 freq
draim - 5 freq
dreym - 9 freq
draem - 15 freq
drem - 7 freq
tea-room - 4 freq
dream' - 1 freq
tairm - 13 freq
“drama - 1 freq
“trum - 1 freq
tarim - 1 freq
“druim - 1 freq
d-ream - 1 freq
drmo - 3 freq
Manually curated lemma - DREAM
Time to execute Levenshtein function - 0.204745 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
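The definition above can be sketched in Python as follows. This is a minimal illustration of the standard dynamic-programming approach, not the code behind this page; the function name `levenshtein` is chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits needed to turn a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


for word in ["dream", "dreem", "dreams", "draem", "areas"]:
    print(word, levenshtein("dream", word))
```

Run against a few words from the listing, this gives 0, 1, 1, 2 and 2, which agrees with the bracketed distances shown. Note that swapping two adjacent letters (dream, draem) costs 2, because plain Levenshtein has no transposition operation.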
Time to execute Double Levenshtein function - 0.390089 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
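A rough Python sketch of that two-pass scoring is given below. The names `double_levenshtein` and `strip_vowels` are invented for the example, and the vowel set is an assumption: y is counted as a vowel here because that reproduces the listed score of 1 for dreym.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace all cost 1)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Remove vowels, leaving the consonant skeleton (dream -> drm)."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["drem", "dreym", "dread", "dreams", "fram"]:
    print(word, double_levenshtein("dream", word))
```

With these assumptions the scores come out as 1, 1, 2, 2 and 3, matching the Double Levenshtein listing. A vowel-only change such as drem keeps its single-pass score, while a consonant change such as dread is counted in both passes.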
Time to execute SoundEx function - 0.031329 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
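For illustration, here is a compact Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse repeated codes, and pad to four characters. It is not the function used by this page, and the name `soundex` is chosen for the example.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
_GROUPS = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
           ("l", "4"), ("mn", "5"), ("r", "6"))
_CODES = {letter: digit for letters, digit in _GROUPS for letter in letters}


def soundex(word: str) -> str:
    """Four-character Soundex code, e.g. dream -> D650."""
    # Ignore anything that is not a plain a-z letter (hyphens, apostrophes).
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last_digit = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = _CODES.get(ch, "")
        if digit and digit != last_digit:
            code += digit
        last_digit = digit  # a vowel resets this, so a repeat is coded again
    return (code + "000")[:4]


for word in ["dream", "droon", "durham", "d-ream"]:
    print(word, soundex(word))
```

All four words print D650, the code shown for dream in the listing. Because only the first letter and the next three consonant classes survive, quite distant words such as durham and darwin land in the same group.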
Time to execute MetaPhone function - 0.039090 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
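Whichever phonetic encoder is used, matching comes down to grouping words by their code and returning the words that share the query's code. The Python sketch below shows only that grouping step. It does not implement Metaphone: `encode` is a hypothetical stand-in backed by `LISTED_CODES`, a small table of codes copied from the listing rather than computed, and the function names are invented for the example.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Optional

Encoder = Callable[[str], Optional[str]]


def build_phonetic_index(words: Iterable[str], encode: Encoder) -> Dict[str, List[str]]:
    """Group words under the phonetic code the encoder assigns to them."""
    index: Dict[str, List[str]] = defaultdict(list)
    for word in words:
        code = encode(word)
        if code:  # skip words the encoder cannot handle
            index[code].append(word)
    return dict(index)


def similar_sounding(word: str, index: Dict[str, List[str]], encode: Encoder) -> List[str]:
    """Every indexed word that shares the query word's code."""
    return [other for other in index.get(encode(word), []) if other != word]


# Hypothetical stand-in for a real Metaphone encoder: codes taken from the listing.
LISTED_CODES = {"dream": "TRM", "term": "TRM", "tram": "TRM", "drum": "TRM"}
encode = LISTED_CODES.get

index = build_phonetic_index(["term", "dream", "tram", "drum"], encode)
print(similar_sounding("dream", index, encode))  # ['term', 'tram', 'drum']
```

The listing gives TRM as the code for dream, so d and t share a symbol, which is why term and tram appear among its neighbours.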
Time to execute Manually curated function - 0.000936 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
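In code, a lexicon of this kind amounts to a dictionary lookup that may come back empty. The Python sketch below is illustrative only: `HYPOTHETICAL_LEXICON` and `lookup_lemma` are invented names, and apart from dream mapping to DREAM (shown in the results above) the entries are made up for the example and are not taken from the real table.

```python
from typing import Dict, Optional

# Hypothetical entries: only "dream" -> "DREAM" is shown on this page.
HYPOTHETICAL_LEXICON: Dict[str, str] = {
    "dream": "DREAM",
    "dreem": "DREAM",  # assumed spelling variant, for illustration
    "draem": "DREAM",  # assumed spelling variant, for illustration
}


def lookup_lemma(word: str, lexicon: Dict[str, str] = HYPOTHETICAL_LEXICON) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return lexicon.get(word.strip().lower())


print(lookup_lemma("Dream"))   # DREAM
print(lookup_lemma("sonsie"))  # None, because this sample table has no entry for it
```

A single dictionary access is consistent with this method having the shortest execution time of the five, though its coverage is limited to whatever has been entered by hand.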