A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to drama in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
drama (0) - 95 freq
rama (1) - 3 freq
dram (1) - 110 freq
drame (1) - 23 freq
draa (1) - 51 freq
dramas (1) - 6 freq
drama' (1) - 1 freq
drams (1) - 28 freq
draims (2) - 2 freq
data (2) - 68 freq
draas (2) - 12 freq
dreamy (2) - 9 freq
damoa (2) - 7 freq
fraa (2) - 2 freq
drat (2) - 2 freq
dramin (2) - 1 freq
draim (2) - 5 freq
dreme (2) - 2 freq
dram's (2) - 4 freq
prima (2) - 7 freq
draps (2) - 61 freq
ramah (2) - 1 freq
orams (2) - 1 freq
dwams (2) - 17 freq
dame (2) - 16 freq
drama (0) - 95 freq
dram (1) - 110 freq
drame (1) - 23 freq
dreme (2) - 2 freq
dream (2) - 251 freq
drome (2) - 1 freq
drum (2) - 72 freq
drem (2) - 7 freq
drmo (2) - 3 freq
dreame (2) - 6 freq
draim (2) - 5 freq
draem (2) - 15 freq
dreamy (2) - 9 freq
drams (2) - 28 freq
rama (2) - 3 freq
drama' (2) - 1 freq
dramas (2) - 6 freq
draa (2) - 51 freq
dreaman (3) - 2 freq
drag (3) - 43 freq
dwam (3) - 101 freq
drake (3) - 6 freq
drames (3) - 7 freq
damo (3) - 10 freq
dam (3) - 28 freq
SoundEx code - D650
dream - 251 freq
droon - 41 freq
drawn - 88 freq
daurin - 10 freq
dreamy - 9 freq
drawin - 113 freq
droun - 7 freq
dern - 21 freq
drame - 23 freq
dram - 110 freq
drum - 72 freq
drain - 30 freq
drama - 95 freq
draan - 23 freq
daurna - 24 freq
durin - 181 freq
darn - 3 freq
darnae - 5 freq
dreean - 1 freq
drama' - 1 freq
dryin - 37 freq
drewn - 2 freq
darin - 5 freq
daurnae - 4 freq
drone - 17 freq
dreame - 6 freq
dreem - 3 freq
dryen - 1 freq
darenae - 4 freq
darwin - 3 freq
druim - 1 freq
darien - 8 freq
draa'in - 1 freq
durham - 11 freq
drome - 1 freq
draain - 15 freq
draim - 5 freq
doreen - 3 freq
darren - 8 freq
dreym - 9 freq
draem - 15 freq
drem - 7 freq
dorian - 2 freq
draaeen - 4 freq
duran - 3 freq
darena - 3 freq
dryan - 2 freq
draawn - 3 freq
draa-an - 1 freq
drawin' - 2 freq
durin' - 2 freq
drane - 1 freq
drywyn - 1 freq
draen - 1 freq
dream' - 1 freq
dreme - 2 freq
dauran - 1 freq
d'aran - 1 freq
daarna - 1 freq
dairyin - 1 freq
€œdrama - 1 freq
dreein - 1 freq
draawin - 2 freq
drown - 2 freq
'drain - 1 freq
dorine - 2 freq
drien - 1 freq
daarin - 1 freq
€œdruim - 1 freq
derrenm - 1 freq
driyin - 1 freq
dooron - 5 freq
draÂ’n - 1 freq
d-ream - 1 freq
darrn - 1 freq
drmo - 3 freq
durinÂ’ - 1 freq
MetaPhone code - TRM
term - 98 freq
dream - 251 freq
dreamy - 9 freq
trym - 1 freq
drame - 23 freq
dram - 110 freq
tearoom - 3 freq
drum - 72 freq
drama - 95 freq
trim - 15 freq
trauma - 8 freq
tram - 16 freq
drama' - 1 freq
dreame - 6 freq
dreem - 3 freq
druim - 1 freq
trauma' - 1 freq
drome - 1 freq
draim - 5 freq
dreym - 9 freq
draem - 15 freq
drem - 7 freq
tea-room - 4 freq
dream' - 1 freq
dreme - 2 freq
tairm - 13 freq
€œdrama - 1 freq
€œtrum - 1 freq
tarim - 1 freq
€œdruim - 1 freq
d-ream - 1 freq
drmo - 3 freq
DRAMA
Time to execute Levenshtein function - 0.200157 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.380778 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032267 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038290 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000867 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.