A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to durham in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
durham (0) - 11 freq
burnham (2) - 4 freq
dream (2) - 251 freq
duran (2) - 3 freq
dura (2) - 1 freq
dram (2) - 110 freq
wurhae (2) - 1 freq
d-ream (2) - 1 freq
eduroam (2) - 1 freq
fulham (2) - 1 freq
quham (2) - 1 freq
surname (3) - 12 freq
muchas (3) - 1 freq
jura (3) - 3 freq
mural (3) - 1 freq
draan (3) - 23 freq
letham (3) - 6 freq
dashcam (3) - 1 freq
hurtan (3) - 1 freq
dirdan (3) - 1 freq
drhue (3) - 1 freq
druim (3) - 1 freq
pram (3) - 18 freq
duina (3) - 6 freq
drear (3) - 10 freq
durham (0) - 11 freq
eduroam (3) - 1 freq
dram (3) - 110 freq
dream (3) - 251 freq
drhue (4) - 1 freq
drah (4) - 1 freq
dreame (4) - 6 freq
drem (4) - 7 freq
dirdum (4) - 20 freq
drum (4) - 72 freq
druim (4) - 1 freq
dreamy (4) - 9 freq
draem (4) - 15 freq
drama (4) - 95 freq
drame (4) - 23 freq
dreym (4) - 9 freq
dreem (4) - 3 freq
draim (4) - 5 freq
abraham (4) - 53 freq
graham (4) - 38 freq
wurhae (4) - 1 freq
fulham (4) - 1 freq
quham (4) - 1 freq
rhum (4) - 2 freq
dura (4) - 1 freq
SoundEx code - D650
dream - 251 freq
droon - 41 freq
drawn - 88 freq
daurin - 10 freq
dreamy - 9 freq
drawin - 113 freq
droun - 7 freq
dern - 21 freq
drame - 23 freq
dram - 110 freq
drum - 72 freq
drain - 30 freq
drama - 95 freq
draan - 23 freq
daurna - 24 freq
durin - 181 freq
darn - 3 freq
darnae - 5 freq
dreean - 1 freq
drama' - 1 freq
dryin - 37 freq
drewn - 2 freq
darin - 5 freq
daurnae - 4 freq
drone - 17 freq
dreame - 6 freq
dreem - 3 freq
dryen - 1 freq
darenae - 4 freq
darwin - 3 freq
druim - 1 freq
darien - 8 freq
draa'in - 1 freq
durham - 11 freq
drome - 1 freq
draain - 15 freq
draim - 5 freq
doreen - 3 freq
darren - 8 freq
dreym - 9 freq
draem - 15 freq
drem - 7 freq
dorian - 2 freq
draaeen - 4 freq
duran - 3 freq
darena - 3 freq
dryan - 2 freq
draawn - 3 freq
draa-an - 1 freq
drawin' - 2 freq
durin' - 2 freq
drane - 1 freq
drywyn - 1 freq
draen - 1 freq
dream' - 1 freq
dreme - 2 freq
dauran - 1 freq
d'aran - 1 freq
daarna - 1 freq
dairyin - 1 freq
€œdrama - 1 freq
dreein - 1 freq
draawin - 2 freq
drown - 2 freq
'drain - 1 freq
dorine - 2 freq
drien - 1 freq
daarin - 1 freq
€œdruim - 1 freq
derrenm - 1 freq
driyin - 1 freq
dooron - 5 freq
draÂ’n - 1 freq
d-ream - 1 freq
darrn - 1 freq
drmo - 3 freq
durinÂ’ - 1 freq
MetaPhone code - TRHM
durham - 11 freq
DURHAM
Time to execute Levenshtein function - 0.182669 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.342125 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027518 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037544 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000863 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.