A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dheireadh in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dheireadh (0) - 1 freq
heired (3) - 3 freq
hearadh (3) - 1 freq
heitath (4) - 1 freq
shired' (4) - 1 freq
chaired (4) - 6 freq
thread (4) - 16 freq
heird (4) - 1 freq
cheilidh (4) - 2 freq
heared (4) - 14 freq
eireann (4) - 1 freq
aireamh (4) - 2 freq
heiled (4) - 1 freq
decreed (4) - 9 freq
ahready (4) - 4 freq
chreach (4) - 1 freq
hered (4) - 2 freq
shired (4) - 5 freq
there'd (4) - 21 freq
cheerleads (4) - 1 freq
threads (4) - 8 freq
deived (4) - 1 freq
aweready (4) - 3 freq
owerreach (4) - 1 freq
desired (4) - 12 freq
dheireadh (0) - 1 freq
hearadh (4) - 1 freq
adhered (5) - 2 freq
heired (5) - 3 freq
degrade (6) - 1 freq
dread (6) - 34 freq
desired (6) - 12 freq
threads (6) - 8 freq
rhuaridh (6) - 1 freq
dhtreds (6) - 8 freq
sheared (6) - 6 freq
whered (6) - 1 freq
heered (6) - 2 freq
cheered (6) - 27 freq
shaired (6) - 3 freq
dreided (6) - 4 freq
hired (6) - 27 freq
shired (6) - 5 freq
cheilidh (6) - 2 freq
heared (6) - 14 freq
heird (6) - 1 freq
thread (6) - 16 freq
shired' (6) - 1 freq
chaired (6) - 6 freq
haired (6) - 7 freq
SoundEx code - D630
drouthy - 17 freq
dirt - 70 freq
dirty - 107 freq
driet - 2 freq
dreid - 51 freq
draaed - 1 freq
dried - 74 freq
durty - 11 freq
dairt - 6 freq
drouth - 49 freq
dorty - 7 freq
dread - 34 freq
doort - 1 freq
drooth - 20 freq
door-ti - 1 freq
daured - 24 freq
draw'd - 1 freq
dirt' - 1 freq
dorothy - 45 freq
dared - 11 freq
drouthie - 12 freq
droothie - 1 freq
dearth - 9 freq
dird - 2 freq
dreed - 13 freq
dryed - 4 freq
dirtie - 1 freq
droid - 1 freq
dart - 16 freq
dorito - 1 freq
drawed - 1 freq
drite - 1 freq
daurt - 6 freq
draa'd - 3 freq
durt - 2 freq
drat - 2 freq
dry't - 2 freq
daurd - 1 freq
droothy - 4 freq
druith - 1 freq
dorty-wye - 1 freq
daared - 2 freq
durtie - 2 freq
dort - 2 freq
derth - 2 freq
drowt - 1 freq
dredd - 6 freq
draed - 1 freq
darth - 3 freq
derrida - 1 freq
derd - 1 freq
dortie - 3 freq
dheireadh - 1 freq
druid - 2 freq
'dreid' - 1 freq
daar't - 1 freq
€˜dread - 1 freq
dairth - 1 freq
dorado - 1 freq
dryte - 1 freq
€œdorothy - 1 freq
dee-haird - 1 freq
dae-or-dee - 1 freq
€˜dirty - 1 freq
doherty - 1 freq
droit - 1 freq
drowth - 1 freq
MetaPhone code - THRT
dheireadh - 1 freq
dee-haird - 1 freq
doherty - 1 freq
DHEIREADH
Time to execute Levenshtein function - 0.236231 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.399680 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028160 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038729 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000859 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.