A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dunollie in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dunollie (0) - 6 freq
dunkellie (2) - 8 freq
dollie (2) - 1 freq
sullie (3) - 2 freq
finallie (3) - 1 freq
nellie (3) - 42 freq
unrulie (3) - 1 freq
dunlin (3) - 2 freq
snorlie (3) - 1 freq
dunblane (3) - 4 freq
collie (3) - 27 freq
ollie (3) - 7 freq
mollie (3) - 16 freq
uillie (3) - 1 freq
dunkle (3) - 1 freq
dunkie (3) - 5 freq
dollies (3) - 4 freq
runklie (3) - 1 freq
dunli (3) - 1 freq
rollie (3) - 4 freq
follie (3) - 8 freq
gullie (3) - 9 freq
wullie (3) - 422 freq
wulllie (3) - 1 freq
dullies (3) - 2 freq
dunollie (0) - 6 freq
dollie (3) - 1 freq
dunkellie (3) - 8 freq
fynallie (4) - 1 freq
duanalla (4) - 1 freq
drillie (4) - 1 freq
dallie (4) - 13 freq
tonallie (4) - 1 freq
dunkle (4) - 1 freq
donella (4) - 5 freq
dunli (4) - 1 freq
finallie (4) - 1 freq
danielle (4) - 1 freq
dunlin (4) - 2 freq
nellie (4) - 42 freq
dull (5) - 118 freq
du'll (5) - 37 freq
dounlade (5) - 3 freq
dounluik (5) - 1 freq
droll (5) - 13 freq
doolie (5) - 3 freq
dunderie (5) - 1 freq
dounlaid (5) - 1 freq
duponline (5) - 1 freq
dunneil (5) - 1 freq
SoundEx code - D540
doonhill - 6 freq
donal - 7 freq
denial - 14 freq
dinah'll - 9 freq
daniel - 128 freq
doon-low - 1 freq
demmle - 2 freq
'daniel - 1 freq
dwinnle - 1 freq
doonlow - 1 freq
dunloy - 4 freq
dinnle - 2 freq
dam'll - 1 freq
dwamly - 1 freq
donnal - 1 freq
danielle - 1 freq
downhill - 1 freq
demmel - 1 freq
dounhill - 1 freq
dunneil - 1 freq
dumela - 1 freq
danelaw - 3 freq
dhomnuill - 1 freq
dimly - 3 freq
donella - 5 freq
dinnil - 1 freq
€™dinnle - 1 freq
€˜daniel - 7 freq
donnell - 1 freq
donnelly - 1 freq
dunollie - 6 freq
downhall - 2 freq
daniella - 1 freq
duanalla - 1 freq
domnhall - 1 freq
dunli - 1 freq
daniela - 2 freq
MetaPhone code - TNL
donal - 7 freq
tunnel - 87 freq
denial - 14 freq
dinah'll - 9 freq
daniel - 128 freq
doon-low - 1 freq
'daniel - 1 freq
doonlow - 1 freq
dunloy - 4 freq
dinnle - 2 freq
tenniel - 2 freq
tonal - 8 freq
donnal - 1 freq
danielle - 1 freq
toenail - 1 freq
dunneil - 1 freq
danelaw - 3 freq
tonallie - 1 freq
donella - 5 freq
towneley - 1 freq
dunghill - 1 freq
tonnel - 1 freq
dinnil - 1 freq
€™dinnle - 1 freq
ten'll - 1 freq
€˜daniel - 7 freq
donnell - 1 freq
donnelly - 1 freq
dunollie - 6 freq
daniella - 1 freq
duanalla - 1 freq
tanle - 1 freq
dunli - 1 freq
daniela - 2 freq
DUNOLLIE
Time to execute Levenshtein function - 0.236815 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.386535 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028817 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037750 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000859 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.