A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to daeint in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
daeint (0) - 1 freq
daein' (1) - 14 freq
daein't (1) - 1 freq
daeins (1) - 125 freq
daein (1) - 860 freq
daeing (1) - 2 freq
daint (1) - 4 freq
dae'nt (1) - 1 freq
dairt (2) - 6 freq
daesant (2) - 1 freq
dei't (2) - 2 freq
dent (2) - 4 freq
dacant (2) - 1 freq
gaein (2) - 147 freq
daen' (2) - 1 freq
aeiot (2) - 1 freq
deesint (2) - 2 freq
daena (2) - 26 freq
dawins (2) - 1 freq
'daein' (2) - 1 freq
doesnt (2) - 2 freq
daelin (2) - 6 freq
laein (2) - 9 freq
dant (2) - 1 freq
dae't (2) - 7 freq
daeint (0) - 1 freq
daint (1) - 4 freq
dent (2) - 4 freq
daunt (2) - 3 freq
dant (2) - 1 freq
dainty (2) - 4 freq
deinty (2) - 1 freq
dint (2) - 10 freq
daein' (2) - 14 freq
daein't (2) - 1 freq
daeins (2) - 125 freq
daein (2) - 860 freq
dae'nt (2) - 1 freq
daeing (2) - 2 freq
aint (3) - 13 freq
dailt (3) - 1 freq
daetit (3) - 1 freq
davit (3) - 23 freq
dyein (3) - 1 freq
taint (3) - 2 freq
ahint (3) - 743 freq
daelt (3) - 2 freq
draint (3) - 5 freq
saint (3) - 42 freq
dain' (3) - 2 freq
SoundEx code - D530
don't - 560 freq
doomit - 2 freq
dunt - 92 freq
deen't - 3 freq
denied - 29 freq
damned - 39 freq
daint - 4 freq
deemed - 12 freq
dawned - 13 freq
dammit - 9 freq
dwyned - 9 freq
dwined - 18 freq
dimmed - 5 freq
dandy - 37 freq
'don't - 11 freq
doomed - 21 freq
dundee - 201 freq
din't - 4 freq
dined - 5 freq
damn't - 5 freq
donnt - 1 freq
'damned - 1 freq
dwinit - 4 freq
dawnit - 3 freq
daintie - 1 freq
dentie - 6 freq
deintie - 2 freq
dint - 10 freq
dont - 76 freq
dawn't - 3 freq
dandee - 1 freq
dammed - 2 freq
donned - 2 freq
dooned - 3 freq
downed - 5 freq
damnit - 5 freq
damnt - 15 freq
deemit - 2 freq
dwamed - 1 freq
denwette - 1 freq
daeint - 1 freq
dimwit - 2 freq
dae'nt - 1 freq
deein't - 1 freq
dant - 1 freq
'dundy' - 2 freq
denee'd - 2 freq
da-wind - 1 freq
donati - 3 freq
'dante - 1 freq
dante - 58 freq
dunnet - 1 freq
dainty - 4 freq
duimed - 2 freq
dwaamed - 2 freq
daunt - 3 freq
'dundee - 3 freq
dwaumed - 1 freq
dinned - 4 freq
dwynt - 1 freq
doant - 2 freq
domed - 2 freq
dent - 4 freq
denty - 6 freq
'don't' - 1 freq
damndee - 1 freq
dywned - 1 freq
damiotti - 2 freq
daein't - 1 freq
dounhaud - 1 freq
damt - 2 freq
deinty - 1 freq
dinnit - 1 freq
dunnit - 1 freq
€œdon't - 1 freq
damed - 1 freq
donate - 9 freq
€˜dundee - 1 freq
demit - 2 freq
deein-oot - 1 freq
dam't - 1 freq
daen-oot - 1 freq
dinda - 1 freq
dandie - 1 freq
dmÂ’d - 1 freq
donÂ’t - 14 freq
donut - 1 freq
dondy - 1 freq
dundeh - 1 freq
dinnet - 1 freq
“don’t - 1 freq
danite - 5 freq
dianeda - 1 freq
demmet - 1 freq
‘…denied - 1 freq
dunnett - 6 freq
dnd - 1 freq
MetaPhone code - TNT
don't - 560 freq
tuned - 8 freq
tynt - 12 freq
dunt - 92 freq
doughnut - 7 freq
tent - 458 freq
deen't - 3 freq
tend - 55 freq
denied - 29 freq
daint - 4 freq
dawned - 13 freq
tentie - 38 freq
dwyned - 9 freq
tint - 218 freq
dandy - 37 freq
'don't - 11 freq
dundee - 201 freq
din't - 4 freq
dined - 5 freq
donnt - 1 freq
teen't - 1 freq
tntae - 1 freq
tand - 1 freq
dawnit - 3 freq
daintie - 1 freq
dentie - 6 freq
deintie - 2 freq
dint - 10 freq
dont - 76 freq
tyned - 9 freq
dawn't - 3 freq
dandee - 1 freq
taunt - 2 freq
tnt' - 1 freq
tinned - 8 freq
tant - 1 freq
donned - 2 freq
dooned - 3 freq
downed - 5 freq
toonty - 2 freq
daeint - 1 freq
tned - 1 freq
dae'nt - 1 freq
deein't - 1 freq
dant - 1 freq
tinto - 3 freq
'dundy' - 2 freq
tint' - 3 freq
denee'd - 2 freq
donati - 3 freq
'dante - 1 freq
dante - 58 freq
dunnet - 1 freq
dainty - 4 freq
tuined - 1 freq
twyned - 3 freq
tunity - 1 freq
tinnd - 1 freq
toned - 1 freq
tonto - 5 freq
daunt - 3 freq
'dundee - 3 freq
tenty - 6 freq
dinned - 4 freq
dwynt - 1 freq
doant - 2 freq
taint - 2 freq
deigned - 1 freq
dent - 4 freq
denty - 6 freq
'don't' - 1 freq
tyne't - 1 freq
twynit - 2 freq
tanta - 1 freq
taen't - 1 freq
dywned - 1 freq
daein't - 1 freq
taand - 1 freq
deinty - 1 freq
tined - 2 freq
tanned - 10 freq
dinnit - 1 freq
tannoid - 1 freq
dunnit - 1 freq
€œdon't - 1 freq
donate - 9 freq
€˜dundee - 1 freq
deein-oot - 1 freq
daen-oot - 1 freq
dinda - 1 freq
taind - 1 freq
dandie - 1 freq
donÂ’t - 14 freq
donut - 1 freq
dondy - 1 freq
dundeh - 1 freq
dinnet - 1 freq
tonite - 6 freq
“don’t - 1 freq
danite - 5 freq
tnawdaw - 1 freq
taand' - 1 freq
dianeda - 1 freq
‘…denied - 1 freq
dunnett - 6 freq
dnd - 1 freq
DAEINT
Time to execute Levenshtein function - 0.183477 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.358422 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027547 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036970 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000823 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.