A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to didn’t in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
didn’t (0) - 3 freq
didnt (2) - 3 freq
don’t (2) - 14 freq
didn't (2) - 39 freq
isn’t (2) - 2 freq
ain’t (2) - 1 freq
din't (3) - 4 freq
diidnt (3) - 1 freq
dinnit (3) - 1 freq
didnay (3) - 2 freq
fin’s (3) - 1 freq
won’t (3) - 5 freq
dir’s (3) - 2 freq
dingit (3) - 13 freq
can’t (3) - 11 freq
didna (3) - 1636 freq
dignit (3) - 1 freq
disn't (3) - 4 freq
wid’v (3) - 1 freq
'didn't (3) - 1 freq
min’s (3) - 2 freq
didno (3) - 17 freq
dic’s (3) - 1 freq
fin’ (3) - 1 freq
dinnet (3) - 1 freq
didn’t (0) - 3 freq
don’t (3) - 14 freq
doesn’t (4) - 7 freq
isn’t (4) - 2 freq
ain’t (4) - 1 freq
didnt (4) - 3 freq
didn't (4) - 39 freq
wasn’t (5) - 1 freq
“don’t (5) - 1 freq
can’t (5) - 11 freq
won’t (5) - 5 freq
diidnt (5) - 1 freq
dad’s (5) - 1 freq
aren’t (5) - 1 freq
ken’t (5) - 1 freq
divn't (6) - 4 freq
disnt (6) - 1 freq
didne (6) - 1 freq
ding't (6) - 2 freq
didn (6) - 25 freq
weren’t (6) - 1 freq
durin’ (6) - 1 freq
kiddin’ (6) - 1 freq
couldn’t (6) - 1 freq
damien’s (6) - 1 freq
SoundEx code - D353
didn't - 39 freq
detention - 10 freq
detained - 1 freq
'didn't - 1 freq
da-da-n'''da-da'n'da - 1 freq
deid-end - 1 freq
detentions - 1 freq
detonations - 1 freq
diidnt - 1 freq
didn’t - 3 freq
didnt - 3 freq
MetaPhone code - TTNT
didn't - 39 freq
detained - 1 freq
'didn't - 1 freq
titnt - 1 freq
deid-end - 1 freq
diidnt - 1 freq
didn’t - 3 freq
didnt - 3 freq
DIDN’T
Time to execute Levenshtein function - 0.797867 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.992324 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.036566 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.082292 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001189 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.