A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to shouldna in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
shouldna (0) - 32 freq
shouldn (1) - 3 freq
should'a (1) - 2 freq
shouldnt (1) - 1 freq
shouldno (1) - 1 freq
shuldna (1) - 2 freq
shoudna (1) - 6 freq
shouldne (1) - 1 freq
shouldni (1) - 1 freq
shoulda (1) - 29 freq
shouldnae (1) - 60 freq
shouldda (1) - 2 freq
soudna (2) - 2 freq
shudna (2) - 4 freq
shouldn't (2) - 5 freq
shoodna (2) - 3 freq
shoudnae (2) - 3 freq
couldna (2) - 414 freq
should' (2) - 1 freq
shouldat (2) - 1 freq
shuidna (2) - 16 freq
ecouldna (2) - 1 freq
should (2) - 907 freq
shoulder (2) - 32 freq
shouldae (2) - 1 freq
shouldna (0) - 32 freq
shuldna (1) - 2 freq
shouldne (1) - 1 freq
shouldno (1) - 1 freq
shouldnae (1) - 60 freq
shouldni (1) - 1 freq
shouldn (1) - 3 freq
shoulda (2) - 29 freq
shouldda (2) - 2 freq
should'a (2) - 2 freq
shoudna (2) - 6 freq
shouldnt (2) - 1 freq
shouldat (3) - 1 freq
shuidna (3) - 16 freq
should (3) - 907 freq
shoulder (3) - 32 freq
should' (3) - 1 freq
shoudnae (3) - 3 freq
shouldae (3) - 1 freq
shudna (3) - 4 freq
shoudno (3) - 2 freq
shoodna (3) - 3 freq
shoodnae (4) - 1 freq
shieldin (4) - 4 freq
shuld (4) - 29 freq
SoundEx code - S435
shouldna - 32 freq
shouldnae - 60 freq
shouldnae've - 2 freq
shouldn't - 5 freq
scoldin - 2 freq
skeletons - 11 freq
solution - 19 freq
sliding - 6 freq
seldom - 25 freq
slidin - 21 freq
sultana - 1 freq
sultanas - 1 freq
slyden - 1 freq
scauldin-hot - 1 freq
shouldno - 1 freq
skeleton - 8 freq
shieldin - 4 freq
shuldna - 2 freq
saltoun - 3 freq
salt-and-pepper - 2 freq
slideen - 1 freq
salutin - 1 freq
skeleton's - 1 freq
skiltin - 1 freq
solutions - 20 freq
slitten - 1 freq
slatin - 2 freq
sel-identification - 1 freq
€˜solution - 1 freq
sel-loathin - 1 freq
shouldni - 1 freq
skelton - 1 freq
shouldn - 3 freq
scaldin - 1 freq
shielding - 2 freq
shouldnÂ’t - 2 freq
sheldomni - 2 freq
slating - 1 freq
'skeleton - 1 freq
swallydooncally - 1 freq
shouldnt - 1 freq
sheelding - 1 freq
sheilatempleto - 1 freq
shouldne - 1 freq
MetaPhone code - XLTN
shouldna - 32 freq
shouldnae - 60 freq
shouldno - 1 freq
shieldin - 4 freq
shuldna - 2 freq
chilton - 1 freq
shouldni - 1 freq
shouldn - 3 freq
shouldne - 1 freq
SHOULDNA
should - 907 freq
should've - 18 freq
shouldna - 32 freq
shouldnae - 60 freq
shoulda - 29 freq
Time to execute Levenshtein function - 0.182091 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.347787 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030057 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038695 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000895 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.