A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to shouldnae in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
shouldnae (0) - 63 freq
shouldae (1) - 2 freq
shoudnae (1) - 3 freq
shouldne (1) - 1 freq
shouldna (1) - 32 freq
shoud'nae (2) - 1 freq
should've (2) - 18 freq
couldnae (2) - 623 freq
shudnae (2) - 30 freq
shouldni (2) - 1 freq
shouldnt (2) - 1 freq
shouldn (2) - 3 freq
ecouldnae (2) - 1 freq
shouldda (2) - 2 freq
shoudna (2) - 6 freq
shoodnae (2) - 1 freq
shouldat (2) - 1 freq
shoulda (2) - 30 freq
should'a (2) - 2 freq
shuldna (2) - 2 freq
shouldn't (2) - 7 freq
wouldnae (2) - 104 freq
shouldno (2) - 1 freq
shuidnae (2) - 11 freq
hudnae (3) - 62 freq
shouldnae (0) - 63 freq
shouldna (1) - 32 freq
shouldne (1) - 1 freq
shouldni (2) - 1 freq
shouldn (2) - 3 freq
shuldna (2) - 2 freq
shouldno (2) - 1 freq
shouldae (2) - 2 freq
shoudnae (2) - 3 freq
shoulda (3) - 30 freq
shouldat (3) - 1 freq
should'a (3) - 2 freq
shuidnae (3) - 11 freq
shoodnae (3) - 1 freq
shouldda (3) - 2 freq
shoudna (3) - 6 freq
shudnae (3) - 30 freq
shouldnt (3) - 1 freq
shidnae (4) - 1 freq
shoudno (4) - 2 freq
shoodna (4) - 3 freq
shudna (4) - 4 freq
shuidna (4) - 16 freq
should' (4) - 1 freq
shoulder (4) - 38 freq
SoundEx code - S435
shouldna - 32 freq
shouldnae - 63 freq
shouldnae've - 2 freq
shouldn't - 7 freq
scoldin - 2 freq
skeletons - 11 freq
solution - 19 freq
sliding - 6 freq
seldom - 27 freq
slidin - 26 freq
sultana - 1 freq
sultanas - 1 freq
slyden - 1 freq
scauldin-hot - 1 freq
shouldno - 1 freq
skeleton - 8 freq
shieldin - 4 freq
shuldna - 2 freq
saltoun - 3 freq
salt-and-pepper - 2 freq
slideen - 1 freq
salutin - 1 freq
skeleton's - 1 freq
skiltin - 1 freq
solutions - 20 freq
slitten - 1 freq
slatin - 2 freq
sel-identification - 1 freq
€˜solution - 1 freq
sel-loathin - 1 freq
shouldni - 1 freq
skelton - 1 freq
shouldn - 3 freq
scaldin - 1 freq
shielding - 2 freq
shouldnÂ’t - 2 freq
sheldomni - 2 freq
slating - 1 freq
'skeleton - 1 freq
swallydooncally - 1 freq
shouldnt - 1 freq
sheelding - 1 freq
sheilatempleto - 1 freq
shouldne - 1 freq
MetaPhone code - XLTN
shouldna - 32 freq
shouldnae - 63 freq
shouldno - 1 freq
shieldin - 4 freq
shuldna - 2 freq
chilton - 1 freq
shouldni - 1 freq
shouldn - 3 freq
shouldne - 1 freq
SHOULDNAE
should - 927 freq
should've - 18 freq
shouldna - 32 freq
shouldnae - 63 freq
shoulda - 30 freq
shuid - 367 freq
Time to execute Levenshtein function - 0.198468 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.398413 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027509 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037001 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000968 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.