A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to seldom in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
seldom (0) - 25 freq
eldon (2) - 1 freq
sodom (2) - 9 freq
geldof (2) - 1 freq
welcom (2) - 1 freq
feydom (2) - 2 freq
selfou (2) - 1 freq
sullom (2) - 2 freq
gaeldom (2) - 1 freq
sloom (2) - 5 freq
soom (3) - 10 freq
selled (3) - 1 freq
edam (3) - 45 freq
sellin (3) - 70 freq
medow (3) - 1 freq
easedom (3) - 4 freq
seto (3) - 1 freq
aldo' (3) - 1 freq
sclim (3) - 25 freq
elo (3) - 2 freq
sermon (3) - 29 freq
selfs (3) - 3 freq
semoc (3) - 1 freq
bloom (3) - 27 freq
selkie (3) - 17 freq
seldom (0) - 25 freq
sullom (3) - 2 freq
gaeldom (3) - 1 freq
sodom (3) - 9 freq
sloom (3) - 5 freq
salome (4) - 1 freq
selsame (4) - 8 freq
slam (4) - 13 freq
saddam (4) - 1 freq
sld (4) - 9 freq
shadam (4) - 1 freq
idledom (4) - 1 freq
suld (4) - 11 freq
slum (4) - 5 freq
sold (4) - 23 freq
slalom (4) - 1 freq
slim (4) - 16 freq
solder (4) - 2 freq
aisedom (4) - 6 freq
siloam (4) - 2 freq
erldome (4) - 2 freq
easdom (4) - 1 freq
welcom (4) - 1 freq
feydom (4) - 2 freq
selfou (4) - 1 freq
SoundEx code - S435
shouldna - 32 freq
shouldnae - 60 freq
shouldnae've - 2 freq
shouldn't - 5 freq
scoldin - 2 freq
skeletons - 11 freq
solution - 19 freq
sliding - 6 freq
seldom - 25 freq
slidin - 21 freq
sultana - 1 freq
sultanas - 1 freq
slyden - 1 freq
scauldin-hot - 1 freq
shouldno - 1 freq
skeleton - 8 freq
shieldin - 4 freq
shuldna - 2 freq
saltoun - 3 freq
salt-and-pepper - 2 freq
slideen - 1 freq
salutin - 1 freq
skeleton's - 1 freq
skiltin - 1 freq
solutions - 20 freq
slitten - 1 freq
slatin - 2 freq
sel-identification - 1 freq
€˜solution - 1 freq
sel-loathin - 1 freq
shouldni - 1 freq
skelton - 1 freq
shouldn - 3 freq
scaldin - 1 freq
shielding - 2 freq
shouldnÂ’t - 2 freq
sheldomni - 2 freq
slating - 1 freq
'skeleton - 1 freq
swallydooncally - 1 freq
shouldnt - 1 freq
sheelding - 1 freq
sheilatempleto - 1 freq
shouldne - 1 freq
MetaPhone code - SLTM
seldom - 25 freq
SELDOM
Time to execute Levenshtein function - 0.187168 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337338 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027834 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037058 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000821 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.