A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to srsdr in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
srsdr (0) - 1 freq
ssd (2) - 12 freq
worser (3) - 3 freq
shed (3) - 160 freq
smaar (3) - 2 freq
sed (3) - 175 freq
swaar (3) - 1 freq
stir (3) - 19 freq
soda (3) - 20 freq
sidda (3) - 1 freq
loser (3) - 4 freq
stuir (3) - 4 freq
rd (3) - 64 freq
shady (3) - 7 freq
src (3) - 1 freq
snide (3) - 3 freq
sids (3) - 1 freq
'rid (3) - 2 freq
friar (3) - 1 freq
bryde (3) - 2 freq
siner (3) - 1 freq
trader (3) - 9 freq
rude (3) - 37 freq
urss (3) - 1 freq
re'r (3) - 1 freq

Double Levenshtein (distance in brackets)
srsdr (0) - 1 freq
ssd (4) - 12 freq
sortir (5) - 1 freq
ussr (5) - 3 freq
sars (5) - 1 freq
surfer (5) - 1 freq
arsed (5) - 19 freq
cruder (5) - 1 freq
serss (5) - 1 freq
spider (5) - 30 freq
ruder (5) - 2 freq
ersed (5) - 6 freq
serd (5) - 1 freq
vrsdin (5) - 1 freq
sers (5) - 11 freq
dsdur (5) - 1 freq
syder (5) - 5 freq
sunder (5) - 1 freq
sundry (5) - 5 freq
sard (5) - 3 freq
sinder (5) - 13 freq
crusader (5) - 3 freq
surinder (5) - 9 freq
ryder (5) - 1 freq
solder (5) - 2 freq
SoundEx code - S623
shrugged - 47 freq
skraiked - 40 freq
skraiched - 53 freq
skyrocket - 2 freq
shrieked - 5 freq
scree-staned - 1 freq
scragged - 1 freq
serecht-forrit - 1 freq
skreicht - 7 freq
skraicht - 2 freq
scooriest - 1 freq
sky-rocket - 3 freq
skreichd - 1 freq
s'awright - 1 freq
searched - 18 freq
soorest - 2 freq
skrekked - 2 freq
scraiched - 5 freq
soorcit - 1 freq
scarcity - 2 freq
scorched - 5 freq
scursed - 1 freq
shruggit - 4 freq
skreiched - 16 freq
sawright - 3 freq
screiched - 3 freq
scorcht - 1 freq
scraicht - 9 freq
shrieketh - 1 freq
sorriest - 1 freq
scrieched - 4 freq
serieched - 1 freq
shargit - 2 freq
screcked - 3 freq
scraik't - 1 freq
serssit - 2 freq
surrogats - 1 freq
sairched - 1 freq
shark-eyed - 1 freq
screeched - 7 freq
sweirest - 2 freq
sairest - 4 freq
skraichit - 1 freq
shrugd - 1 freq
surest - 1 freq
seraicht - 2 freq
scraacht - 1 freq
screechit - 1 freq
shoregait - 2 freq
skrougit - 2 freq
soor-sweet - 1 freq
skyriest - 1 freq
scariest - 1 freq
surged - 1 freq
scraggit - 1 freq
sair-wechtit - 1 freq
screicht - 1 freq
sharged - 1 freq
sairkyte - 1 freq
scarycath - 5 freq
seawrightdaniel - 26 freq
sirsidneyp - 1 freq
srsdr - 1 freq
skyrocketed - 1 freq
s’awright - 1 freq
swrestling - 1 freq
sirscottyoung - 1 freq
skreighed - 1 freq
sharktrustuk - 1 freq
MetaPhone code - SRSTR
srsdr - 1 freq
Manually curated
SRSDR
Time to execute Levenshtein function - 0.280031 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
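
As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("srsdr", "ssd"))     # 2
print(levenshtein("srsdr", "worser"))  # 3
```

The two printed values agree with the distances shown for ssd and worser in the results list.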
Time to execute Double Levenshtein function - 0.465464 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
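
The idea can be sketched in Python as follows. This is an illustration under stated assumptions, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented here, and the vowel set `aeiouy` is a guess (treating y as a vowel happens to reproduce the scores listed above, such as sundry at 5).

```python
VOWELS = set("aeiouy")  # assumption: the exact vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance using two rolling rows."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # deletion
                current[j - 1] + 1,            # insertion
                previous[j - 1] + (ca != cb),  # replacement or match
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("srsdr", "ssd"))       # 4
print(double_levenshtein("srsdr", "crusader"))  # 5
```

Because consonants are counted in both passes and vowels in only one, a vowel-only difference costs half as much as a consonant difference.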
Time to execute SoundEx function - 0.032219 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
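
For readers who want to see how such a code is derived, below is a minimal Python sketch of the common American Soundex rules (keep the first letter, map consonants to digits, collapse repeats, pad to four characters). It is an illustrative implementation with a made-up function name, `soundex`, and may differ in edge cases from the function this site calls.

```python
# Digit groups used by American Soundex; vowels, h, w and y carry no digit.
CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last_code = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last_code:
            result += code
        last_code = code  # a vowel resets this, so a repeated digit can follow
        if len(result) == 4:
            break
    return result.ljust(4, "0")


print(soundex("srsdr"))     # S623
print(soundex("shrugged"))  # S623
```

Both words reduce to S623, which is why shrugged appears among the SoundEx matches for srsdr.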
Time to execute MetaPhone function - 0.041607 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000932 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
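
A lookup of this kind can be pictured as a simple dictionary from word form to lemma. The Python sketch below is purely illustrative: the table entries, the name `LEMMA_TABLE` and the function `curated_lemma` are hypothetical and are not taken from the site's actual lexicon.

```python
from typing import Optional

# Hypothetical entries for illustration only; the real lexicon is hand-curated.
LEMMA_TABLE = {
    "skraiched": "skraich",
    "skraiked": "skraich",
    "skreiched": "skraich",
}


def curated_lemma(word: str, table: dict = LEMMA_TABLE) -> Optional[str]:
    """Return the lemma for a word form, or None if the word is not covered."""
    return table.get(word.lower())


print(curated_lemma("Skraiked"))  # skraich
print(curated_lemma("srsdr"))     # None
```

A dictionary lookup takes constant time on average, which fits with this method being the fastest of the five timings reported above.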