A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to soorcit in Corpus

Methods compared: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein distance
soorcit (0) - 1 freq
soonit (2) - 1 freq
smoorit (2) - 5 freq
forcit (2) - 2 freq
scornit (2) - 1 freq
sortit (2) - 51 freq
snortit (2) - 16 freq
sookit (2) - 27 freq
coorit (2) - 2 freq
soorces (2) - 15 freq
soopit (2) - 3 freq
boordit (2) - 2 freq
moorit (2) - 8 freq
soondit (2) - 61 freq
soothit (2) - 1 freq
sorrit (2) - 4 freq
soorest (2) - 2 freq
scorcht (2) - 1 freq
soort (2) - 2 freq
soartit (2) - 1 freq
coortit (2) - 5 freq
soorce (2) - 17 freq
soorik (2) - 2 freq
swoopit (3) - 1 freq
sloppit (3) - 2 freq
Double Levenshtein distance
soorcit (0) - 1 freq
sorrit (3) - 4 freq
soorce (3) - 17 freq
soorces (3) - 15 freq
soort (3) - 2 freq
sortit (3) - 51 freq
forcit (3) - 2 freq
soartit (3) - 1 freq
soorest (3) - 2 freq
sortie (4) - 2 freq
forcet (4) - 1 freq
soartet (4) - 1 freq
soonit (4) - 1 freq
sovict (4) - 1 freq
sources (4) - 24 freq
sorti (4) - 2 freq
forct (4) - 1 freq
source (4) - 56 freq
sirit (4) - 1 freq
soart (4) - 47 freq
seurrit (4) - 1 freq
scoit (4) - 1 freq
servit (4) - 8 freq
sort (4) - 310 freq
sertit (4) - 1 freq
SoundEx code - S623
shrugged - 47 freq
skraiked - 40 freq
skraiched - 53 freq
skyrocket - 2 freq
shrieked - 5 freq
scree-staned - 1 freq
scragged - 1 freq
serecht-forrit - 1 freq
skreicht - 7 freq
skraicht - 2 freq
scooriest - 1 freq
sky-rocket - 3 freq
skreichd - 1 freq
s'awright - 1 freq
searched - 18 freq
soorest - 2 freq
skrekked - 2 freq
scraiched - 5 freq
soorcit - 1 freq
scarcity - 2 freq
scorched - 5 freq
scursed - 1 freq
shruggit - 4 freq
skreiched - 16 freq
sawright - 3 freq
screiched - 3 freq
scorcht - 1 freq
scraicht - 9 freq
shrieketh - 1 freq
sorriest - 1 freq
scrieched - 4 freq
serieched - 1 freq
shargit - 2 freq
screcked - 3 freq
scraik't - 1 freq
serssit - 2 freq
surrogats - 1 freq
sairched - 1 freq
shark-eyed - 1 freq
screeched - 7 freq
sweirest - 2 freq
sairest - 4 freq
skraichit - 1 freq
shrugd - 1 freq
surest - 1 freq
seraicht - 2 freq
scraacht - 1 freq
screechit - 1 freq
shoregait - 2 freq
skrougit - 2 freq
soor-sweet - 1 freq
skyriest - 1 freq
scariest - 1 freq
surged - 1 freq
scraggit - 1 freq
sair-wechtit - 1 freq
screicht - 1 freq
sharged - 1 freq
sairkyte - 1 freq
scarycath - 5 freq
seawrightdaniel - 26 freq
sirsidneyp - 1 freq
srsdr - 1 freq
skyrocketed - 1 freq
s’awright - 1 freq
swrestling - 1 freq
sirscottyoung - 1 freq
skreighed - 1 freq
sharktrustuk - 1 freq
MetaPhone code - SRST
soorest - 2 freq
soorcit - 1 freq
sorriest - 1 freq
serssit - 2 freq
sairest - 4 freq
surest - 1 freq
Manually curated - SOORCIT
Time to execute Levenshtein function - 0.324913 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
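As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen here for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the prefix of a processed
    # so far and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("soorcit", "soonit"))   # 2
print(levenshtein("soorcit", "swoopit"))  # 3
```

The two printed values agree with the distances listed for soonit and swoopit in the Levenshtein results above.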
Time to execute Double Levenshtein function - 0.472397 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
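A rough Python sketch of that idea follows. It assumes the vowels removed are a, e, i, o and u (y is kept as a consonant); that assumption is not stated on this page, although it reproduces the scores listed above. The names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: y counts as a consonant


def strip_vowels(word: str) -> str:
    """Return the word with its vowels removed, e.g. soorcit -> srct."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("soorcit", "sorrit"))  # 2 + 1 = 3
print(double_levenshtein("soorcit", "sortie"))  # 3 + 1 = 4
```

Because every consonant edit is counted in both passes while a vowel edit is counted only once, spelling variants that differ mainly in their vowels rank closer than words with different consonants.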
Time to execute SoundEx function - 0.029555 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
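To make the encoding concrete, here is a short Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It is an illustration only and may differ in edge cases from the function this site calls; the name `soundex` is chosen for the example.

```python
import string

# Digit assigned to each consonant group; vowels, h, w and y have no code.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    letters = [ch for ch in word.lower() if ch in string.ascii_lowercase]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")  # the first letter's code also suppresses repeats
    for ch in letters[1:]:
        code = CODES.get(ch, "")
        if code:
            if code != last:
                result += code
                if len(result) == 4:
                    break
            last = code
        elif ch not in "hw":
            last = ""  # a vowel separates repeated codes; h and w do not
    return result.ljust(4, "0")


print(soundex("soorcit"))   # S623
print(soundex("shrugged"))  # S623
```

Both words encode to S623, which is why shrugged appears in the SoundEx results for soorcit despite looking quite different on the page.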
Time to execute MetaPhone function - 0.038558 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000897 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
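A lookup of this kind can be sketched in Python as a plain dictionary. The table below is hypothetical: only the soorcit entry mirrors the result shown on this page (assuming the upper-case form is the lemma), the other two entries are invented for illustration, and the names `LEMMAS` and `lookup_lemma` are not taken from the site.

```python
from typing import Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
LEMMAS = {
    "soorcit": "SOORCIT",  # mirrors the result on this page
    "soorce": "SOORCE",    # invented entry, for illustration only
    "soorces": "SOORCE",   # invented entry, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEMMAS.get(word.lower())


print(lookup_lemma("soorcit"))  # SOORCIT
print(lookup_lemma("ablow"))    # None
```

A dictionary lookup does no distance calculation at all, which fits with this method having the shortest execution time reported above.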