A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to outside in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
outside (0) - 87 freq
outsides (1) - 2 freq
ootside (1) - 635 freq
outsyde (1) - 2 freq
outsider (1) - 1 freq
cotside (2) - 1 freq
ootdside (2) - 1 freq
ootsider (2) - 8 freq
gutsie (2) - 2 freq
ootsite (2) - 1 freq
subside (2) - 2 freq
upside (2) - 37 freq
ootsyde (2) - 4 freq
ootsize (2) - 1 freq
onside (2) - 2 freq
oanside (2) - 1 freq
outline (2) - 4 freq
offside (2) - 3 freq
southside (2) - 4 freq
outshine (2) - 1 freq
couttie (3) - 12 freq
fouthie (3) - 16 freq
guttie (3) - 5 freq
outin (3) - 4 freq
€™ootside (3) - 1 freq
outside (0) - 87 freq
outsyde (1) - 2 freq
ootside (1) - 635 freq
outsider (2) - 1 freq
ootsyde (2) - 4 freq
outsides (2) - 2 freq
cotside (3) - 1 freq
onside (3) - 2 freq
ootsiyd (3) - 1 freq
teeside (3) - 1 freq
ootisde (3) - 1 freq
ootsize (3) - 1 freq
tayside (3) - 2 freq
oanside (3) - 1 freq
ootsider (3) - 8 freq
ootsite (3) - 1 freq
ootdside (3) - 1 freq
upside (3) - 37 freq
faside (4) - 1 freq
sooside (4) - 2 freq
outset (4) - 1 freq
beside (4) - 63 freq
tousie (4) - 11 freq
gateside (4) - 1 freq
ootsized (4) - 1 freq
SoundEx code - O323
ootside - 635 freq
outsider - 1 freq
outstandin - 1 freq
outside - 87 freq
ootisde - 1 freq
ootstreiched - 1 freq
ootstreitched - 1 freq
ootstandin - 3 freq
ootstreekit - 2 freq
ootsheddies - 1 freq
outstretched - 3 freq
ootstrecht - 1 freq
ootsyde - 4 freq
ootsider - 8 freq
ootsiders - 4 freq
oddest - 1 freq
outstanding - 4 freq
ootstannin - 2 freq
ootset - 15 freq
ootsets - 6 freq
outsyde - 2 freq
oot-streetched - 1 freq
out-streikit - 1 freq
ootsiyd - 1 freq
ootgate - 1 freq
outsides - 2 freq
ootgait - 1 freq
ootstreeckit - 1 freq
ootstaunin - 3 freq
o'eedjit - 1 freq
outstreikit - 1 freq
outsettin - 1 freq
outstandingly - 1 freq
ootsite - 1 freq
€™ootside - 1 freq
ootsettin - 1 freq
outstaundin - 1 freq
ootstretched - 1 freq
€˜ootside - 1 freq
ootcottj - 1 freq
ootdside - 1 freq
outset - 1 freq
ootsideÂ… - 1 freq
MetaPhone code - OTST
ootside - 635 freq
outside - 87 freq
ootisde - 1 freq
ootsyde - 4 freq
oddest - 1 freq
ootset - 15 freq
outsyde - 2 freq
ootsiyd - 1 freq
ootsite - 1 freq
€™ootside - 1 freq
€˜ootside - 1 freq
outset - 1 freq
ootsideÂ… - 1 freq
OUTSIDE
ootside - 635 freq
outside - 87 freq
ootsider - 8 freq
outsider - 1 freq
ootsiders - 4 freq
outsiders - freq
Time to execute Levenshtein function - 0.200288 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.332454 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027336 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037024 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001063 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.