A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sodom in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sodom (0) - 9 freq
soom (1) - 10 freq
dodos (2) - 1 freq
sodium (2) - 1 freq
sidon (2) - 11 freq
'dom (2) - 2 freq
modem (2) - 4 freq
sod (2) - 22 freq
loom (2) - 14 freq
soop (2) - 17 freq
aidom (2) - 1 freq
sado (2) - 1 freq
adom (2) - 1 freq
scoom (2) - 2 freq
goom (2) - 1 freq
todo (2) - 1 freq
boddom (2) - 39 freq
seldom (2) - 27 freq
sood (2) - 37 freq
soomt (2) - 1 freq
shoom (2) - 1 freq
xoom (2) - 1 freq
doom (2) - 34 freq
solos (2) - 3 freq
sook (2) - 64 freq
sodom (0) - 9 freq
soom (2) - 10 freq
sodium (2) - 1 freq
som (3) - 2 freq
soda (3) - 20 freq
easedom (3) - 4 freq
shoom (3) - 1 freq
aisedom (3) - 6 freq
doom (3) - 34 freq
so'm (3) - 1 freq
sodae (3) - 2 freq
sodas (3) - 1 freq
easdom (3) - 1 freq
dom (3) - 3 freq
soum (3) - 10 freq
sloom (3) - 5 freq
sood (3) - 37 freq
sods (3) - 10 freq
-dom (3) - 2 freq
sado (3) - 1 freq
adom (3) - 1 freq
aidom (3) - 1 freq
sod (3) - 22 freq
sidon (3) - 11 freq
modem (3) - 4 freq
SoundEx code - S350
stane - 421 freq
sittin - 737 freq
shoutin - 137 freq
skytin - 15 freq
staun - 224 freq
stany - 5 freq
sudden - 213 freq
steen - 114 freq
staan - 13 freq
steam - 77 freq
settin - 144 freq
shuttin - 39 freq
steyin - 40 freq
shootin - 43 freq
scaudin - 4 freq
stymie - 2 freq
saitin - 3 freq
settan - 14 freq
sweatin - 16 freq
sydney - 7 freq
stem - 18 freq
stone - 85 freq
stowin - 6 freq
suttin - 81 freq
schtum - 2 freq
skitin - 23 freq
showtime - 1 freq
squattin - 1 freq
stewin - 4 freq
seatoun - 1 freq
sit-doun - 1 freq
stame - 8 freq
swaden - 1 freq
sweitin - 2 freq
shoudna - 6 freq
setten - 22 freq
sitten - 12 freq
shitin - 4 freq
skuddin - 1 freq
stayin - 22 freq
shoutin' - 2 freq
sittin' - 21 freq
sidden - 16 freq
scootin - 4 freq
shuidnae - 11 freq
sweetin - 3 freq
seethin - 3 freq
stan - 150 freq
steinway - 4 freq
staem - 1 freq
sodden - 11 freq
stoon - 9 freq
stein - 22 freq
showdin - 2 freq
staney - 6 freq
stam - 2 freq
scuddin - 5 freq
sidn - 1 freq
sheetin - 6 freq
steenie - 25 freq
stan¢ - 1 freq
satan - 24 freq
sodom - 9 freq
scythin - 2 freq
shoutan - 5 freq
seethan - 1 freq
sodium - 1 freq
shudnae - 30 freq
stowen - 3 freq
seton - 18 freq
sweden - 20 freq
soothin - 5 freq
sioatin' - 1 freq
settin' - 13 freq
steem - 13 freq
shotten - 4 freq
shouten - 2 freq
sheeten - 1 freq
seitten - 1 freq
skiddin - 4 freq
suddin - 1 freq
skidden - 1 freq
syden - 1 freq
staine - 2 freq
shidnae - 1 freq
sa'tin' - 1 freq
shutt'n - 1 freq
sit'n - 2 freq
sitt'n - 1 freq
stony - 4 freq
stan' - 8 freq
sedn - 1 freq
stine - 1 freq
stain - 26 freq
seatin - 4 freq
showdoon - 1 freq
shittin - 4 freq
satin - 12 freq
southan - 1 freq
steamy - 5 freq
stane-waa - 1 freq
shidna - 9 freq
sutten - 8 freq
steeny - 7 freq
stiyin - 1 freq
stawn - 4 freq
stehin - 1 freq
stowan - 1 freq
styin - 4 freq
sheddin - 5 freq
sithean - 1 freq
sidney - 23 freq
sidon - 11 freq
sïttin - 8 freq
stane' - 2 freq
sautin - 1 freq
skatin - 3 freq
'stan - 2 freq
soothin' - 1 freq
side-on - 1 freq
sweeten' - 1 freq
sit-in - 2 freq
saddam - 1 freq
steen' - 2 freq
shutten - 2 freq
steamie - 9 freq
sea-aeten - 1 freq
shuttan - 3 freq
sittan - 32 freq
skoitin - 1 freq
soodna - 7 freq
stown - 7 freq
shoodna - 3 freq
ston - 9 freq
stun - 2 freq
shuidna - 16 freq
skeetin - 1 freq
sweatan - 1 freq
shouteen - 1 freq
soddan - 1 freq
shaidin - 28 freq
sïxtaen - 1 freq
shuitin - 4 freq
staen - 2 freq
shuitten - 2 freq
shoudno - 2 freq
situn - 1 freq
stoma - 5 freq
sthaain - 1 freq
stöd'im - 1 freq
steyn - 8 freq
shut-doon - 1 freq
shadam - 1 freq
suden - 1 freq
staeyan - 1 freq
setteen - 2 freq
stayan - 1 freq
schotten - 2 freq
shiten - 1 freq
stoun - 2 freq
sheddan - 1 freq
soothan - 1 freq
seteen - 1 freq
sudna - 13 freq
stonn - 1 freq
stime - 4 freq
sateen - 1 freq
steyan - 2 freq
said-na - 1 freq
sittm - 1 freq
swaiden - 2 freq
styme - 2 freq
soudna - 2 freq
sweeten - 3 freq
schatten - 1 freq
seedin - 1 freq
sautan - 1 freq
saidna - 1 freq
stimna - 1 freq
suitin - 1 freq
swattin - 2 freq
swytin - 1 freq
'staun - 1 freq
swithin - 1 freq
€˜staun - 1 freq
suidna - 3 freq
sawtan - 1 freq
stanie - 2 freq
stawen - 3 freq
stowein - 1 freq
swiytin - 1 freq
shootan - 5 freq
shudna - 4 freq
sýstem - 2 freq
shitein - 2 freq
staun' - 2 freq
shadno - 1 freq
stane- - 1 freq
stayin' - 1 freq
soddin' - 1 freq
suiden - 1 freq
squaattin - 1 freq
scoutin - 1 freq
shadin - 1 freq
€œstaan - 1 freq
sudn - 1 freq
styin' - 1 freq
staiyin - 2 freq
siden - 2 freq
€˜siden - 1 freq
stenn - 1 freq
suidnae - 1 freq
seithin - 1 freq
shoudnae - 3 freq
sutton - 6 freq
skaitan - 1 freq
sittn - 1 freq
sitin - 1 freq
set-in - 1 freq
shitin' - 1 freq
stoney - 3 freq
scottm - 1 freq
seaton - 3 freq
shutdoon - 2 freq
setn - 1 freq
stane” - 1 freq
sjtnw - 1 freq
shoud'nae - 1 freq
settn - 1 freq
shoodnae - 1 freq
shutdown - 2 freq
scottewen - 1 freq
sudan - 1 freq
stayhome - 1 freq
MetaPhone code - STM
steam - 77 freq
stymie - 2 freq
stem - 18 freq
stame - 8 freq
wyssdom - 1 freq
staem - 1 freq
stam - 2 freq
sodom - 9 freq
sodium - 1 freq
steem - 13 freq
steamy - 5 freq
saddam - 1 freq
steamie - 9 freq
stoma - 5 freq
wísdom - 1 freq
stime - 4 freq
sittm - 1 freq
styme - 2 freq
xtmy - 1 freq
SODOM
Time to execute Levenshtein function - 0.204179 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.364319 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027807 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039489 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000928 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.