A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sun-tan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sun-tan (0) - 2 freq
suntan (1) - 2 freq
huntan (2) - 4 freq
cun-nan (2) - 1 freq
dunstan (2) - 1 freq
stan (3) - 149 freq
countan (3) - 1 freq
punchan (3) - 1 freq
funstane (3) - 1 freq
sunti (3) - 9 freq
saunstane (3) - 1 freq
sinkan (3) - 2 freq
suitin (3) - 1 freq
sutten (3) - 8 freq
sun-lik (3) - 1 freq
suttin (3) - 81 freq
sanften (3) - 1 freq
six-man (3) - 1 freq
suntee (3) - 1 freq
puritan (3) - 2 freq
cuntin (3) - 1 freq
burstan (3) - 1 freq
settan (3) - 14 freq
sun-god (3) - 1 freq
sunfaa (3) - 1 freq
sun-tan (0) - 2 freq
suntan (2) - 2 freq
snortan (4) - 2 freq
santin (4) - 1 freq
ten-ton (4) - 1 freq
sanften (4) - 1 freq
saun-rain (4) - 1 freq
syne-an (4) - 1 freq
saunstane (4) - 1 freq
sanstane (4) - 1 freq
sauntin (4) - 1 freq
cun-nan (4) - 1 freq
huntan (4) - 4 freq
dunstan (4) - 1 freq
senten (4) - 1 freq
pantan (5) - 3 freq
shoutan (5) - 5 freq
shootan (5) - 5 freq
sae-an (5) - 1 freq
set-tae (5) - 1 freq
sustain (5) - 5 freq
een-an (5) - 1 freq
singan (5) - 12 freq
run-in (5) - 1 freq
shuttan (5) - 3 freq
SoundEx code - S535
sometimes - 347 freq
somethin - 1079 freq
somethin's - 19 freq
something - 441 freq
sentence - 134 freq
smeddum - 111 freq
sendin - 63 freq
sentenced - 22 freq
soundin - 8 freq
sending - 13 freq
smitten - 21 freq
sentences - 42 freq
sundoon - 1 freq
soondin - 47 freq
sundoun - 11 freq
sundouns - 7 freq
skindiana - 1 freq
sentiment - 16 freq
seen-aathing - 2 freq
sneddin - 15 freq
squintin - 6 freq
somthin - 6 freq
sumthin - 97 freq
sumtimes - 12 freq
scandinavian - 19 freq
sentimental - 5 freq
sentimentality - 4 freq
sentimentalised - 1 freq
sumthing - 24 freq
'somethin - 5 freq
smithin - 1 freq
sendan - 3 freq
smoothin - 5 freq
sundance - 1 freq
sumthin' - 6 freq
soundin' - 1 freq
snawed-in - 1 freq
sumtyme's - 12 freq
sumthein - 4 freq
somethin' - 10 freq
sometheen - 54 freq
sentiments - 8 freq
somethm - 1 freq
sumthin's - 4 freq
sun-tan - 2 freq
snod-in-aboot - 1 freq
sometime - 30 freq
somiethin - 1 freq
summation - 1 freq
sumtime - 3 freq
sometims - 4 freq
sometim - 1 freq
sentencing - 1 freq
sentence't - 1 freq
sentimentalities - 1 freq
somethins - 2 freq
sneddin' - 1 freq
sometin - 7 freq
'something's - 1 freq
sentencin - 1 freq
'sometime - 1 freq
'somethin's - 1 freq
'sometimes - 2 freq
sintences - 1 freq
santin - 1 freq
somtheen - 19 freq
soondan - 2 freq
smootin - 2 freq
sentinel - 5 freq
sumtheen - 2 freq
sometheen's - 1 freq
simthin - 11 freq
simethin - 1 freq
smeddun - 1 freq
smaatoun - 1 freq
'sometimes' - 3 freq
sained-na - 1 freq
sentient - 1 freq
soundan - 3 freq
smuithin - 1 freq
syndin - 1 freq
sumthin'll - 1 freq
swynton' - 1 freq
sumth'n - 1 freq
sometym - 1 freq
sundown - 1 freq
santander - 1 freq
soondins - 2 freq
somtimes - 1 freq
smeethin - 2 freq
smittin - 2 freq
scandinavia - 9 freq
sauntin - 1 freq
sumtymes - 3 freq
€˜smeddum - 1 freq
sentimentally - 1 freq
say-onythin - 1 freq
€œsomething - 1 freq
€˜somethin - 2 freq
€œsometimes - 1 freq
scandinavians - 1 freq
smitteneen - 1 freq
sea-maiden - 1 freq
smeddumfu - 1 freq
€œsoinething - 1 freq
€œsomethin - 1 freq
sumtums - 2 freq
sometums - 1 freq
sneddon - 2 freq
smeddumfou - 2 freq
suntan - 2 freq
sumtaims - 6 freq
sentensis - 1 freq
sentins - 1 freq
somethings - 1 freq
snedandrew - 2 freq
simetimes - 1 freq
saintmirrenfc - 4 freq
‘something - 1 freq
'smeddum' - 1 freq
senten - 1 freq
semi-hidden - 1 freq
sundaymorning - 1 freq
smtenkuh - 1 freq
sandancer - 3 freq
MetaPhone code - SNTN
sendin - 63 freq
soundin - 8 freq
sundoon - 1 freq
soondin - 47 freq
sundoun - 11 freq
sneddin - 15 freq
sendan - 3 freq
soundin' - 1 freq
sun-tan - 2 freq
sneddin' - 1 freq
santin - 1 freq
soondan - 2 freq
sained-na - 1 freq
soundan - 3 freq
syndin - 1 freq
swynton' - 1 freq
sundown - 1 freq
sauntin - 1 freq
sneddon - 2 freq
suntan - 2 freq
senten - 1 freq
SUN-TAN
Time to execute Levenshtein function - 0.228248 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.349499 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027213 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037035 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000843 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.