A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to sunken in Corpus

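Results are listed in five groups: Levenshtein, Double Levenshtein, SoundEx, MetaPhone and Manually curated. In the first two groups the number in brackets is the distance from the search word.
Levenshtein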
sunken (0) - 4 freq
sunket (1) - 3 freq
unken (1) - 4 freq
sulken (1) - 1 freq
sukken (1) - 1 freq
sppken (2) - 1 freq
suner (2) - 3 freq
jungen (2) - 1 freq
sutten (2) - 8 freq
sinker (2) - 1 freq
sucked (2) - 6 freq
bunker (2) - 23 freq
sudden (2) - 213 freq
bunden (2) - 1 freq
sulkin (2) - 5 freq
spiken (2) - 3 freq
spaken (2) - 4 freq
stukken (2) - 7 freq
spukken (2) - 10 freq
suntan (2) - 2 freq
stunden (2) - 1 freq
sunset (2) - 28 freq
tunkin (2) - 1 freq
sukan (2) - 1 freq
suiden (2) - 1 freq
Double Levenshtein
sunken (0) - 4 freq
sinkin (2) - 25 freq
sinkn (2) - 1 freq
sukken (2) - 1 freq
sinkan (2) - 2 freq
sunket (2) - 3 freq
sulken (2) - 1 freq
unken (2) - 4 freq
shaken (3) - 7 freq
suckin (3) - 6 freq
sumkeyn (3) - 1 freq
sunks (3) - 3 freq
dunkin (3) - 2 freq
spoken (3) - 217 freq
sumkin (3) - 1 freq
sunn (3) - 8 freq
sokken (3) - 1 freq
hunkin (3) - 1 freq
unkan (3) - 22 freq
sankey (3) - 1 freq
sikken (3) - 11 freq
sicken (3) - 1 freq
snakin (3) - 2 freq
soaken (3) - 2 freq
silken (3) - 10 freq
SoundEx code - S525
sunshine - 79 freq
sinkin - 25 freq
singin - 286 freq
swingin - 27 freq
snowkin - 7 freq
sinsyne - 30 freq
smoking - 13 freq
smokin - 97 freq
sneezin - 27 freq
sneezing - 4 freq
singing - 30 freq
sensins - 1 freq
snoozin - 9 freq
sneckin - 7 freq
somecunt's - 1 freq
smashin - 60 freq
'somecunt - 1 freq
smugness - 2 freq
sensyne - 7 freq
scancin - 6 freq
shangshan - 2 freq
somecunt - 6 freq
smashing - 6 freq
some-cunt - 1 freq
sneakin - 6 freq
samson - 8 freq
sneeshin - 8 freq
smeekin - 2 freq
sing-sang - 4 freq
synsyne's - 1 freq
smokkin - 3 freq
sunsheen - 6 freq
smashinest - 1 freq
sensin - 9 freq
sunken - 4 freq
smookan - 2 freq
singan - 12 freq
singin' - 2 freq
sinkin' - 2 freq
smokin' - 4 freq
simson's - 1 freq
smakken - 1 freq
simson - 1 freq
simsen's - 1 freq
sumkeyn - 1 freq
smok'n - 1 freq
smeakin - 1 freq
scansouns - 1 freq
sinking - 5 freq
smookin - 3 freq
smackin - 4 freq
snashin - 1 freq
swinging - 3 freq
sneexin - 1 freq
sangsmiths - 1 freq
snoggin - 2 freq
somchin - 1 freq
snogyin - 1 freq
skinkin - 1 freq
sing-sangie - 1 freq
snoozan - 1 freq
smokeen - 2 freq
sneakan - 2 freq
snokin - 2 freq
scancein - 1 freq
sing-sangy - 3 freq
singsangy - 1 freq
scansin - 1 freq
sunnschein - 2 freq
sweengan - 2 freq
swung'im - 1 freq
snushan - 2 freq
sinkan - 2 freq
swingan - 1 freq
smokan - 2 freq
sanson - 1 freq
sensan - 1 freq
sangmaakers - 1 freq
sungkin - 1 freq
singkin - 1 freq
swinkin - 1 freq
sin-sheen - 1 freq
samson's - 3 freq
sumkin - 1 freq
smockan - 1 freq
sneezan - 6 freq
sinseen - 1 freq
sinshene - 1 freq
singin't - 1 freq
snoukin-saats - 1 freq
sangsmith - 4 freq
smushin - 4 freq
snowcem- - 1 freq
smeikin - 1 freq
sinshine - 10 freq
sunsheeny - 1 freq
sinsheeny - 1 freq
'singin - 1 freq
sunshein - 1 freq
snakin - 2 freq
singsong - 1 freq
smeegin - 1 freq
swankin' - 1 freq
sneakin' - 1 freq
“smashin - 1 freq
snoozing - 1 freq
semi-sympathetic - 1 freq
sensing - 1 freq
sense-o-humour - 1 freq
sangam - 1 freq
synesen - 1 freq
sneckins - 1 freq
sweengin - 2 freq
‘sinsyne - 1 freq
‘smashing - 1 freq
‘smashin - 1 freq
semi-conscious - 1 freq
singjn' - 1 freq
shaunkennedy - 1 freq
smashin' - 1 freq
schmazing - 1 freq
semi-juniors - 1 freq
samheughan - 1 freq
smnzm - 1 freq
singin’ - 1 freq
sinkn - 1 freq
snewsma - 1 freq
samsung - 2 freq
samhowson - 1 freq
snsmw - 1 freq
sjnjmzhrfc - 1 freq
swingingthelead - 1 freq
swanswan - 1 freq
sneaking - 1 freq
snecking - 1 freq
sunshinekid - 1 freq
seanmcm - 1 freq
smoochin - 1 freq
seancampbell - 5 freq
sonnyswanson - 2 freq
shamshoum - 1 freq
snacking - 1 freq
syngenta - 1 freq
snowyscene - 1 freq
MetaPhone code - SNKN
sinkin - 25 freq
snowkin - 7 freq
sneckin - 7 freq
sneakin - 6 freq
sunken - 4 freq
singan - 12 freq
sinkin' - 2 freq
snoggin - 2 freq
sneakan - 2 freq
snokin - 2 freq
sinkan - 2 freq
snakin - 2 freq
sneakin' - 1 freq
sinkn - 1 freq
Manually curated - SUNKEN
Time to execute Levenshtein function - 0.494811 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
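The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this site runs; the function name `levenshtein` is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep)
            ))
        previous = current
    return previous[-1]


print(levenshtein("sunken", "sunket"))   # one replacement
print(levenshtein("sunken", "stukken"))  # one insertion, one replacement
```

The distances it gives for sunket, unken, sudden and stukken agree with the bracketed values in the Levenshtein list above.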
Time to execute Double Levenshtein function - 0.629268 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
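A rough Python sketch of that idea follows. The site's own implementation is not shown here, so the details are assumptions: the helper names are invented for the example, and the vowel set (`aeiouy`, with y counted as a vowel) is a guess chosen because it reproduces the distances listed above, such as sumkeyn (3).

```python
VOWELS = set("aeiouy")  # assumption: the real vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("sunken", "sinkin"))  # vowels differ, consonants match
print(double_levenshtein("sunken", "sunket"))  # a consonant differs, counted twice
```

Under this scheme a vowel change such as sinkin costs the same as a single consonant change such as sunket, which is why both appear at distance 2 in the Double Levenshtein list.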
Time to execute SoundEx function - 0.069744 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
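As an illustration, here is a small Python sketch of the classic (American) Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad or truncate to four characters. It is a sketch under those assumptions and may differ in edge cases from the function this site actually calls; the name `soundex` is chosen for the example.

```python
# Digit assigned to each consonant group; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code, e.g. a letter followed by three digits."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated codes
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
            if len(result) == 4:
                break
        prev = code  # a vowel resets prev, so a repeat after it is kept
    return result.ljust(4, "0")


for w in ("sunken", "sinkin", "sunshine", "singin"):
    print(w, soundex(w))
```

All four words map to S525, the code shown for sunken above, which is why such loosely related words share one SoundEx list.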
Time to execute MetaPhone function - 0.047257 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001135 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
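A lookup of this kind can be sketched as a plain dictionary. The table below is hypothetical: only the sunken to SUNKEN pairing comes from the results above, the other entry is invented for illustration, and the names `LEXICON` and `curated_lemma` are assumptions rather than the site's own.

```python
from typing import Optional

# Hypothetical hand-made lexicon mapping spellings to lemmas.
LEXICON = {
    "sunken": "SUNKEN",  # pairing shown in the results above
    "sunkin": "SUNKEN",  # invented spelling variant, for illustration only
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the lemma for a word, or None if the lexicon does not cover it."""
    return LEXICON.get(word.lower())


print(curated_lemma("Sunken"))
print(curated_lemma("ahint"))  # not in this toy table, so no lemma is returned
```

A dictionary lookup takes constant time, which fits with this being the fastest of the five methods timed above.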