A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to smicht in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
smicht (0) - 5 freq
'micht (1) - 1 freq
sicht (1) - 621 freq
shicht (1) - 2 freq
slicht (1) - 23 freq
micht (1) - 1996 freq
sich (2) - 99 freq
fricht (2) - 63 freq
snecht (2) - 1 freq
'micht' (2) - 1 freq
amichty (2) - 1 freq
mycht (2) - 3 freq
wicht (2) - 11 freq
richt (2) - 4119 freq
smilt (2) - 1 freq
socht (2) - 143 freq
liicht (2) - 1 freq
'eicht (2) - 2 freq
mieht (2) - 1 freq
swecht (2) - 1 freq
michi (2) - 1 freq
taicht (2) - 3 freq
blicht (2) - 4 freq
aricht (2) - 71 freq
suiht (2) - 1 freq
Double Levenshtein (distance in brackets)
smicht (0) - 5 freq
slicht (2) - 23 freq
shicht (2) - 2 freq
micht (2) - 1996 freq
sicht (2) - 621 freq
'micht (2) - 1 freq
shecht (3) - 1 freq
mocht (3) - 3 freq
swecht (3) - 1 freq
skecht (3) - 1 freq
seycht (3) - 1 freq
soucht (3) - 9 freq
secht (3) - 3 freq
smasht (3) - 1 freq
saucht (3) - 10 freq
macht (3) - 3 freq
michta (3) - 7 freq
socht (3) - 143 freq
amichty (3) - 1 freq
mycht (3) - 3 freq
mecht (3) - 1 freq
snecht (3) - 1 freq
stecht (3) - 2 freq
shocht (3) - 1 freq
michty (3) - 210 freq
SoundEx code - S523
scansed - 8 freq
sunset - 28 freq
sensed - 18 freq
sneckit - 28 freq
sneezed - 10 freq
sneysters - 5 freq
sanctity - 2 freq
sinsed - 1 freq
smashed - 35 freq
sensation - 22 freq
saunstane - 1 freq
sanct - 45 freq
sconced - 2 freq
scanced - 10 freq
snashed - 11 freq
sanctioned - 7 freq
smoked - 24 freq
smaaest - 11 freq
smuist - 1 freq
somegate - 1 freq
seamstress - 1 freq
sancts - 4 freq
sneesht - 1 freq
smuchter - 1 freq
snokit - 2 freq
snecked - 6 freq
sneaked - 12 freq
sunsets - 4 freq
sangster's - 6 freq
sneistery - 1 freq
smeekit - 9 freq
singit - 1 freq
semi-steamin - 1 freq
smeakit - 1 freq
sanctuary - 11 freq
snugged - 1 freq
sensitive - 16 freq
sneists - 1 freq
sanctis - 4 freq
sanct's - 1 freq
sanctuarie - 2 freq
snaggit - 1 freq
snichtert - 1 freq
saimstress - 1 freq
snouked - 1 freq
sanct-aundraes - 1 freq
sinister-lookin - 1 freq
sinister - 10 freq
sanction - 4 freq
sanctuary' - 1 freq
snochtered - 4 freq
smuiked - 1 freq
skin-scowder - 1 freq
snaiked - 1 freq
sneck-drawer - 1 freq
smeukit - 1 freq
'sneck-drawin' - 1 freq
snakit - 3 freq
sun-stuffie - 1 freq
smokit - 6 freq
sanctum - 1 freq
sneeked - 1 freq
sunnyside - 1 freq
sneukit - 1 freq
snushed - 3 freq
sangsters - 13 freq
sneakit - 2 freq
sneakt - 1 freq
sanctimonious - 6 freq
skinniest - 1 freq
snogged - 2 freq
sneckt - 1 freq
swan-necked - 1 freq
smasht - 1 freq
snochters - 5 freq
snawstorm - 1 freq
smackit - 4 freq
seenister - 1 freq
smawest - 3 freq
sneestered - 1 freq
smokkit - 1 freq
snuggit - 2 freq
'sunset - 1 freq
singed - 3 freq
smacked - 7 freq
sunset's - 2 freq
sanstane - 1 freq
sennicht - 10 freq
smaa'est - 1 freq
snystie - 3 freq
snowked - 2 freq
snochterdichter - 1 freq
sneyster - 5 freq
sunket - 3 freq
sneist - 5 freq
skansed - 1 freq
sanctuar - 5 freq
sensautioun - 1 freq
sneckid - 1 freq
sensethe - 1 freq
snowkit - 1 freq
smooked - 1 freq
sneistie - 6 freq
smeegit - 3 freq
song-threeds - 1 freq
sneak'd - 1 freq
skenstid - 1 freq
skenstoft - 1 freq
sensitivity - 4 freq
snochter - 1 freq
sneisty - 1 freq
snasht - 1 freq
sneester - 3 freq
smeigit - 1 freq
snaikit - 2 freq
sunkets - 2 freq
snake-heidit - 1 freq
smash't - 1 freq
sneest - 1 freq
somegaits - 3 freq
sneistie-like - 2 freq
somegaits- - 1 freq
shihuangdi - 1 freq
sang-gaitherers - 1 freq
sang-gaitherer - 1 freq
sansgter - 1 freq
suinest - 2 freq
seanachaidh - 2 freq
seinister - 3 freq
scene-settin - 1 freq
smaest - 2 freq
snagged - 1 freq
smashet - 1 freq
sensationalist - 1 freq
sensit - 1 freq
sumgates - 1 freq
sangster - 24 freq
smuchtered - 1 freq
sangstress - 1 freq
skonsed - 1 freq
sinkit - 1 freq
snekkit - 4 freq
smouchterin - 1 freq
sneggit - 1 freq
skonced - 1 freq
snashit - 1 freq
sancte - 4 freq
sainctis - 1 freq
snochter-dichters - 1 freq
snowsuit - 1 freq
‘sunset - 2 freq
snaked - 2 freq
semester - 2 freq
smeeged - 3 freq
semi-hostile - 1 freq
‘sangsteris - 1 freq
smackheids - 1 freq
sensations - 3 freq
smackheid - 4 freq
sennichts - 2 freq
sun-god - 1 freq
snochtert - 1 freq
seamstresses - 1 freq
sunset--- - 1 freq
smuikit - 1 freq
swingset - 2 freq
sanctification - 1 freq
sneeshed - 1 freq
sensiteevity - 1 freq
sinister-luckin - 1 freq
somegait - 1 freq
scanct - 1 freq
sneesters - 1 freq
snecht - 1 freq
shamsatoo - 1 freq
sneysterin - 1 freq
snaw-coatit - 1 freq
sangster-sangscrievers - 1 freq
sensational - 7 freq
sunsoot - 1 freq
sowensauthor - 1 freq
smicht - 5 freq
samgoodman - 1 freq
soonest - 1 freq
symngtnbadyin - 1 freq
sxmsdpni - 1 freq
seemescotland - 5 freq
sneistin - 1 freq
sunniest - 1 freq
sungaets - 1 freq
sheenster - 150 freq
sheensterpriv - 1 freq
smxt - 1 freq
MetaPhone code - SMXT
smashed - 35 freq
smasht - 1 freq
smash't - 1 freq
smashet - 1 freq
smatchet - 1 freq
smicht - 5 freq
Manually curated - SMICHT
Time to execute Levenshtein function - 0.274390 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
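As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this page runs, and the function name `levenshtein` is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep a match)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ["smicht", "micht", "sicht", "richt", "socht"]:
        print(word, levenshtein("smicht", word))
    # smicht 0, micht 1, sicht 1, richt 2, socht 2
```

These values agree with the bracketed figures in the Levenshtein results for smicht listed earlier on the page.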
Time to execute Double Levenshtein function - 0.492359 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
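The following Python sketch shows one way the two passes could be combined. The names `strip_vowels` and `double_levenshtein` are hypothetical. The vowel set is also an assumption: the page does not say which letters are stripped, but treating y as a vowel reproduces listed figures such as michty (3) and mycht (3).

```python
import re

# Assumption: a, e, i, o, u and y count as vowels (not stated on the page).
VOWELS = re.compile(r"[aeiouy]", re.IGNORECASE)


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (insert, delete, replace), computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete
                current[j - 1] + 1,            # insert
                previous[j - 1] + (ca != cb),  # replace or match
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Remove every assumed vowel, leaving the consonant skeleton."""
    return VOWELS.sub("", word)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ["smicht", "slicht", "shecht", "michty"]:
        print(word, double_levenshtein("smicht", word))
    # smicht 0, slicht 2, shecht 3, michty 3
```

A consonant change is counted in both passes, while a vowel change is counted only in the first, which is where the double weighting comes from.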
Time to execute SoundEx function - 0.028184 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
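To make the encoding concrete, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It is an assumption that the page's own function follows exactly these rules; the name `soundex` and the table `CODES` belong to the example only.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in [("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")]:
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if word has no letters."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")  # digit of the previous coded letter
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated digit is allowed after it
        if len(result) == 4:
            break
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ["smicht", "sunset", "smashed", "micht"]:
        print(word, soundex(word))
    # smicht S523, sunset S523, smashed S523, micht M230
```

Because the first letter is kept verbatim, micht encodes to M230 rather than S523, which is why it is missing from the SoundEx results even though it is only one edit away from smicht.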
Time to execute MetaPhone function - 0.069283 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000876 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
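A lookup of this kind can be sketched in Python as a plain dictionary. Everything here is illustrative: `LEXICON` and `lookup_lemma` are hypothetical names, the smicht entry is inferred from the result shown on this page, and the other entry is an invented placeholder rather than data from the corpus.

```python
from typing import Optional

# Hand-built table: surface form -> lemma.
LEXICON = {
    "smicht": "SMICHT",   # inferred from the curated result shown on this page
    "smichts": "SMICHT",  # hypothetical variant, not taken from the corpus
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None when the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("Smicht"))  # SMICHT
    print(lookup_lemma("ablow"))   # None, because this sample table has no entry
```

A dictionary lookup does no distance computation at all, which fits the very small execution time reported for this method.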