A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to something has gone wrong in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
somethings (14) - 1 freq
somethins (15) - 2 freq
'something's (15) - 1 freq
some'hing's (15) - 2 freq
naethingelaine (15) - 1 freq
something (15) - 441 freq
elginhsgeog (15) - 2 freq
somethin's (15) - 19 freq
misshamiltoneng (16) - 1 freq
commissioner (16) - 20 freq
stephenhannan (16) - 2 freq
things-everything (16) - 1 freq
singsong (16) - 1 freq
nothingness (16) - 1 freq
somethin (16) - 1079 freq
nowthisnews (16) - 2 freq
smithsonian (16) - 3 freq
somehin's (16) - 1 freq
sometirnes (16) - 1 freq
lethington (16) - 4 freq
naethingness (16) - 1 freq
demetriachavon (16) - 1 freq
standglasgow (16) - 1 freq
bettinasross (16) - 1 freq
spittingimage (16) - 1 freq
somethings (21) - 1 freq
itsagoinwrang (22) - 1 freq
strongmisgiving (22) - 3 freq
smithsonian (23) - 3 freq
some'hing's (23) - 2 freq
staggering (23) - 2 freq
somethin's (23) - 19 freq
'something's (23) - 1 freq
somethins (23) - 2 freq
something (23) - 441 freq
neighbouring (24) - 1 freq
theglasgowvoice (24) - 1 freq
sweeping-sweeping- (24) - 1 freq
weeknightsaresoboring (24) - 1 freq
sittingbourne (24) - 1 freq
thanksgiving (24) - 3 freq
standglasgow (24) - 1 freq
engineering (24) - 4 freq
stamagasterin (24) - 1 freq
elginhsgeog (24) - 2 freq
naethingelaine (24) - 1 freq
stevenagnew (24) - 1 freq
stephenhannan (24) - 2 freq
singsong (24) - 1 freq
things-everything (24) - 1 freq
SoundEx code - S535
sometimes - 347 freq
somethin - 1079 freq
somethin's - 19 freq
something - 441 freq
sentence - 134 freq
smeddum - 111 freq
sendin - 63 freq
sentenced - 22 freq
soundin - 8 freq
sending - 13 freq
smitten - 21 freq
sentences - 42 freq
sundoon - 1 freq
soondin - 47 freq
sundoun - 11 freq
sundouns - 7 freq
skindiana - 1 freq
sentiment - 16 freq
seen-aathing - 2 freq
sneddin - 15 freq
squintin - 6 freq
somthin - 6 freq
sumthin - 97 freq
sumtimes - 12 freq
scandinavian - 19 freq
sentimental - 5 freq
sentimentality - 4 freq
sentimentalised - 1 freq
sumthing - 24 freq
'somethin - 5 freq
smithin - 1 freq
sendan - 3 freq
smoothin - 5 freq
sundance - 1 freq
sumthin' - 6 freq
soundin' - 1 freq
snawed-in - 1 freq
sumtyme's - 12 freq
sumthein - 4 freq
somethin' - 10 freq
sometheen - 54 freq
sentiments - 8 freq
somethm - 1 freq
sumthin's - 4 freq
sun-tan - 2 freq
snod-in-aboot - 1 freq
sometime - 30 freq
somiethin - 1 freq
summation - 1 freq
sumtime - 3 freq
sometims - 4 freq
sometim - 1 freq
sentencing - 1 freq
sentence't - 1 freq
sentimentalities - 1 freq
somethins - 2 freq
sneddin' - 1 freq
sometin - 7 freq
'something's - 1 freq
sentencin - 1 freq
'sometime - 1 freq
'somethin's - 1 freq
'sometimes - 2 freq
sintences - 1 freq
santin - 1 freq
somtheen - 19 freq
soondan - 2 freq
smootin - 2 freq
sentinel - 5 freq
sumtheen - 2 freq
sometheen's - 1 freq
simthin - 11 freq
simethin - 1 freq
smeddun - 1 freq
smaatoun - 1 freq
'sometimes' - 3 freq
sained-na - 1 freq
sentient - 1 freq
soundan - 3 freq
smuithin - 1 freq
syndin - 1 freq
sumthin'll - 1 freq
swynton' - 1 freq
sumth'n - 1 freq
sometym - 1 freq
sundown - 1 freq
santander - 1 freq
soondins - 2 freq
somtimes - 1 freq
smeethin - 2 freq
smittin - 2 freq
scandinavia - 9 freq
sauntin - 1 freq
sumtymes - 3 freq
€˜smeddum - 1 freq
sentimentally - 1 freq
say-onythin - 1 freq
€œsomething - 1 freq
€˜somethin - 2 freq
€œsometimes - 1 freq
scandinavians - 1 freq
smitteneen - 1 freq
sea-maiden - 1 freq
smeddumfu - 1 freq
€œsoinething - 1 freq
€œsomethin - 1 freq
sumtums - 2 freq
sometums - 1 freq
sneddon - 2 freq
smeddumfou - 2 freq
suntan - 2 freq
sumtaims - 6 freq
sentensis - 1 freq
sentins - 1 freq
somethings - 1 freq
snedandrew - 2 freq
simetimes - 1 freq
saintmirrenfc - 4 freq
‘something - 1 freq
'smeddum' - 1 freq
senten - 1 freq
semi-hidden - 1 freq
sundaymorning - 1 freq
smtenkuh - 1 freq
sandancer - 3 freq
MetaPhone code - SM0NKHSK
SOMETHING HAS GONE WRONG
Time to execute Levenshtein function - 0.266925 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.412963 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027590 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037444 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000769 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.