A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to something has gone wrong in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
somethings (14) - 1 freq
elginhsgeog (15) - 2 freq
something (15) - 383 freq
somethins (15) - 2 freq
somethin's (15) - 18 freq
some'hing's (15) - 2 freq
'something's (15) - 1 freq
thethingsyousee (16) - 1 freq
'somethin's (16) - 1 freq
somethin' (16) - 5 freq
nothingness (16) - 1 freq
someither (16) - 1 freq
engineering (16) - 4 freq
getmethinagain (16) - 1 freq
gethincjones (16) - 10 freq
things-everything (16) - 1 freq
nottinghamshire (16) - 1 freq
somehin's (16) - 1 freq
lostglasgow (16) - 1 freq
sometheen (16) - 54 freq
demetriachavon (16) - 1 freq
womenreadwomen (16) - 6 freq
neighbouring (16) - 1 freq
theglasgowvoice (16) - 1 freq
naethingness (16) - 1 freq
somethings (21) - 1 freq
itsagoinwrang (22) - 1 freq
strongmisgiving (22) - 3 freq
'something's (23) - 1 freq
some'hing's (23) - 2 freq
staggering (23) - 2 freq
something (23) - 383 freq
somethins (23) - 2 freq
somethin's (23) - 18 freq
stephenhannan (24) - 2 freq
sweeping-sweeping- (24) - 1 freq
thanksgiving (24) - 2 freq
theglasgowvoice (24) - 1 freq
weeknightsaresoboring (24) - 1 freq
singsong (24) - 1 freq
sittingbourne (24) - 1 freq
neighbouring (24) - 1 freq
elginhsgeog (24) - 2 freq
standglasgow (24) - 1 freq
stevenagnew (24) - 1 freq
engineering (24) - 4 freq
things-everything (24) - 1 freq
sing-sang (25) - 2 freq
smitherin (25) - 1 freq
tightening (25) - 1 freq
SoundEx code - S535
sometimes - 253 freq
somethin - 883 freq
somethin's - 18 freq
something - 383 freq
sentence - 148 freq
sundoun - 11 freq
sundouns - 7 freq
sometheen - 54 freq
sendin - 47 freq
sentiments - 6 freq
sentimental - 3 freq
soondin - 42 freq
sentences - 52 freq
smeddum - 67 freq
'somethin - 4 freq
somethm - 1 freq
soundin - 6 freq
sumthin's - 4 freq
sun-tan - 2 freq
snod-in-aboot - 1 freq
sometime - 26 freq
scandinavian - 19 freq
somiethin - 1 freq
summation - 1 freq
sentenced - 14 freq
sumtimes - 6 freq
sumtime - 2 freq
sumthin - 100 freq
smitten - 14 freq
sometims - 4 freq
sometim - 1 freq
sentencing - 1 freq
sentence't - 1 freq
sentimentalities - 1 freq
sentiment - 12 freq
somethins - 2 freq
smoothin - 3 freq
sneddin - 9 freq
squintin - 3 freq
sneddin' - 1 freq
somethin' - 5 freq
sometin - 7 freq
'something's - 1 freq
seen-aathing - 1 freq
sentencin - 1 freq
'sometime - 1 freq
'somethin's - 1 freq
'sometimes - 1 freq
sintences - 1 freq
santin - 1 freq
somtheen - 19 freq
soondan - 2 freq
smootin - 1 freq
sentinel - 3 freq
sumtheen - 2 freq
sometheen's - 1 freq
simthin - 11 freq
simethin - 1 freq
smeddun - 1 freq
smaatoun - 1 freq
'sometimes' - 3 freq
sained-na - 1 freq
sentient - 1 freq
soundan - 2 freq
sendan - 2 freq
smuithin - 1 freq
sumthing - 10 freq
syndin - 1 freq
sumthin'll - 1 freq
swynton' - 1 freq
sumth'n - 1 freq
sometym - 1 freq
sentimentality - 3 freq
soondins - 2 freq
somtimes - 1 freq
smeethin - 2 freq
smittin - 2 freq
scandinavia - 6 freq
sauntin - 1 freq
say-onythin - 1 freq
€œsomething - 1 freq
sending - 12 freq
€˜somethin - 2 freq
€œsometimes - 1 freq
scandinavians - 1 freq
smitteneen - 1 freq
sea-maiden - 1 freq
sumtymes - 2 freq
smeddumfu - 1 freq
€œsoinething - 1 freq
€œsomethin - 1 freq
sumtums - 2 freq
sometums - 1 freq
sneddon - 2 freq
smeddumfou - 2 freq
suntan - 1 freq
sumtaims - 6 freq
sentensis - 1 freq
sentins - 1 freq
somethings - 1 freq
snedandrew - 2 freq
simetimes - 1 freq
saintmirrenfc - 4 freq
somthin - 1 freq
‘something - 1 freq
'smeddum' - 1 freq
senten - 1 freq
semi-hidden - 1 freq
sundaymorning - 1 freq
smtenkuh - 1 freq
sandancer - 3 freq
MetaPhone code - SM0NKHSK
SOMETHING HAS GONE WRONG
Time to execute Levenshtein function - 0.401761 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.756679 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.045568 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.061522 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000959 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.