A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to scandinavia in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
scandinavia (0) - 9 freq
scandinavian (1) - 19 freq
scaninavia (1) - 2 freq
scandinavians (2) - 1 freq
scaninavians (3) - 1 freq
cardinalis (4) - 1 freq
sardinia (4) - 2 freq
scaudin (5) - 4 freq
scandal (5) - 10 freq
standin-up (5) - 1 freq
scandals (5) - 1 freq
cardinal's (5) - 1 freq
cardinals (5) - 3 freq
standin' (5) - 2 freq
candidates (5) - 15 freq
standing (5) - 31 freq
canigait (5) - 1 freq
scannan (5) - 1 freq
cannabis (5) - 1 freq
sardinian (5) - 3 freq
canaria (5) - 1 freq
scancin (5) - 6 freq
cardinal (5) - 6 freq
scandic (5) - 3 freq
sandisans (5) - 1 freq
scandinavia (0) - 9 freq
scaninavia (2) - 2 freq
scandinavian (2) - 19 freq
scandinavians (4) - 1 freq
scaninavians (6) - 1 freq
scannan (7) - 1 freq
standing (7) - 31 freq
scandic (7) - 3 freq
scancin (7) - 6 freq
standin' (7) - 2 freq
scansin (7) - 1 freq
standin (7) - 96 freq
scandals (7) - 1 freq
scaudin (7) - 4 freq
sardinia (7) - 2 freq
swandive (7) - 1 freq
scandal (7) - 10 freq
skindiana (7) - 1 freq
scanning (7) - 1 freq
scaldin (7) - 1 freq
scannin (7) - 8 freq
scoldin (8) - 2 freq
standand (8) - 1 freq
syndin (8) - 1 freq
sending (8) - 13 freq
SoundEx code - S535
sometimes - 347 freq
somethin - 1079 freq
somethin's - 19 freq
something - 441 freq
sentence - 134 freq
smeddum - 111 freq
sendin - 63 freq
sentenced - 22 freq
soundin - 8 freq
sending - 13 freq
smitten - 21 freq
sentences - 42 freq
sundoon - 1 freq
soondin - 47 freq
sundoun - 11 freq
sundouns - 7 freq
skindiana - 1 freq
sentiment - 16 freq
seen-aathing - 2 freq
sneddin - 15 freq
squintin - 6 freq
somthin - 6 freq
sumthin - 97 freq
sumtimes - 12 freq
scandinavian - 19 freq
sentimental - 5 freq
sentimentality - 4 freq
sentimentalised - 1 freq
sumthing - 24 freq
'somethin - 5 freq
smithin - 1 freq
sendan - 3 freq
smoothin - 5 freq
sundance - 1 freq
sumthin' - 6 freq
soundin' - 1 freq
snawed-in - 1 freq
sumtyme's - 12 freq
sumthein - 4 freq
somethin' - 10 freq
sometheen - 54 freq
sentiments - 8 freq
somethm - 1 freq
sumthin's - 4 freq
sun-tan - 2 freq
snod-in-aboot - 1 freq
sometime - 30 freq
somiethin - 1 freq
summation - 1 freq
sumtime - 3 freq
sometims - 4 freq
sometim - 1 freq
sentencing - 1 freq
sentence't - 1 freq
sentimentalities - 1 freq
somethins - 2 freq
sneddin' - 1 freq
sometin - 7 freq
'something's - 1 freq
sentencin - 1 freq
'sometime - 1 freq
'somethin's - 1 freq
'sometimes - 2 freq
sintences - 1 freq
santin - 1 freq
somtheen - 19 freq
soondan - 2 freq
smootin - 2 freq
sentinel - 5 freq
sumtheen - 2 freq
sometheen's - 1 freq
simthin - 11 freq
simethin - 1 freq
smeddun - 1 freq
smaatoun - 1 freq
'sometimes' - 3 freq
sained-na - 1 freq
sentient - 1 freq
soundan - 3 freq
smuithin - 1 freq
syndin - 1 freq
sumthin'll - 1 freq
swynton' - 1 freq
sumth'n - 1 freq
sometym - 1 freq
sundown - 1 freq
santander - 1 freq
soondins - 2 freq
somtimes - 1 freq
smeethin - 2 freq
smittin - 2 freq
scandinavia - 9 freq
sauntin - 1 freq
sumtymes - 3 freq
€˜smeddum - 1 freq
sentimentally - 1 freq
say-onythin - 1 freq
€œsomething - 1 freq
€˜somethin - 2 freq
€œsometimes - 1 freq
scandinavians - 1 freq
smitteneen - 1 freq
sea-maiden - 1 freq
smeddumfu - 1 freq
€œsoinething - 1 freq
€œsomethin - 1 freq
sumtums - 2 freq
sometums - 1 freq
sneddon - 2 freq
smeddumfou - 2 freq
suntan - 2 freq
sumtaims - 6 freq
sentensis - 1 freq
sentins - 1 freq
somethings - 1 freq
snedandrew - 2 freq
simetimes - 1 freq
saintmirrenfc - 4 freq
‘something - 1 freq
'smeddum' - 1 freq
senten - 1 freq
semi-hidden - 1 freq
sundaymorning - 1 freq
smtenkuh - 1 freq
sandancer - 3 freq
MetaPhone code - SKNTNF
scandinavia - 9 freq
SCANDINAVIA
Time to execute Levenshtein function - 0.280634 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.397327 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034789 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042612 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000988 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.