A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to scandinavia in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
scandinavia (0) - 9 freq
scandinavian (1) - 21 freq
scaninavia (1) - 2 freq
scandinavians (2) - 1 freq
scaninavians (3) - 1 freq
sardinia (4) - 3 freq
cardinalis (4) - 1 freq
scandals (5) - 1 freq
scandal (5) - 10 freq
canigait (5) - 1 freq
standin-up (5) - 1 freq
scannin (5) - 8 freq
canaria (5) - 1 freq
scaldin (5) - 1 freq
cardinals (5) - 3 freq
cardinal (5) - 6 freq
cardinal's (5) - 1 freq
skindiana (5) - 1 freq
candidate (5) - 21 freq
scanning (5) - 1 freq
standin (5) - 97 freq
scaudin (5) - 4 freq
scannan (5) - 1 freq
sandisans (5) - 1 freq
scansin (5) - 1 freq
Double Levenshtein (distance in brackets)
scandinavia (0) - 9 freq
scaninavia (2) - 2 freq
scandinavian (2) - 21 freq
scandinavians (4) - 1 freq
scaninavians (6) - 1 freq
scannan (7) - 1 freq
scaudin (7) - 4 freq
standin (7) - 97 freq
scanning (7) - 1 freq
scansin (7) - 1 freq
swandive (7) - 1 freq
scandic (7) - 3 freq
skindiana (7) - 1 freq
standing (7) - 33 freq
standin' (7) - 2 freq
scancin (7) - 6 freq
scannin (7) - 8 freq
scandals (7) - 1 freq
sardinia (7) - 3 freq
scaldin (7) - 1 freq
scandal (7) - 10 freq
syndin (8) - 1 freq
staandin (8) - 25 freq
spendin (8) - 33 freq
soundin (8) - 10 freq
SoundEx code - S535
sometimes - 362 freq
somethin - 1119 freq
somethin's - 19 freq
something - 451 freq
sentence - 136 freq
smeddum - 111 freq
sendin - 63 freq
sentenced - 22 freq
soundin - 10 freq
sending - 13 freq
smitten - 21 freq
sentences - 43 freq
sundoon - 2 freq
soondin - 47 freq
sundoun - 11 freq
sundouns - 7 freq
skindiana - 1 freq
sentiment - 16 freq
seen-aathing - 2 freq
sneddin - 15 freq
squintin - 7 freq
somthin - 7 freq
sumthin - 97 freq
sumtimes - 12 freq
scandinavian - 21 freq
sentimental - 6 freq
sentimentality - 4 freq
sentimentalised - 1 freq
sumthing - 24 freq
'somethin - 5 freq
smithin - 1 freq
sendan - 3 freq
smoothin - 5 freq
sundance - 1 freq
sumthin' - 6 freq
soundin' - 1 freq
snawed-in - 1 freq
sumtyme's - 12 freq
sumthein - 4 freq
somethin' - 12 freq
squint'n - 1 freq
smitin - 1 freq
somet'n - 1 freq
sounding - 1 freq
sometheen - 54 freq
sentiments - 8 freq
somethm - 1 freq
sumthin's - 4 freq
sun-tan - 2 freq
snod-in-aboot - 1 freq
sometime - 30 freq
somiethin - 1 freq
summation - 1 freq
sumtime - 3 freq
sometims - 4 freq
sometim - 1 freq
sentencing - 1 freq
sentence't - 1 freq
sentimentalities - 1 freq
somethins - 2 freq
sneddin' - 1 freq
sometin - 7 freq
'something's - 1 freq
sentencin - 1 freq
'sometime - 1 freq
'somethin's - 1 freq
'sometimes - 2 freq
sintences - 1 freq
santin - 1 freq
somtheen - 19 freq
soondan - 2 freq
smootin - 2 freq
sentinel - 5 freq
sumtheen - 2 freq
sometheen's - 1 freq
simthin - 11 freq
simethin - 1 freq
smeddun - 1 freq
smaatoun - 1 freq
'sometimes' - 3 freq
sained-na - 1 freq
sentient - 1 freq
soundan - 3 freq
smuithin - 1 freq
syndin - 1 freq
sumthin'll - 1 freq
swynton' - 1 freq
sumth'n - 1 freq
sometym - 1 freq
sundown - 1 freq
santander - 1 freq
soondins - 2 freq
somtimes - 1 freq
smeethin - 2 freq
smittin - 2 freq
scandinavia - 9 freq
sauntin - 1 freq
sumtymes - 3 freq
‘smeddum - 1 freq
sentimentally - 1 freq
say-onythin - 1 freq
“something - 1 freq
‘somethin - 2 freq
“sometimes - 1 freq
scandinavians - 1 freq
smitteneen - 1 freq
sea-maiden - 1 freq
smeddumfu - 1 freq
“soinething - 1 freq
“somethin - 1 freq
sumtums - 2 freq
sometums - 1 freq
sneddon - 2 freq
smeddumfou - 2 freq
suntan - 2 freq
sumtaims - 6 freq
sentensis - 1 freq
sentins - 1 freq
somethings - 1 freq
snedandrew - 2 freq
simetimes - 1 freq
saintmirrenfc - 4 freq
‘something - 1 freq
'smeddum' - 1 freq
senten - 1 freq
semi-hidden - 1 freq
sundaymorning - 1 freq
smtenkuh - 1 freq
sandancer - 3 freq
MetaPhone code - SKNTNF
scandinavia - 9 freq
Manually curated - SCANDINAVIA
Time to execute Levenshtein function - 0.261600 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
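As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming edit distance. It is not necessarily the code this page runs; the function name `levenshtein` is chosen for the example, and every replacement, insertion and deletion is assumed to cost 1.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where replace, insert and delete each cost 1."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace (or keep when equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("scandinavia", "scaninavia"))
print(levenshtein("scandinavia", "scandal"))
```

With this sketch, scaninavia comes out at distance 1 and scandal at distance 5 from scandinavia, matching the figures in the Levenshtein list.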
Time to execute Double Levenshtein function - 0.418384 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
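The double pass can be sketched in Python as follows. This is a reconstruction from the description, not the site's own code. It assumes that "vowels" means the letters a, e, i, o and u, and that they are stripped from both words before the second pass. The names `strip_vowels` and `double_levenshtein` are made up for the example.

```python
VOWELS = set("aeiou")  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance where replace, insert and delete each cost 1."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("scandinavia", "scaninavia"))
print(double_levenshtein("scandinavia", "scandal"))
```

Under these assumptions the sketch reproduces the figures listed above for scandinavia: 2 for scaninavia (1 + 1) and 7 for scandal (5 + 2).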
Time to execute SoundEx function - 0.027685 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
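To show how such a code is built, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. The site's own function may differ in edge cases. The name `soundex` and the decision to ignore non-letters such as apostrophes and hyphens are assumptions of this example.

```python
# Digit classes used by American Soundex; vowels and y get no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a 4-character Soundex code, or '' if the word has no ASCII letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets this, so a repeated digit can be coded again
    return code.ljust(4, "0")


print(soundex("scandinavia"))
print(soundex("sometimes"))
```

Both words encode to S535 under these rules, which is why sometimes, somethin and sentence share a list with scandinavia: Soundex only looks at the first letter and the next three coded consonants.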
Time to execute MetaPhone function - 0.037103 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000881 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
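A lookup of this kind can be sketched in Python as a plain dictionary. The table contents below are illustrative only: the scandinavia entry mirrors the result shown on this page, the scaninavia entry is a hypothetical example of a spelling variant, and the names `LEXICON` and `curated_lemma` are invented for the sketch.

```python
from typing import Optional

# Hand-maintained table from surface form to lemma.
# Only the first entry reflects the page's output; the second is hypothetical.
LEXICON = {
    "scandinavia": "SCANDINAVIA",
    "scaninavia": "SCANDINAVIA",  # hypothetical variant entry
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None when it is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("scandinavia"))
print(curated_lemma("sonsie"))
```

Because this is a single dictionary lookup with no distance calculation, it would be expected to be the fastest of the five methods, at the cost of returning nothing for any word missing from the table.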