A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to some-eans in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
some-eans (0) - 1 freq
some-ean (1) - 28 freq
some-een (2) - 4 freq
'some-ean (2) - 1 freq
someens (2) - 2 freq
someean (2) - 3 freq
someane (3) - 17 freq
someeen (3) - 1 freq
somewan's (3) - 3 freq
somewan (3) - 34 freq
someen's (3) - 1 freq
some-cunt (3) - 1 freq
someen (3) - 36 freq
comedians (3) - 7 freq
somethins (3) - 2 freq
someman (3) - 1 freq
coke-cans (3) - 2 freq
someeen's (3) - 1 freq
someanes (3) - 1 freq
somedae's (4) - 1 freq
saveens (4) - 1 freq
sameness (4) - 1 freq
simmans (4) - 1 freq
somedae (4) - 1 freq
sometims (4) - 4 freq
some-eans (0) - 1 freq
some-ean (2) - 28 freq
someens (3) - 2 freq
some-een (3) - 4 freq
someanes (4) - 1 freq
someean (4) - 3 freq
'some-ean (4) - 1 freq
someeen's (5) - 1 freq
someman (5) - 1 freq
comedians (5) - 7 freq
simmans (5) - 1 freq
someones (5) - 1 freq
somethins (5) - 2 freq
someen (5) - 36 freq
someeen (5) - 1 freq
someane (5) - 17 freq
somewan (5) - 34 freq
somewan's (5) - 3 freq
some-cunt (5) - 1 freq
someen's (5) - 1 freq
sowans (6) - 1 freq
screens (6) - 22 freq
sumeen (6) - 1 freq
sometums (6) - 1 freq
servans (6) - 4 freq
SoundEx code - S552
someone's - 4 freq
shenanigans - 8 freq
scanning - 1 freq
seaman's - 1 freq
sameness - 1 freq
summin's - 1 freq
sea-monsters - 1 freq
shining - 8 freq
swimming - 8 freq
'someones - 1 freq
summons - 3 freq
showmanship - 2 freq
seemingly - 14 freq
'scummins' - 1 freq
simon's - 2 freq
seemon's - 4 freq
smawness - 1 freq
someens - 2 freq
some-eans - 1 freq
sooming - 2 freq
simmins - 1 freq
sinians - 1 freq
simmans - 1 freq
someeen's - 1 freq
saimeness - 1 freq
some'hing's - 2 freq
shamanistic - 1 freq
sinnons - 1 freq
sumeen's - 1 freq
shamanic - 2 freq
showman-cum-grocer - 2 freq
someanes - 1 freq
seeming - 1 freq
sumhin's - 1 freq
sumhins - 1 freq
sweeming - 1 freq
somehing - 6 freq
sumhing - 4 freq
somehin's - 1 freq
smaaness - 1 freq
sooning - 1 freq
snowing - 4 freq
sumink - 1 freq
seamanship - 1 freq
symington - 1 freq
smaoineachadh - 1 freq
simmons - 1 freq
skinning - 1 freq
summming - 1 freq
sunning - 1 freq
shannons - 1 freq
shaunamacd - 1 freq
somewan's - 3 freq
someen's - 1 freq
shaming - 1 freq
sweenyness - 1 freq
someones - 1 freq
MetaPhone code - SMNS
someone's - 4 freq
seaman's - 1 freq
sameness - 1 freq
summin's - 1 freq
'someones - 1 freq
summons - 3 freq
saimness - 1 freq
simon's - 2 freq
seemon's - 4 freq
smawness - 1 freq
someens - 2 freq
some-eans - 1 freq
simmins - 1 freq
simmans - 1 freq
someeen's - 1 freq
saimeness - 1 freq
zieman's - 1 freq
sumeen's - 1 freq
someanes - 1 freq
smaaness - 1 freq
simmons - 1 freq
someen's - 1 freq
someones - 1 freq
SOME-EANS
Time to execute Levenshtein function - 0.210285 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.376347 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029754 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.043849 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001032 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.