A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sameness in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sameness (0) - 1 freq
saimeness (1) - 1 freq
slaeness (2) - 1 freq
bareness (2) - 1 freq
dampness (2) - 4 freq
wareness (2) - 1 freq
sakeless (2) - 5 freq
wameless (2) - 1 freq
sadness (2) - 31 freq
saimness (2) - 1 freq
sleeness (2) - 1 freq
tumeness (2) - 1 freq
hameless (2) - 17 freq
sautness (2) - 1 freq
haleness (2) - 1 freq
nameless (2) - 5 freq
maleness (2) - 1 freq
sureness (2) - 1 freq
shameless (2) - 3 freq
sairness (2) - 8 freq
saftness (2) - 4 freq
saidnes (3) - 1 freq
fineness (3) - 1 freq
numbness (3) - 3 freq
eagerness (3) - 2 freq
sameness (0) - 1 freq
saimeness (1) - 1 freq
saimness (2) - 1 freq
sleeness (3) - 1 freq
sureness (3) - 1 freq
sairness (3) - 8 freq
sadness (3) - 31 freq
sautness (3) - 1 freq
tumeness (3) - 1 freq
smaaness (3) - 1 freq
slaeness (3) - 1 freq
sourness (4) - 1 freq
smugness (4) - 2 freq
fusomeness (4) - 1 freq
soorness (4) - 3 freq
someens (4) - 2 freq
seekness (4) - 11 freq
seamless (4) - 5 freq
smawness (4) - 1 freq
someones (4) - 1 freq
moness (4) - 2 freq
seikness (4) - 7 freq
someanes (4) - 1 freq
shameless (4) - 3 freq
wameless (4) - 1 freq
SoundEx code - S552
someone's - 4 freq
shenanigans - 8 freq
scanning - 1 freq
seaman's - 1 freq
sameness - 1 freq
summin's - 1 freq
sea-monsters - 1 freq
shining - 8 freq
swimming - 8 freq
'someones - 1 freq
summons - 3 freq
showmanship - 2 freq
seemingly - 14 freq
'scummins' - 1 freq
simon's - 2 freq
seemon's - 4 freq
smawness - 1 freq
someens - 2 freq
some-eans - 1 freq
sooming - 2 freq
simmins - 1 freq
sinians - 1 freq
simmans - 1 freq
someeen's - 1 freq
saimeness - 1 freq
some'hing's - 2 freq
shamanistic - 1 freq
sinnons - 1 freq
sumeen's - 1 freq
shamanic - 2 freq
showman-cum-grocer - 2 freq
someanes - 1 freq
seeming - 1 freq
sumhin's - 1 freq
sumhins - 1 freq
sweeming - 1 freq
somehing - 6 freq
sumhing - 4 freq
somehin's - 1 freq
smaaness - 1 freq
sooning - 1 freq
snowing - 4 freq
sumink - 1 freq
seamanship - 1 freq
symington - 1 freq
smaoineachadh - 1 freq
simmons - 1 freq
skinning - 1 freq
summming - 1 freq
sunning - 1 freq
shannons - 1 freq
shaunamacd - 1 freq
somewan's - 3 freq
someen's - 1 freq
shaming - 1 freq
sweenyness - 1 freq
someones - 1 freq
MetaPhone code - SMNS
someone's - 4 freq
seaman's - 1 freq
sameness - 1 freq
summin's - 1 freq
'someones - 1 freq
summons - 3 freq
saimness - 1 freq
simon's - 2 freq
seemon's - 4 freq
smawness - 1 freq
someens - 2 freq
some-eans - 1 freq
simmins - 1 freq
simmans - 1 freq
someeen's - 1 freq
saimeness - 1 freq
zieman's - 1 freq
sumeen's - 1 freq
someanes - 1 freq
smaaness - 1 freq
simmons - 1 freq
someen's - 1 freq
someones - 1 freq
SAMENESS
Time to execute Levenshtein function - 0.311628 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.523621 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033075 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.046448 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001241 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.