A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to some-like in Corpus

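Results are listed by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Numbers in parentheses are distances from some-like.

Levenshtein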
some-like (0) - 1 freq
somelike (1) - 2 freq
samelike (2) - 2 freq
same-lyke (2) - 1 freq
sojer-like (2) - 1 freq
soup-like (2) - 1 freq
roit-like (3) - 1 freq
maze-like (3) - 1 freq
dowf-like (3) - 2 freq
smertlike (3) - 1 freq
shore-line (3) - 1 freq
sometime (3) - 30 freq
sae-lik (3) - 1 freq
dour-like (3) - 1 freq
loesome-like (3) - 1 freq
slaw-like (3) - 1 freq
wyce-like (3) - 13 freq
douce-like (3) - 8 freq
body-like (3) - 1 freq
soberlie (3) - 1 freq
wyse-like (3) - 1 freq
bousome-like (3) - 1 freq
owre-like (3) - 1 freq
someplace (3) - 14 freq
saw-like (3) - 1 freq

Double Levenshtein
some-like (0) - 1 freq
somelike (2) - 2 freq
same-lyke (2) - 1 freq
soup-like (3) - 1 freq
samelike (3) - 2 freq
sic-like (4) - 11 freq
bousome-like (4) - 1 freq
saw-like (4) - 1 freq
soar-lik (4) - 1 freq
loesome-like (4) - 1 freq
saimelike (4) - 1 freq
saut-like (4) - 1 freq
sik-like (4) - 2 freq
sojer-like (4) - 1 freq
sae-lik (4) - 1 freq
solemn-like (5) - 1 freq
such-like (5) - 1 freq
saelike (5) - 3 freq
och-like (5) - 1 freq
joco-like (5) - 1 freq
gyte-like (5) - 1 freq
nosey-like (5) - 1 freq
wice-like (5) - 3 freq
smellie (5) - 1 freq
caum-like (5) - 1 freq
SoundEx code - S542
smells - 50 freq
sannals - 1 freq
shemmels - 1 freq
soonlessly - 1 freq
sunlight - 10 freq
somelike - 2 freq
sunlicht - 42 freq
samelike - 2 freq
smiles - 103 freq
sounless - 2 freq
snail's - 4 freq
seamless - 5 freq
smile's - 1 freq
snails - 13 freq
sunless - 1 freq
sinlicht - 8 freq
sannels - 1 freq
smell's - 1 freq
smyl's - 1 freq
soonless - 2 freq
shameless - 3 freq
smallest - 3 freq
smileq - 1 freq
snails' - 1 freq
snells - 1 freq
snail's-pace - 2 freq
snæls - 1 freq
smaels - 1 freq
semmelie's - 1 freq
saimelike - 1 freq
smuils - 1 freq
smools - 1 freq
'smallest' - 1 freq
sun-lik - 1 freq
sunnlicht - 2 freq
some-like - 1 freq
snell-lik - 1 freq
similes - 1 freq
skinwalker - 7 freq
same-lyke - 1 freq
seamlessly - 1 freq
smileykaren - 1 freq
smallgingergirl - 3 freq
samuelstrange - 1 freq
MetaPhone code - SMLK
somelike - 2 freq
samelike - 2 freq
symbolic - 9 freq
smileq - 1 freq
saimelike - 1 freq
some-like - 1 freq
same-lyke - 1 freq
Manually curated - SOME-LIKE
Time to execute Levenshtein function - 0.198870 milliseconds
The Levenshtein distance is the minimum number of single-character substitutions, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
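
As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. The function name `levenshtein` is chosen for this example; it is not necessarily how the site computes its figures, although the three sample words reproduce the distances listed in the results.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # substitute, free if equal
            ))
        previous = current
    return previous[-1]


for word in ("somelike", "samelike", "sometime"):
    print(word, levenshtein("some-like", word))
# somelike 1
# samelike 2
# sometime 3
```

Only two rows of the table are kept in memory at a time, which is enough when just the distance is needed.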
Time to execute Double Levenshtein function - 0.358707 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
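
The idea can be sketched in Python as follows. This is an illustration under assumptions, not the site's code: the names `strip_vowels` and `double_levenshtein` are made up for the example, and the vowel set `aeiouy` (with y counted as a vowel) is an assumption, chosen because it reproduces the scores listed in the results, such as same-lyke at 2.

```python
def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


# Assumption: y is treated as a vowel; this matches the listed scores.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants, hyphens and other characters."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("somelike", "same-lyke", "soup-like", "sic-like"):
    print(word, double_levenshtein("some-like", word))
# somelike 2
# same-lyke 2
# soup-like 3
# sic-like 4
```

A vowel-only difference such as same-lyke costs nothing in the second pass, so it ranks above soup-like, where a consonant changes and is counted twice.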
Time to execute SoundEx function - 0.027928 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
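
To show how different spellings end up with the same code, here is a minimal Python sketch of the classic Soundex rules. It is an approximation written for this page, not the site's implementation: the helper name `soundex` and the choice to ignore hyphens, apostrophes and non-ASCII letters are assumptions, though they reproduce the S542 code shown in the results.

```python
# Letter groups and their Soundex digits; vowels and h, w, y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus up to three digits."""
    # Assumption: anything that is not an ASCII letter is ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated codes
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")  # pad short codes with zeros


for word in ("some-like", "smells", "sunlicht", "skinwalker"):
    print(word, soundex(word))
# some-like S542
# smells S542
# sunlicht S542
# skinwalker S542
```

Because the code stops after three digits, long words such as skinwalker match on their opening sounds alone, which explains the looser neighbours in the SoundEx list.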
Time to execute MetaPhone function - 0.037743 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000871 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
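
A lookup of this kind can be sketched as a plain dictionary in Python. Everything here is illustrative: the names `LEXICON` and `lookup_lemma` are invented for the example, the some-like entry is assumed from the result shown on this page, and the second entry is purely hypothetical.

```python
from typing import Optional

# Hand-built table mapping spellings to lemmas (contents are illustrative).
LEXICON = {
    "some-like": "SOME-LIKE",  # assumed from the result shown on this page
    "somelike": "SOME-LIKE",   # hypothetical entry, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("some-like"))
print(lookup_lemma("not-in-table"))
```

A dictionary lookup does no string comparison across the corpus, which is consistent with this method having the shortest execution time reported above.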