A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sunlight in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sunlight (0) - 10 freq
sunlicht (1) - 43 freq
sublight (1) - 1 freq
enlight (2) - 1 freq
slight (2) - 11 freq
sunnlicht (2) - 2 freq
sunlit (2) - 2 freq
sinlicht (2) - 8 freq
skylight (2) - 1 freq
munelight (2) - 1 freq
sudricht (3) - 1 freq
gaslight (3) - 1 freq
slicht (3) - 23 freq
flight (3) - 45 freq
sight (3) - 134 freq
blight (3) - 5 freq
punisht (3) - 2 freq
plight (3) - 11 freq
staight (3) - 1 freq
munelicht (3) - 10 freq
light (3) - 300 freq
delight (3) - 33 freq
sawright (3) - 3 freq
skylicht (3) - 1 freq
upright (3) - 11 freq
sunlight (0) - 10 freq
sublight (2) - 1 freq
sunlicht (2) - 43 freq
sinlicht (3) - 8 freq
skylight (3) - 1 freq
slight (3) - 11 freq
enlight (3) - 1 freq
munelight (3) - 1 freq
sleight (4) - 2 freq
moonlight (4) - 3 freq
sunlit (4) - 2 freq
sunnlicht (4) - 2 freq
stgaight (5) - 1 freq
knight (5) - 10 freq
ash-light (5) - 1 freq
alight (5) - 3 freq
stright (5) - 1 freq
streight (5) - 4 freq
daylight (5) - 22 freq
sennicht (5) - 10 freq
skylights (5) - 1 freq
tealight (5) - 1 freq
danight (5) - 9 freq
straight (5) - 236 freq
twilight (5) - 9 freq
SoundEx code - S542
smells - 54 freq
sannals - 1 freq
shemmels - 1 freq
soonlessly - 1 freq
sunlight - 10 freq
somelike - 2 freq
sunlicht - 43 freq
samelike - 2 freq
smiles - 110 freq
sounless - 2 freq
snail's - 4 freq
seamless - 5 freq
smile's - 1 freq
snails - 13 freq
sunless - 1 freq
sinlicht - 8 freq
sannels - 1 freq
smell's - 1 freq
smyl's - 1 freq
seemless - 1 freq
smouls - 1 freq
soonless - 2 freq
shameless - 3 freq
smallest - 3 freq
smileq - 1 freq
snails' - 1 freq
snells - 1 freq
snail's-pace - 2 freq
snæls - 1 freq
smaels - 1 freq
semmelie's - 1 freq
saimelike - 1 freq
smuils - 1 freq
smools - 1 freq
'smallest' - 1 freq
sun-lik - 1 freq
sunnlicht - 2 freq
some-like - 1 freq
snell-lik - 1 freq
similes - 1 freq
skinwalker - 7 freq
same-lyke - 1 freq
seamlessly - 1 freq
smileykaren - 1 freq
smallgingergirl - 3 freq
samuelstrange - 1 freq
MetaPhone code - SNLFT
sunlight - 10 freq
SUNLIGHT
Time to execute Levenshtein function - 0.189819 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.382140 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027477 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037185 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001036 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.