A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to siller-secks in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
siller-secks (0) - 1 freq
siller-reid (4) - 1 freq
siller's (5) - 4 freq
siller-grey (5) - 1 freq
sillerknowes (5) - 2 freq
silversauns (5) - 2 freq
siller-makkin (5) - 1 freq
sillerweed (5) - 1 freq
sillered (5) - 1 freq
silversands (5) - 1 freq
sillocks (5) - 2 freq
sillerie (5) - 15 freq
sillicks (5) - 1 freq
silverback (5) - 2 freq
silverside (6) - 1 freq
spleet-second (6) - 1 freq
killers (6) - 9 freq
ill-luck (6) - 2 freq
collecks (6) - 1 freq
sillerin (6) - 4 freq
ill-deeds (6) - 2 freq
ooie-socks (6) - 1 freq
sculleries (6) - 1 freq
filler-outer (6) - 1 freq
sellers' (6) - 1 freq
siller-secks (0) - 1 freq
sillicks (8) - 1 freq
siller-makkin (8) - 1 freq
sillocks (8) - 2 freq
siller-reid (8) - 1 freq
ball-cocks (9) - 1 freq
silversands (9) - 1 freq
silverback (9) - 2 freq
siller-grey (9) - 1 freq
sillerknowes (9) - 2 freq
siller's (9) - 4 freq
silversauns (9) - 2 freq
limericks (10) - 1 freq
wedder-cocks (10) - 1 freq
hillocks (10) - 3 freq
six-packs (10) - 1 freq
syllabuses (10) - 1 freq
flouer-deckt (10) - 1 freq
splore-seekin (10) - 1 freq
flaesocks (10) - 1 freq
ill-spak (10) - 2 freq
sillars (10) - 2 freq
seller's (10) - 1 freq
siller-obsessit (10) - 1 freq
willicks (10) - 2 freq
SoundEx code - S462
scullery's - 1 freq
sculleries - 1 freq
scholars - 35 freq
siller's - 4 freq
schoolwork - 2 freq
sel-richtous - 1 freq
shoulers - 6 freq
sailors - 17 freq
slurs - 3 freq
seller's - 1 freq
slorach - 1 freq
sailor's - 2 freq
scholarship - 9 freq
slrc - 15 freq
slrc's - 2 freq
schullars - 1 freq
siller-secks - 1 freq
'sailor's - 1 freq
skylark's - 1 freq
sauler's - 1 freq
sellars - 1 freq
saulers - 1 freq
slayers - 2 freq
salaries - 3 freq
siller-glaurin - 1 freq
sillerknowes - 2 freq
shell-wark - 1 freq
scholarship' - 1 freq
sailors' - 2 freq
skülwark - 1 freq
siller-grey - 1 freq
sellers - 2 freq
sellers' - 1 freq
slairgy - 1 freq
sillars - 2 freq
skylark - 2 freq
sjclark - 1 freq
skylarks - 2 freq
ssalyers - 11 freq
solwayrecycling - 1 freq
MetaPhone code - SLRSKS
siller-secks - 1 freq
SILLER-SECKS
siller - 763 freq
sillerie - 15 freq
sillerin - 4 freq
siller's - 4 freq
silver - 121 freq
sillery - 3 freq
siller-secks - 1 freq
silverweed - 1 freq
silvery - 8 freq
silverwood - 3 freq
silversauns - 2 freq
silverfish - 1 freq
Time to execute Levenshtein function - 0.258127 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.408230 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032941 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039321 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000998 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.