A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to script in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein distance
script (0) - 30 freq
stript (1) - 4 freq
skript (1) - 1 freq
scrip (1) - 3 freq
scrift (1) - 2 freq
scripts (1) - 3 freq
scrit (1) - 6 freq
scoit (2) - 1 freq
scrime (2) - 3 freq
strict (2) - 16 freq
stripe (2) - 3 freq
scrim (2) - 1 freq
sclimt (2) - 9 freq
stripy (2) - 2 freq
strip (2) - 58 freq
scraps (2) - 22 freq
scriptvs (2) - 1 freq
strippt (2) - 2 freq
slipt (2) - 6 freq
scribe (2) - 7 freq
schip (2) - 1 freq
chipt (2) - 1 freq
scilt (2) - 2 freq
sculpt (2) - 1 freq
sprit (2) - 2 freq
Double Levenshtein distance
script (0) - 30 freq
scrit (2) - 6 freq
scrapit (2) - 10 freq
scraipet (2) - 2 freq
scripts (2) - 3 freq
stript (2) - 4 freq
scrip (2) - 3 freq
skript (2) - 1 freq
scrift (2) - 2 freq
sacrit (3) - 3 freq
scrievt (3) - 1 freq
scriped (3) - 1 freq
scrape (3) - 29 freq
scriptir (3) - 4 freq
scairt (3) - 6 freq
scrimpit (3) - 4 freq
crypt (3) - 2 freq
sacrist (3) - 1 freq
scriptit (3) - 1 freq
scrunt (3) - 7 freq
scrapy (3) - 1 freq
scrat (3) - 31 freq
crept (3) - 33 freq
scraip (3) - 1 freq
scirt (3) - 1 freq
SoundEx code - S613
served - 73 freq
sairved - 4 freq
skreivit - 11 freq
screivit - 80 freq
scraped - 13 freq
scripture - 15 freq
servit - 8 freq
servt - 1 freq
scrievit - 102 freq
script - 30 freq
servetus - 2 freq
screived - 39 freq
sereived - 1 freq
'sherbit - 1 freq
sherbet - 6 freq
shirpit - 2 freq
scrieved - 46 freq
shrovetide - 1 freq
shrift - 2 freq
surveyed - 5 freq
scribed - 1 freq
scrubbed - 6 freq
scrapit - 10 freq
scriptures - 11 freq
scriptorium - 5 freq
scripturs - 4 freq
scriptur - 2 freq
scrift - 2 freq
scraipet - 2 freq
scrappet - 1 freq
scripted - 2 freq
'served - 1 freq
scruifed - 1 freq
sorbet - 2 freq
scrubbit - 1 freq
swerved - 2 freq
scrubbid - 2 freq
shiropidist - 12 freq
scrabbed - 3 freq
scrïptures - 1 freq
serepta - 1 freq
scrïpture - 1 freq
serviette - 1 freq
sharp-edged - 1 freq
scripter - 6 freq
scriptir - 4 freq
scriptirs - 6 freq
sarepta - 1 freq
scripters - 4 freq
scrievitleid - 1 freq
skribbit - 1 freq
screeved - 2 freq
surreptitiously - 2 freq
servitude - 2 freq
screevit - 3 freq
scrievt - 1 freq
scrybit - 1 freq
scriftin - 4 freq
servitors - 1 freq
serift - 2 freq
scriftit - 2 freq
scriffed - 1 freq
servitours - 1 freq
squarebottle' - 1 freq
scriptur-readin - 1 freq
scrubbit-doon - 1 freq
scripteral - 1 freq
shairp-toothed - 1 freq
scripts - 3 freq
squarebottle - 1 freq
skrievit - 1 freq
schirefdomes - 1 freq
skreived - 2 freq
surfeits - 1 freq
scriptvs - 1 freq
scriped - 1 freq
servitor - 2 freq
scrieved-on - 1 freq
surfeit - 1 freq
scrived - 2 freq
“served - 1 freq
survation - 4 freq
skript - 1 freq
sharped - 1 freq
sherbets - 1 freq
scriptit - 1 freq
MetaPhone code - SKRPT
scraped - 13 freq
script - 30 freq
scrapit - 10 freq
scraipet - 2 freq
scrappet - 1 freq
scriped - 1 freq
skript - 1 freq
Manually curated - SCRIPT
Time to execute Levenshtein function - 0.263537 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
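As an illustration, the distance can be computed with the standard dynamic-programming recurrence. This is a minimal sketch, not necessarily the function this page runs; the name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("script", "stript"))
print(levenshtein("script", "strict"))
```

With the words listed above, this gives 1 for script/stript and 2 for script/strict, matching the distances shown in brackets.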
Time to execute Double Levenshtein function - 0.574967 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
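A rough sketch of the idea follows. It is an assumption about how the two passes combine, not the page's actual code: the vowel set `aeiouy` and the names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (dynamic programming, one row at a time)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: which letters count as vowels is not stated in the source.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Return the word with every assumed vowel removed."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("script", "scrit"))
print(double_levenshtein("script", "scrapit"))
```

Under these assumptions, script/scrapit scores 2 (two edits on the full words, none on the consonant skeletons, which are both `scrpt`), and script/scrit also scores 2 (one edit in each pass).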
Time to execute SoundEx function - 0.029070 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
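The encoding can be sketched as below, following the usual American Soundex rules: keep the first letter, map the remaining consonants to digits, skip repeats of the same digit, and pad to four characters. This is an illustrative version, not necessarily the routine used here; the name `soundex` and the handling of non-letters (they are simply dropped) are assumptions.

```python
# Letter groups that share a Soundex digit.
_GROUPS = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
           ("l", "4"), ("mn", "5"), ("r", "6"))
_CODES = {ch: digit for letters, digit in _GROUPS for ch in letters}


def soundex(word: str) -> str:
    """Return the four-character Soundex code of a word."""
    # Assumption: anything outside a-z (hyphens, quotes, accents) is ignored.
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = _CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
            if len(code) == 4:
                break
        # h and w do not separate two letters that share a digit;
        # vowels do, so they reset the previous digit.
        if ch not in "hw":
            prev = digit
    return code.ljust(4, "0")


print(soundex("script"))
print(soundex("served"))
```

Both calls print S613, which is why words as different as script and served end up in the same SoundEx list above.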
Time to execute MetaPhone function - 0.069763 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000933 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
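In outline, such a lookup could work as follows. The table contents and the names `LEMMA_TABLE` and `lookup_lemma` are hypothetical; only the script to SCRIPT pairing is taken from the result above.

```python
from typing import Optional

# Hypothetical hand-made lexicon: surface form -> lemma.
# The real table is not shown on this page; these entries are illustrative.
LEMMA_TABLE = {
    "script": "SCRIPT",
    "scripts": "SCRIPT",
    "skript": "SCRIPT",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEMMA_TABLE.get(word.lower())


print(lookup_lemma("script"))
print(lookup_lemma("ablow"))
```

A plain dictionary lookup like this would explain why the curated method is the fastest of the five, and why it returns nothing for words nobody has entered by hand.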