A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to script in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
script (0) - 30 freq
stript (1) - 4 freq
scripts (1) - 3 freq
scrift (1) - 2 freq
scrit (1) - 6 freq
scrip (1) - 3 freq
skript (1) - 1 freq
scripter (2) - 6 freq
crept (2) - 33 freq
scriptit (2) - 1 freq
scribe (2) - 7 freq
scrive (2) - 1 freq
crypt (2) - 2 freq
scraipet (2) - 2 freq
skrit (2) - 6 freq
scrat (2) - 31 freq
scries (2) - 1 freq
scraps (2) - 22 freq
scirt (2) - 1 freq
sprit (2) - 2 freq
sclimt (2) - 9 freq
chipt (2) - 1 freq
scrape (2) - 30 freq
scriba (2) - 1 freq
criet (2) - 1 freq
script (0) - 30 freq
scrapit (2) - 10 freq
scraipet (2) - 2 freq
scrip (2) - 3 freq
skript (2) - 1 freq
scrit (2) - 6 freq
scrift (2) - 2 freq
scripts (2) - 3 freq
stript (2) - 4 freq
scraip (3) - 1 freq
scrap (3) - 33 freq
scairt (3) - 6 freq
scrunt (3) - 7 freq
scriped (3) - 1 freq
scriptur (3) - 2 freq
scrapy (3) - 1 freq
sculpt (3) - 1 freq
scriptir (3) - 4 freq
scrimpit (3) - 4 freq
scripted (3) - 2 freq
sacrist (3) - 1 freq
scrievt (3) - 1 freq
scrape (3) - 30 freq
scraps (3) - 22 freq
scrat (3) - 31 freq
SoundEx code - S613
served - 75 freq
sairved - 4 freq
skreivit - 11 freq
screivit - 83 freq
scraped - 14 freq
scripture - 15 freq
servit - 8 freq
servt - 1 freq
scrievit - 104 freq
script - 30 freq
servetus - 2 freq
screived - 39 freq
sereived - 1 freq
'sherbit - 1 freq
sherbet - 7 freq
shirpit - 2 freq
scrieved - 47 freq
shrovetide - 1 freq
shrift - 2 freq
surveyed - 5 freq
scribed - 1 freq
scrubbed - 6 freq
scrapit - 10 freq
scriptures - 11 freq
scriptorium - 5 freq
scripturs - 4 freq
scriptur - 2 freq
scrift - 2 freq
scraipet - 2 freq
scrappet - 1 freq
scrived - 3 freq
sherp-tuithed - 1 freq
scripted - 2 freq
'served - 1 freq
scruifed - 1 freq
sorbet - 2 freq
scrubbit - 1 freq
swerved - 2 freq
scrubbid - 2 freq
shiropidist - 12 freq
scrabbed - 3 freq
scrïptures - 1 freq
serepta - 1 freq
scrïpture - 1 freq
serviette - 1 freq
sharp-edged - 1 freq
scripter - 6 freq
scriptir - 4 freq
scriptirs - 6 freq
sarepta - 1 freq
scripters - 4 freq
scrievitleid - 1 freq
skribbit - 1 freq
screeved - 2 freq
surreptitiously - 2 freq
servitude - 2 freq
screevit - 3 freq
scrievt - 1 freq
scrybit - 1 freq
scriftin - 4 freq
servitors - 1 freq
serift - 2 freq
scriftit - 2 freq
scriffed - 1 freq
servitours - 1 freq
squarebottle' - 1 freq
scriptur-readin - 1 freq
scrubbit-doon - 1 freq
scripteral - 1 freq
shairp-toothed - 1 freq
scripts - 3 freq
squarebottle - 1 freq
skrievit - 1 freq
schirefdomes - 1 freq
skreived - 2 freq
surfeits - 1 freq
scriptvs - 1 freq
scriped - 1 freq
servitor - 2 freq
scrieved-on - 1 freq
surfeit - 1 freq
€œserved - 1 freq
survation - 4 freq
skript - 1 freq
sharped - 1 freq
sherbets - 1 freq
scriptit - 1 freq
MetaPhone code - SKRPT
scraped - 14 freq
script - 30 freq
scrapit - 10 freq
scraipet - 2 freq
scrappet - 1 freq
scriped - 1 freq
skript - 1 freq
SCRIPT
Time to execute Levenshtein function - 0.190133 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337045 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027521 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038921 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000905 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.