A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to scotplaywright in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
scotplaywright (0) - 8 freq
playwright (4) - 3 freq
playwricht (5) - 4 freq
pleywright (5) - 2 freq
copyright (5) - 1 freq
playwrights (5) - 3 freq
calumwright (6) - 15 freq
scots-airish (6) - 1 freq
spotlight (6) - 4 freq
s’awright (6) - 1 freq
s'awright (6) - 1 freq
scotnight (6) - 2 freq
copyricht (6) - 17 freq
playwrichts (6) - 6 freq
sawright (6) - 3 freq
pleywrights (6) - 6 freq
coapiericht (7) - 4 freq
spotlieht (7) - 1 freq
stgaight (7) - 1 freq
ootright (7) - 1 freq
sixty-eight (7) - 1 freq
compatriot (7) - 2 freq
starlight (7) - 1 freq
scottacraigie (7) - 3 freq
scotlit (7) - 38 freq
scotplaywright (0) - 8 freq
playwright (7) - 3 freq
pleywright (8) - 2 freq
playwrights (9) - 3 freq
copyright (9) - 1 freq
playwricht (9) - 4 freq
scotnight (10) - 2 freq
sawright (10) - 3 freq
pleywrights (10) - 6 freq
s'awright (10) - 1 freq
spotlight (10) - 4 freq
calumwright (10) - 15 freq
s’awright (10) - 1 freq
straight (11) - 236 freq
stright (11) - 1 freq
starlight (11) - 1 freq
bertwright (11) - 2 freq
copyricht (11) - 17 freq
scots-airish (11) - 1 freq
playwrichts (11) - 6 freq
scotlandnt (12) - 26 freq
scotmarshy (12) - 6 freq
streight (12) - 4 freq
scotslanguage (12) - 241 freq
scotlang (12) - 5 freq
SoundEx code - S314
stable - 71 freq
saut-bleared - 1 freq
shot-blastin - 1 freq
steeple - 12 freq
suitably - 3 freq
stifled - 1 freq
stables - 18 freq
stabilisers - 1 freq
suitable - 13 freq
stievely - 13 freq
steive-like - 4 freq
stavelt - 1 freq
stiflin - 4 freq
stubble - 13 freq
stibble-rig - 1 freq
stobhill - 1 freq
stapleton - 1 freq
sweit-blint - 1 freq
southfield - 1 freq
staple - 6 freq
stibble - 14 freq
stabeelitie - 1 freq
steeplechase - 1 freq
stiffly - 7 freq
seatbelts - 2 freq
stipulated - 2 freq
she-devils - 1 freq
seatbelt - 3 freq
staples - 1 freq
sweetiefolls - 1 freq
stubbly - 2 freq
stabill - 1 freq
staibil - 3 freq
step-ladder - 1 freq
seatbele - 1 freq
staff'll - 1 freq
steeple' - 1 freq
steeble - 1 freq
stifling - 1 freq
stabilise - 2 freq
stapless - 1 freq
stifle - 3 freq
stabilitie - 1 freq
steepled - 1 freq
stipplin - 1 freq
'stubbly' - 1 freq
step'll - 1 freq
steivelie - 2 freq
steeplt - 1 freq
sitable - 2 freq
steively - 1 freq
stipulates - 1 freq
stable-fiers - 1 freq
stablin - 1 freq
staiblishin - 1 freq
stabeility - 1 freq
stabillis - 1 freq
stability - 3 freq
skeetiploots - 1 freq
shadae-play - 1 freq
stibblie - 1 freq
shit-filled - 1 freq
staeblee - 1 freq
scotpol - 14 freq
stepladder - 3 freq
stabilization - 1 freq
southbelfast - 1 freq
stfilansdream - 3 freq
stvlouise - 2 freq
stevieleedsy - 3 freq
scotplacenames - 1 freq
stuff-laddies - 1 freq
stevielou - 1 freq
scotplaywright - 8 freq
MetaPhone code - SKTPLRFT
scotplaywright - 8 freq
SCOTPLAYWRIGHT
Time to execute Levenshtein function - 0.206640 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.366733 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027515 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037420 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000914 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.