A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sharktrustuk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sharktrustuk (0) - 1 freq
starstruck (5) - 1 freq
shoartcut (6) - 1 freq
starkers (6) - 1 freq
smartest (6) - 4 freq
star-struck (6) - 1 freq
shortlistit (6) - 1 freq
sharkies (6) - 1 freq
stardust (6) - 1 freq
shortcut (6) - 4 freq
structur (6) - 9 freq
shortlist (6) - 1 freq
shortest (6) - 1 freq
harprest (6) - 1 freq
arkbounduk (6) - 3 freq
shakers (6) - 2 freq
shortlisted (6) - 1 freq
shargeret (6) - 8 freq
madtrust (6) - 1 freq
shortlists (6) - 1 freq
sharks (6) - 9 freq
augustus (7) - 35 freq
harrumph (7) - 2 freq
status (7) - 118 freq
hairsts (7) - 7 freq
sharktrustuk (0) - 1 freq
shortlistit (9) - 1 freq
shortlisted (9) - 1 freq
shortlist (9) - 1 freq
shortest (9) - 1 freq
shortlists (9) - 1 freq
starstruck (9) - 1 freq
harprest (10) - 1 freq
shakers (10) - 2 freq
characteristic (10) - 5 freq
sharks (10) - 9 freq
shargeret (10) - 8 freq
sharkies (10) - 1 freq
shoartcut (10) - 1 freq
smartest (10) - 4 freq
starkers (10) - 1 freq
shortcut (10) - 4 freq
shairpest (11) - 2 freq
hert-seek (11) - 1 freq
surcastik (11) - 2 freq
shearers (11) - 1 freq
superstructure (11) - 1 freq
shafttours (11) - 1 freq
sherpest (11) - 1 freq
sherbert (11) - 1 freq
SoundEx code - S623
shrugged - 47 freq
skraiked - 40 freq
skraiched - 53 freq
skyrocket - 2 freq
shrieked - 5 freq
scree-staned - 1 freq
scragged - 1 freq
serecht-forrit - 1 freq
skreicht - 7 freq
skraicht - 2 freq
scooriest - 1 freq
sky-rocket - 3 freq
skreichd - 1 freq
s'awright - 1 freq
searched - 18 freq
soorest - 2 freq
skrekked - 2 freq
scraiched - 5 freq
soorcit - 1 freq
scarcity - 2 freq
scorched - 5 freq
scursed - 1 freq
shruggit - 4 freq
skreiched - 16 freq
sawright - 3 freq
screiched - 3 freq
scorcht - 1 freq
scraicht - 9 freq
shrieketh - 1 freq
sorriest - 1 freq
scrieched - 4 freq
serieched - 1 freq
shargit - 2 freq
screcked - 3 freq
scraik't - 1 freq
serssit - 2 freq
surrogats - 1 freq
sairched - 1 freq
shark-eyed - 1 freq
screeched - 7 freq
sweirest - 2 freq
sairest - 4 freq
skraichit - 1 freq
shrugd - 1 freq
surest - 1 freq
seraicht - 2 freq
scraacht - 1 freq
screechit - 1 freq
shoregait - 2 freq
skrougit - 2 freq
soor-sweet - 1 freq
skyriest - 1 freq
scariest - 1 freq
surged - 1 freq
scraggit - 1 freq
sair-wechtit - 1 freq
screicht - 1 freq
sharged - 1 freq
sairkyte - 1 freq
scarycath - 5 freq
seawrightdaniel - 26 freq
sirsidneyp - 1 freq
srsdr - 1 freq
skyrocketed - 1 freq
s’awright - 1 freq
swrestling - 1 freq
sirscottyoung - 1 freq
skreighed - 1 freq
sharktrustuk - 1 freq
MetaPhone code - XRKTRSTK
characteristic - 5 freq
sharktrustuk - 1 freq
SHARKTRUSTUK
Time to execute Levenshtein function - 0.241203 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.392670 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027409 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037218 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000905 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.