A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to shoregait in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
shoregait (0) - 2 freq
somegait (2) - 1 freq
shooglit (3) - 1 freq
shargit (3) - 2 freq
shruggit (3) - 4 freq
shoggit (3) - 3 freq
shorelin (3) - 2 freq
forgait (3) - 1 freq
segregatit (3) - 3 freq
shouglit (3) - 1 freq
somegaits (3) - 3 freq
shore-win (3) - 1 freq
shreddit (3) - 2 freq
streght (4) - 2 freq
screamt (4) - 4 freq
short (4) - 323 freq
shirpit (4) - 2 freq
somegate (4) - 1 freq
shortlist (4) - 1 freq
soorest (4) - 2 freq
whereat (4) - 1 freq
portrait (4) - 18 freq
threat (4) - 30 freq
spreidit (4) - 3 freq
shoart (4) - 41 freq
shoregait (0) - 2 freq
shargit (3) - 2 freq
shoggit (4) - 3 freq
shruggit (4) - 4 freq
somegait (4) - 1 freq
sharet (5) - 3 freq
chargit (5) - 2 freq
shirpit (5) - 2 freq
shooert (5) - 2 freq
ithergait (5) - 1 freq
short (5) - 323 freq
shergar (5) - 1 freq
shargin (5) - 1 freq
shoart (5) - 41 freq
segregate (5) - 1 freq
shouglit (5) - 1 freq
segregatit (5) - 3 freq
forgait (5) - 1 freq
shooglit (5) - 1 freq
shorelin (5) - 2 freq
shreddit (5) - 2 freq
shugyit (5) - 1 freq
shaepit (6) - 2 freq
threidit (6) - 4 freq
sholmit (6) - 1 freq
SoundEx code - S623
shrugged - 47 freq
skraiked - 40 freq
skraiched - 53 freq
skyrocket - 2 freq
shrieked - 5 freq
scree-staned - 1 freq
scragged - 1 freq
serecht-forrit - 1 freq
skreicht - 7 freq
skraicht - 2 freq
scooriest - 1 freq
sky-rocket - 3 freq
skreichd - 1 freq
s'awright - 1 freq
searched - 18 freq
soorest - 2 freq
skrekked - 2 freq
scraiched - 5 freq
soorcit - 1 freq
scarcity - 2 freq
scorched - 5 freq
scursed - 1 freq
shruggit - 4 freq
skreiched - 16 freq
sawright - 3 freq
screiched - 3 freq
scorcht - 1 freq
scraicht - 9 freq
shrieketh - 1 freq
sorriest - 1 freq
scrieched - 4 freq
serieched - 1 freq
shargit - 2 freq
screcked - 3 freq
scraik't - 1 freq
serssit - 2 freq
surrogats - 1 freq
sairched - 1 freq
shark-eyed - 1 freq
screeched - 7 freq
sweirest - 2 freq
sairest - 4 freq
skraichit - 1 freq
shrugd - 1 freq
surest - 1 freq
seraicht - 2 freq
scraacht - 1 freq
screechit - 1 freq
shoregait - 2 freq
skrougit - 2 freq
soor-sweet - 1 freq
skyriest - 1 freq
scariest - 1 freq
surged - 1 freq
scraggit - 1 freq
sair-wechtit - 1 freq
screicht - 1 freq
sharged - 1 freq
sairkyte - 1 freq
scarycath - 5 freq
seawrightdaniel - 26 freq
sirsidneyp - 1 freq
srsdr - 1 freq
skyrocketed - 1 freq
s’awright - 1 freq
swrestling - 1 freq
sirscottyoung - 1 freq
skreighed - 1 freq
sharktrustuk - 1 freq
MetaPhone code - XRKT
shrugged - 47 freq
shrieked - 5 freq
shruggit - 4 freq
charact - 1 freq
shrugd - 1 freq
shoregait - 2 freq
SHOREGAIT
Time to execute Levenshtein function - 0.194750 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.351274 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027614 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038777 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000953 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.