A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to shrubs in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
shrubs (0) - 4 freq
shrugs (1) - 13 freq
scrubs (1) - 6 freq
shrub (1) - 2 freq
crubs (2) - 2 freq
shreds (2) - 8 freq
cherubs (2) - 1 freq
subs (2) - 5 freq
sheus (2) - 1 freq
shu's (2) - 2 freq
shrug (2) - 17 freq
shours (2) - 1 freq
shrews (2) - 2 freq
scrub (2) - 19 freq
shrouds (2) - 1 freq
rubs (2) - 10 freq
sheu's (2) - 89 freq
serbs (2) - 2 freq
throbs (2) - 2 freq
shrunk (2) - 9 freq
snubs (2) - 1 freq
grubs (2) - 2 freq
hubs (2) - 3 freq
shups (2) - 1 freq
struts (2) - 11 freq
shrubs (0) - 4 freq
shrugs (2) - 13 freq
shrub (2) - 2 freq
scrubs (2) - 6 freq
shrews (3) - 2 freq
shrouds (3) - 1 freq
shours (3) - 1 freq
thrabs (3) - 1 freq
serbs (3) - 2 freq
throbs (3) - 2 freq
shreds (3) - 8 freq
cherubs (3) - 1 freq
shroods (4) - 2 freq
shargs (4) - 1 freq
shoures (4) - 1 freq
shares (4) - 27 freq
shorts (4) - 42 freq
sherds (4) - 2 freq
shories (4) - 1 freq
sheers (4) - 2 freq
herbs (4) - 14 freq
straabs (4) - 1 freq
suburbs (4) - 8 freq
shearurs (4) - 1 freq
sharks (4) - 9 freq
SoundEx code - S612
soor-faced - 4 freq
sherpest - 1 freq
surface - 86 freq
service - 198 freq
scraps - 22 freq
sherpshuiters - 1 freq
scrieves - 14 freq
services - 66 freq
serves - 26 freq
scrapes - 10 freq
shrubs - 4 freq
shairpest - 2 freq
scrubs - 6 freq
scribes - 7 freq
sharpek - 1 freq
scarf's - 1 freq
scarfs - 4 freq
scerfs - 1 freq
screives - 7 freq
servaice - 1 freq
sharpish - 5 freq
scrappies - 2 freq
sharpishly - 1 freq
surfaced - 3 freq
skirps - 3 freq
serbs - 2 freq
'service - 1 freq
servicin - 1 freq
sairvices - 5 freq
servicemen - 2 freq
services' - 1 freq
surpasst - 1 freq
surveys - 10 freq
serbo-croat - 1 freq
scrapbuik - 5 freq
squarepeg - 1 freq
scrabba's - 1 freq
sarves - 2 freq
scrap-buik - 1 freq
surfaces - 6 freq
scrabster - 5 freq
'surfs' - 1 freq
scarves - 7 freq
sarvice - 5 freq
sairvice - 7 freq
scraeps - 1 freq
surfiece - 1 freq
surfeece - 1 freq
seraphic - 1 freq
surfies - 2 freq
surpassin - 2 freq
surpass - 2 freq
scrovchlin - 1 freq
service' - 2 freq
skreives - 1 freq
skrieves - 3 freq
sairves - 1 freq
soor-pussed - 1 freq
sheriff's - 1 freq
sairvice-hyste - 1 freq
surpassed - 1 freq
sarvices - 5 freq
seerups - 1 freq
surpasses - 1 freq
servicemin - 1 freq
scrapbook - 1 freq
srfk - 1 freq
swarfega - 1 freq
sirbfac - 1 freq
surfacing - 1 freq
sherpish - 1 freq
swarovskioptik - 1 freq
sarahfstewart - 1 freq
szrpxcqybx - 1 freq
MetaPhone code - XRBS
shrubs - 4 freq
cherubs - 1 freq
SHRUBS
Time to execute Levenshtein function - 0.225615 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.421801 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028895 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040991 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000939 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.