A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to scrapes in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in brackets)
scrapes (0) - 10 freq
scrape (1) - 29 freq
scraper (1) - 2 freq
scraped (1) - 13 freq
scraps (1) - 22 freq
schades (2) - 1 freq
scape (2) - 1 freq
scrapit (2) - 10 freq
cranes (2) - 7 freq
craps (2) - 15 freq
scries (2) - 1 freq
rapes (2) - 1 freq
sclaves (2) - 2 freq
scraeps (2) - 1 freq
scrappet (2) - 1 freq
scribes (2) - 7 freq
escapes (2) - 4 freq
drapes (2) - 6 freq
scrapin (2) - 19 freq
scraep (2) - 1 freq
straes (2) - 3 freq
screes (2) - 1 freq
craves (2) - 3 freq
strakes (2) - 1 freq
scanes (2) - 1 freq
Double Levenshtein (distance in brackets)
scrapes (0) - 10 freq
scraps (1) - 22 freq
scraeps (2) - 1 freq
scraper (2) - 2 freq
scrape (2) - 29 freq
scraped (2) - 13 freq
straps (3) - 9 freq
scruples (3) - 3 freq
stripes (3) - 9 freq
scrapy (3) - 1 freq
scares (3) - 5 freq
scrapan (3) - 2 freq
scrans (3) - 3 freq
scriped (3) - 1 freq
scripts (3) - 3 freq
scraipet (3) - 2 freq
scrap (3) - 33 freq
socrates (3) - 3 freq
scrappy (3) - 4 freq
scrapins (3) - 3 freq
scrappies (3) - 2 freq
screes (3) - 1 freq
scries (3) - 1 freq
escapes (3) - 4 freq
scribes (3) - 7 freq
SoundEx code - S612
soor-faced - 4 freq
sherpest - 1 freq
surface - 85 freq
service - 196 freq
scraps - 22 freq
sherpshuiters - 1 freq
scrieves - 14 freq
services - 66 freq
serves - 26 freq
scrapes - 10 freq
shrubs - 4 freq
shairpest - 2 freq
scrubs - 6 freq
scribes - 7 freq
sharpek - 1 freq
scarf's - 1 freq
servaice - 1 freq
sharpish - 5 freq
scrappies - 2 freq
sharpishly - 1 freq
surfaced - 3 freq
skirps - 3 freq
serbs - 2 freq
'service - 1 freq
servicin - 1 freq
sairvices - 5 freq
servicemen - 2 freq
services' - 1 freq
surpasst - 1 freq
surveys - 10 freq
serbo-croat - 1 freq
scrapbuik - 5 freq
squarepeg - 1 freq
scrabba's - 1 freq
sarves - 2 freq
scrap-buik - 1 freq
surfaces - 6 freq
scrabster - 5 freq
'surfs' - 1 freq
scarves - 7 freq
sarvice - 5 freq
sairvice - 7 freq
scraeps - 1 freq
surfiece - 1 freq
surfeece - 1 freq
seraphic - 1 freq
scarfs - 3 freq
surfies - 2 freq
surpassin - 2 freq
surpass - 2 freq
scrovchlin - 1 freq
service' - 2 freq
skreives - 1 freq
screives - 6 freq
skrieves - 3 freq
sairves - 1 freq
soor-pussed - 1 freq
sheriff's - 1 freq
sairvice-hyste - 1 freq
surpassed - 1 freq
sarvices - 5 freq
seerups - 1 freq
surpasses - 1 freq
servicemin - 1 freq
scrapbook - 1 freq
srfk - 1 freq
swarfega - 1 freq
sirbfac - 1 freq
surfacing - 1 freq
sherpish - 1 freq
swarovskioptik - 1 freq
sarahfstewart - 1 freq
szrpxcqybx - 1 freq
MetaPhone code - SKRPS
scraps - 22 freq
scrapes - 10 freq
scrappies - 2 freq
skirps - 3 freq
scraeps - 1 freq
Manually curated - SCRAPES
Time to execute Levenshtein function - 0.256783 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
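As an illustration, the distance can be computed with the standard dynamic-programming method. This is a minimal Python sketch, not the code behind this page; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit cost for each replace, insert or delete."""
    # previous[j] = distance between the prefix of a handled so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("scrapes", "scrape"))   # one deletion
print(levenshtein("scrapes", "scrapit"))  # two replacements
```

Only two rows of the table are kept in memory at a time, which is enough because each cell depends only on the current and previous row.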
Time to execute Double Levenshtein function - 0.363988 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
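The combination described above might be sketched as follows in Python. This is an illustration under assumptions, not the site's code: the names `double_levenshtein` and `strip_vowels` are made up for the example, and the vowel set is a guess (counting "y" as a vowel appears consistent with the scores listed on this page, but that is not confirmed).

```python
VOWELS = set("aeiouy")  # assumption: the exact vowel set used by the site is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in VOWELS, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("scrapes", "scraps"))  # skeletons are identical, so only the full-word distance counts
print(double_levenshtein("scrapes", "scrape"))  # the missing "s" is counted in both passes
```

A difference that involves only vowels is counted once, while a consonant difference shows up in both passes and is counted twice.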
Time to execute SoundEx function - 0.027448 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
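For illustration, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent equal codes, and pad or cut to four characters. It is not the implementation used by this page, and details such as the handling of "h", "w" and punctuation are assumptions that may differ between implementations.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no ASCII letters."""
    # Assumption: punctuation, digits and non-ASCII characters are ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


print(soundex("scrapes"))
print(soundex("service"))
```

Because the vowels carry no digit, words as different as "scrapes" and "service" can share a code, which is why the SoundEx list is much longer than the others.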
Time to execute MetaPhone function - 0.037267 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000871 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
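A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are hypothetical examples invented for illustration and are not taken from the site's actual lexicon; the names `LEXICON` and `lemma_for` are likewise assumptions.

```python
from typing import Optional

# Hypothetical hand-made entries: surface form -> lemma.
LEXICON = {
    "scrapes": "scrape",
    "scrapit": "scrape",
    "scrapin": "scrape",
    "scraeps": "scrape",  # treated here as a spelling variant or typo
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not in the table."""
    return LEXICON.get(word.lower())


print(lemma_for("Scrapit"))  # found in the table
print(lemma_for("ahint"))    # not covered, so None
```

A dictionary lookup does no string comparison against the rest of the vocabulary, which fits the very short execution time reported for this method.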