A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to scraps in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
scraps (0) - 22 freq
scrans (1) - 3 freq
scrap (1) - 33 freq
scraeps (1) - 1 freq
straps (1) - 9 freq
craps (1) - 16 freq
scrapes (1) - 10 freq
scrape (1) - 30 freq
scrapy (1) - 1 freq
scap (2) - 2 freq
soaps (2) - 4 freq
caps (2) - 12 freq
chaps (2) - 24 freq
sara's (2) - 1 freq
crams (2) - 2 freq
scraped (2) - 14 freq
schaws (2) - 2 freq
straes (2) - 3 freq
script (2) - 30 freq
claps (2) - 5 freq
scrip (2) - 3 freq
staps (2) - 36 freq
scrat (2) - 31 freq
scrows (2) - 1 freq
scheps (2) - 2 freq

Double Levenshtein
scraps (0) - 22 freq
scrapes (1) - 10 freq
scraeps (1) - 1 freq
scrapy (2) - 1 freq
craps (2) - 16 freq
scrape (2) - 30 freq
straps (2) - 9 freq
scrans (2) - 3 freq
scrap (2) - 33 freq
scuips (3) - 1 freq
scrapan (3) - 2 freq
scrumps (3) - 1 freq
crips (3) - 1 freq
strips (3) - 28 freq
scrapit (3) - 10 freq
schips (3) - 1 freq
scrums (3) - 1 freq
scraper (3) - 2 freq
scroos (3) - 1 freq
scripts (3) - 3 freq
scrogs (3) - 4 freq
scrubs (3) - 6 freq
scrappy (3) - 4 freq
scraep (3) - 1 freq
scoops (3) - 1 freq
SoundEx code - S612
soor-faced - 4 freq
sherpest - 1 freq
surface - 86 freq
service - 198 freq
scraps - 22 freq
sherpshuiters - 1 freq
scrieves - 14 freq
services - 66 freq
serves - 26 freq
scrapes - 10 freq
shrubs - 4 freq
shairpest - 2 freq
scrubs - 6 freq
scribes - 7 freq
sharpek - 1 freq
scarf's - 1 freq
scarfs - 4 freq
scerfs - 1 freq
screives - 7 freq
servaice - 1 freq
sharpish - 5 freq
scrappies - 2 freq
sharpishly - 1 freq
surfaced - 3 freq
skirps - 3 freq
serbs - 2 freq
'service - 1 freq
servicin - 1 freq
sairvices - 5 freq
servicemen - 2 freq
services' - 1 freq
surpasst - 1 freq
surveys - 10 freq
serbo-croat - 1 freq
scrapbuik - 5 freq
squarepeg - 1 freq
scrabba's - 1 freq
sarves - 2 freq
scrap-buik - 1 freq
surfaces - 6 freq
scrabster - 5 freq
'surfs' - 1 freq
scarves - 7 freq
sarvice - 5 freq
sairvice - 7 freq
scraeps - 1 freq
surfiece - 1 freq
surfeece - 1 freq
seraphic - 1 freq
surfies - 2 freq
surpassin - 2 freq
surpass - 2 freq
scrovchlin - 1 freq
service' - 2 freq
skreives - 1 freq
skrieves - 3 freq
sairves - 1 freq
soor-pussed - 1 freq
sheriff's - 1 freq
sairvice-hyste - 1 freq
surpassed - 1 freq
sarvices - 5 freq
seerups - 1 freq
surpasses - 1 freq
servicemin - 1 freq
scrapbook - 1 freq
srfk - 1 freq
swarfega - 1 freq
sirbfac - 1 freq
surfacing - 1 freq
sherpish - 1 freq
swarovskioptik - 1 freq
sarahfstewart - 1 freq
szrpxcqybx - 1 freq
MetaPhone code - SKRPS
scraps - 22 freq
scrapes - 10 freq
scrappies - 2 freq
skirps - 3 freq
scraeps - 1 freq
Manually curated - SCRAPS
Time to execute Levenshtein function - 0.410767 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
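As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming Levenshtein distance. It is not the site's own code; the function name `levenshtein` is chosen for this example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions or
    deletions needed to turn a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("scraps", "scrans", "scrap", "script", "chaps"):
        print(word, levenshtein("scraps", word))
```

Run against a few words from the Levenshtein list, this gives the same distances as shown there (for example 1 for scrans and 2 for script).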
Time to execute Double Levenshtein function - 0.975460 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
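The combination described above can be sketched in Python as follows. This is an illustration, not the site's implementation: the names `strip_vowels` and `double_levenshtein` are invented here, and the vowel set (a, e, i, o, u and y) is an assumption, chosen because it reproduces the distances listed on this page for scrapy (2) and scrappy (3).

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y is treated as a vowel; the page does not state the vowel set.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Return the word with all (assumed) vowels removed."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("scrapes", "scraeps", "scrapy", "scrap", "scuips", "scrappy"):
        print(word, double_levenshtein("scraps", word))
```

A vowel-only change such as scraps to scrapes costs 1 (1 + 0), while losing a consonant, as in scraps to scrap, costs 2 (1 + 1), which is how consonants end up weighted double.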
Time to execute SoundEx function - 0.027897 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
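For illustration, a minimal Python sketch of the classic American Soundex coding follows. It is an assumption that the site uses this variant; the sketch ignores anything that is not a letter a to z (hyphens, apostrophes), which matches entries such as soor-faced appearing under S612.

```python
# Letter groups of classic Soundex; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        # Emit a digit only when it differs from the previous letter's digit.
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        # h and w do not separate two letters that share a digit; vowels do.
        if ch not in "hw":
            last = digit
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ("scraps", "service", "soor-faced", "sheriff's", "scrap"):
        print(word, soundex(word))
```

Because only the first letter and three digits are kept, unrelated words such as service and swarovskioptik share the code S612 with scraps, which explains the length of the SoundEx list.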
Time to execute MetaPhone function - 0.167811 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000938 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
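A lookup of this kind can be sketched as a plain dictionary. The table below is hypothetical: its entries, the name `LEMMA_TABLE` and the function `lookup_lemma` are invented for illustration and are not taken from the site's lexicon. Only the pairing of scraps with SCRAPS is suggested by the output on this page, and even that is an assumption.

```python
from typing import Optional

# Hypothetical, illustrative entries only; the real lexicon is curated by hand.
LEMMA_TABLE = {
    "scraps": "SCRAPS",   # assumed from the result shown on this page
    "scraeps": "SCRAPS",  # hypothetical spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEMMA_TABLE.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("Scraps"))   # covered in this toy table
    print(lookup_lemma("ahint"))    # not covered here, so None
```

A dictionary lookup involves no string comparison against the rest of the corpus, which fits the very short execution time reported for this method.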