A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to surfaces in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
surfaces (0) - 6 freq
surface (1) - 85 freq
surfaced (1) - 3 freq
surfies (2) - 2 freq
furnaces (2) - 1 freq
surnames (2) - 4 freq
traces (3) - 6 freq
surname (3) - 12 freq
sources (3) - 24 freq
survives (3) - 9 freq
surfan (3) - 1 freq
terraces (3) - 4 freq
sumgates (3) - 1 freq
surpasses (3) - 1 freq
spaces (3) - 19 freq
surfeece (3) - 1 freq
graces (3) - 16 freq
surfacing (3) - 1 freq
saaces (3) - 1 freq
saracen (3) - 1 freq
resurfaced (3) - 2 freq
sarvices (3) - 5 freq
straes (3) - 3 freq
surfeits (3) - 1 freq
suffixes (3) - 2 freq
surfaces (0) - 6 freq
surfaced (2) - 3 freq
surface (2) - 85 freq
surfies (3) - 2 freq
sarvices (4) - 5 freq
surfeits (4) - 1 freq
services (4) - 66 freq
surfeece (4) - 1 freq
surfiece (4) - 1 freq
furnaces (4) - 1 freq
sources (4) - 24 freq
surnames (4) - 4 freq
fornaces (5) - 2 freq
sacrifices (5) - 3 freq
suffuses (5) - 1 freq
survives (5) - 9 freq
braces (5) - 12 freq
surfen (5) - 1 freq
sorraes (5) - 3 freq
surfan (5) - 1 freq
soorces (5) - 15 freq
secryfices (5) - 1 freq
orifices (5) - 1 freq
suffice (5) - 10 freq
resurface (5) - 1 freq
SoundEx code - S612
soor-faced - 4 freq
sherpest - 1 freq
surface - 85 freq
service - 196 freq
scraps - 22 freq
sherpshuiters - 1 freq
scrieves - 14 freq
services - 66 freq
serves - 26 freq
scrapes - 10 freq
shrubs - 4 freq
shairpest - 2 freq
scrubs - 6 freq
scribes - 7 freq
sharpek - 1 freq
scarf's - 1 freq
servaice - 1 freq
sharpish - 5 freq
scrappies - 2 freq
sharpishly - 1 freq
surfaced - 3 freq
skirps - 3 freq
serbs - 2 freq
'service - 1 freq
servicin - 1 freq
sairvices - 5 freq
servicemen - 2 freq
services' - 1 freq
surpasst - 1 freq
surveys - 10 freq
serbo-croat - 1 freq
scrapbuik - 5 freq
squarepeg - 1 freq
scrabba's - 1 freq
sarves - 2 freq
scrap-buik - 1 freq
surfaces - 6 freq
scrabster - 5 freq
'surfs' - 1 freq
scarves - 7 freq
sarvice - 5 freq
sairvice - 7 freq
scraeps - 1 freq
surfiece - 1 freq
surfeece - 1 freq
seraphic - 1 freq
scarfs - 3 freq
surfies - 2 freq
surpassin - 2 freq
surpass - 2 freq
scrovchlin - 1 freq
service' - 2 freq
skreives - 1 freq
screives - 6 freq
skrieves - 3 freq
sairves - 1 freq
soor-pussed - 1 freq
sheriff's - 1 freq
sairvice-hyste - 1 freq
surpassed - 1 freq
sarvices - 5 freq
seerups - 1 freq
surpasses - 1 freq
servicemin - 1 freq
scrapbook - 1 freq
srfk - 1 freq
swarfega - 1 freq
sirbfac - 1 freq
surfacing - 1 freq
sherpish - 1 freq
swarovskioptik - 1 freq
sarahfstewart - 1 freq
szrpxcqybx - 1 freq
MetaPhone code - SRFSS
services - 66 freq
sairvices - 5 freq
services' - 1 freq
surfaces - 6 freq
sarvices - 5 freq
SURFACES
Time to execute Levenshtein function - 0.192659 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.346216 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027633 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038601 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000824 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.