A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to surveys in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
surveys (0) - 10 freq
survey (1) - 33 freq
surveyed (2) - 5 freq
turkeys (2) - 3 freq
purves (2) - 21 freq
purvey (2) - 20 freq
surrey (2) - 2 freq
surley (2) - 3 freq
turves (2) - 1 freq
surveyar (2) - 1 freq
servers (2) - 1 freq
sure's (2) - 3 freq
curves (2) - 10 freq
purvey's (2) - 1 freq
serves (2) - 26 freq
surveyin (2) - 5 freq
sarves (2) - 2 freq
purveyor (3) - 1 freq
susie's (3) - 4 freq
surfen (3) - 1 freq
duvets (3) - 1 freq
hurrays (3) - 1 freq
sumwey (3) - 1 freq
screes (3) - 1 freq
staeys (3) - 1 freq
surveys (0) - 10 freq
serves (2) - 26 freq
sarves (2) - 2 freq
survey (2) - 33 freq
sure's (3) - 3 freq
servers (3) - 1 freq
surveyin (3) - 5 freq
surveyar (3) - 1 freq
sairves (3) - 1 freq
curves (3) - 10 freq
turves (3) - 1 freq
surveyed (3) - 5 freq
purves (3) - 21 freq
sarve (4) - 10 freq
snuves (4) - 1 freq
stoves (4) - 3 freq
salves (4) - 2 freq
steves (4) - 2 freq
slaves (4) - 18 freq
solves (4) - 3 freq
scoves (4) - 1 freq
nerves (4) - 44 freq
saves (4) - 11 freq
scarves (4) - 7 freq
sairvers (4) - 1 freq
SoundEx code - S612
soor-faced - 4 freq
sherpest - 1 freq
surface - 86 freq
service - 198 freq
scraps - 22 freq
sherpshuiters - 1 freq
scrieves - 14 freq
services - 66 freq
serves - 26 freq
scrapes - 10 freq
shrubs - 4 freq
shairpest - 2 freq
scrubs - 6 freq
scribes - 7 freq
sharpek - 1 freq
scarf's - 1 freq
scarfs - 4 freq
scerfs - 1 freq
screives - 7 freq
servaice - 1 freq
sharpish - 5 freq
scrappies - 2 freq
sharpishly - 1 freq
surfaced - 3 freq
skirps - 3 freq
serbs - 2 freq
'service - 1 freq
servicin - 1 freq
sairvices - 5 freq
servicemen - 2 freq
services' - 1 freq
surpasst - 1 freq
surveys - 10 freq
serbo-croat - 1 freq
scrapbuik - 5 freq
squarepeg - 1 freq
scrabba's - 1 freq
sarves - 2 freq
scrap-buik - 1 freq
surfaces - 6 freq
scrabster - 5 freq
'surfs' - 1 freq
scarves - 7 freq
sarvice - 5 freq
sairvice - 7 freq
scraeps - 1 freq
surfiece - 1 freq
surfeece - 1 freq
seraphic - 1 freq
surfies - 2 freq
surpassin - 2 freq
surpass - 2 freq
scrovchlin - 1 freq
service' - 2 freq
skreives - 1 freq
skrieves - 3 freq
sairves - 1 freq
soor-pussed - 1 freq
sheriff's - 1 freq
sairvice-hyste - 1 freq
surpassed - 1 freq
sarvices - 5 freq
seerups - 1 freq
surpasses - 1 freq
servicemin - 1 freq
scrapbook - 1 freq
srfk - 1 freq
swarfega - 1 freq
sirbfac - 1 freq
surfacing - 1 freq
sherpish - 1 freq
swarovskioptik - 1 freq
sarahfstewart - 1 freq
szrpxcqybx - 1 freq
MetaPhone code - SRFS
surface - 86 freq
service - 198 freq
serves - 26 freq
scerfs - 1 freq
servaice - 1 freq
'service - 1 freq
surveys - 10 freq
sarves - 2 freq
'surfs' - 1 freq
sarvice - 5 freq
sairvice - 7 freq
surfiece - 1 freq
surfeece - 1 freq
surfies - 2 freq
service' - 2 freq
sairves - 1 freq
SURVEYS
Time to execute Levenshtein function - 0.222929 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.333477 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027671 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037795 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001089 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.