A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bentos in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bentos (0) - 1 freq
bents (1) - 10 freq
bestow (2) - 1 freq
ectos (2) - 1 freq
kent's (2) - 2 freq
lents (2) - 1 freq
bettys (2) - 1 freq
ents (2) - 1 freq
beet's (2) - 2 freq
bends (2) - 13 freq
ments (2) - 1 freq
tents (2) - 22 freq
beuts (2) - 6 freq
beaton (2) - 35 freq
renton (2) - 4 freq
belt's (2) - 1 freq
gents (2) - 8 freq
sentor (2) - 1 freq
bingos (2) - 1 freq
buenos (2) - 1 freq
bens (2) - 47 freq
bert's (2) - 7 freq
fenton (2) - 8 freq
beit's (2) - 5 freq
becos (2) - 1 freq
bentos (0) - 1 freq
bents (1) - 10 freq
bants (2) - 5 freq
bunts (2) - 1 freq
bent (3) - 104 freq
bests (3) - 2 freq
bannos (3) - 1 freq
vents (3) - 1 freq
bnto (3) - 1 freq
beets (3) - 49 freq
rents (3) - 11 freq
gentis (3) - 1 freq
banjos (3) - 2 freq
cents (3) - 3 freq
berts (3) - 1 freq
belts (3) - 15 freq
pents (3) - 5 freq
ben's (3) - 2 freq
benks (3) - 2 freq
benkis (3) - 1 freq
bentaps (3) - 1 freq
beats (3) - 34 freq
mentis (3) - 1 freq
bets (3) - 8 freq
ments (3) - 1 freq
SoundEx code - B532
bunnets - 9 freq
benedict - 5 freq
boonds - 7 freq
bounds - 11 freq
bandaged - 2 freq
bonds - 9 freq
bends - 13 freq
bands - 41 freq
bents - 10 freq
bonnets - 5 freq
benediction - 4 freq
bounteous - 1 freq
binds - 7 freq
bondsmen - 2 freq
bunts - 1 freq
bondic - 1 freq
bennett's - 2 freq
bane-ticht - 1 freq
benedictus - 1 freq
baunds - 5 freq
biomedical - 1 freq
bondage - 2 freq
bands' - 1 freq
bunty's - 2 freq
benedictine - 1 freq
bounds' - 1 freq
band's - 1 freq
banties - 1 freq
bandage - 8 freq
bandwagon - 1 freq
bandages - 5 freq
baends - 1 freq
baundstaund - 1 freq
baundsmen - 1 freq
baundsman's - 1 freq
bounties - 2 freq
ben-the-hoose - 1 freq
baund's - 1 freq
bandies - 1 freq
bayonets - 3 freq
behinds - 1 freq
bentset - 1 freq
benedick - 3 freq
bandicoot - 1 freq
bond's - 1 freq
bent-shots - 1 freq
bandagin - 1 freq
bants - 5 freq
bentos - 1 freq
bandcamp - 1 freq
benthicorganism - 3 freq
bountys - 1 freq
'bants' - 1 freq
MetaPhone code - BNTS
bunnets - 9 freq
boonds - 7 freq
bounds - 11 freq
bonds - 9 freq
bends - 13 freq
bands - 41 freq
bents - 10 freq
bonnets - 5 freq
bounteous - 1 freq
binds - 7 freq
bunts - 1 freq
bennett's - 2 freq
baunds - 5 freq
bands' - 1 freq
bunty's - 2 freq
bounds' - 1 freq
band's - 1 freq
banties - 1 freq
baends - 1 freq
bounties - 2 freq
baund's - 1 freq
bandies - 1 freq
bond's - 1 freq
bants - 5 freq
bentos - 1 freq
bountys - 1 freq
'bants' - 1 freq
BENTOS
Time to execute Levenshtein function - 0.332683 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.582353 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029152 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.078791 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000901 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.