A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bents in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bents (0) - 10 freq
bests (1) - 2 freq
bens (1) - 47 freq
ents (1) - 1 freq
bets (1) - 8 freq
benks (1) - 2 freq
beets (1) - 49 freq
rents (1) - 11 freq
bunts (1) - 1 freq
lents (1) - 1 freq
belts (1) - 15 freq
ben's (1) - 2 freq
ments (1) - 1 freq
bent (1) - 104 freq
berts (1) - 1 freq
bants (1) - 5 freq
pents (1) - 5 freq
beuts (1) - 6 freq
bentos (1) - 1 freq
bends (1) - 13 freq
tents (1) - 22 freq
vents (1) - 1 freq
beats (1) - 34 freq
cents (1) - 3 freq
gents (1) - 8 freq
bents (0) - 10 freq
bunts (1) - 1 freq
bentos (1) - 1 freq
bants (1) - 5 freq
berts (2) - 1 freq
beuts (2) - 6 freq
pents (2) - 5 freq
bends (2) - 13 freq
gents (2) - 8 freq
cents (2) - 3 freq
beats (2) - 34 freq
vents (2) - 1 freq
bent (2) - 104 freq
tents (2) - 22 freq
bets (2) - 8 freq
bens (2) - 47 freq
ments (2) - 1 freq
bests (2) - 2 freq
benks (2) - 2 freq
ents (2) - 1 freq
ben's (2) - 2 freq
belts (2) - 15 freq
lents (2) - 1 freq
rents (2) - 11 freq
beets (2) - 49 freq
SoundEx code - B532
bunnets - 9 freq
benedict - 5 freq
boonds - 7 freq
bounds - 11 freq
bandaged - 2 freq
bonds - 9 freq
bends - 13 freq
bands - 41 freq
bents - 10 freq
bonnets - 5 freq
benediction - 4 freq
bounteous - 1 freq
binds - 7 freq
bondsmen - 2 freq
bunts - 1 freq
bondic - 1 freq
bennett's - 2 freq
bane-ticht - 1 freq
benedictus - 1 freq
baunds - 5 freq
biomedical - 1 freq
bondage - 2 freq
bands' - 1 freq
bunty's - 2 freq
benedictine - 1 freq
bounds' - 1 freq
band's - 1 freq
banties - 1 freq
bandage - 8 freq
bandwagon - 1 freq
bandages - 5 freq
baends - 1 freq
baundstaund - 1 freq
baundsmen - 1 freq
baundsman's - 1 freq
bounties - 2 freq
ben-the-hoose - 1 freq
baund's - 1 freq
bandies - 1 freq
bayonets - 3 freq
behinds - 1 freq
bentset - 1 freq
benedick - 3 freq
bandicoot - 1 freq
bond's - 1 freq
bent-shots - 1 freq
bandagin - 1 freq
bants - 5 freq
bentos - 1 freq
bandcamp - 1 freq
benthicorganism - 3 freq
bountys - 1 freq
'bants' - 1 freq
MetaPhone code - BNTS
bunnets - 9 freq
boonds - 7 freq
bounds - 11 freq
bonds - 9 freq
bends - 13 freq
bands - 41 freq
bents - 10 freq
bonnets - 5 freq
bounteous - 1 freq
binds - 7 freq
bunts - 1 freq
bennett's - 2 freq
baunds - 5 freq
bands' - 1 freq
bunty's - 2 freq
bounds' - 1 freq
band's - 1 freq
banties - 1 freq
baends - 1 freq
bounties - 2 freq
baund's - 1 freq
bandies - 1 freq
bond's - 1 freq
bants - 5 freq
bentos - 1 freq
bountys - 1 freq
'bants' - 1 freq
BENTS
Time to execute Levenshtein function - 0.222938 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.342193 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027711 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.043881 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000866 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.