A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to barber in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
barber (0) - 7 freq
barker (1) - 1 freq
barbers (1) - 3 freq
barbera (1) - 1 freq
bamber (1) - 2 freq
barter (1) - 3 freq
barbar (1) - 29 freq
barbed (1) - 4 freq
darker (2) - 9 freq
warbler (2) - 1 freq
barbs (2) - 1 freq
batter (2) - 68 freq
burger (2) - 13 freq
caber (2) - 4 freq
barbara (2) - 22 freq
barrel (2) - 40 freq
barber's (2) - 3 freq
marker (2) - 11 freq
barnet (2) - 3 freq
carer (2) - 10 freq
barney (2) - 25 freq
barke (2) - 1 freq
barges (2) - 2 freq
herber (2) - 5 freq
baxter (2) - 16 freq
Double Levenshtein
barber (0) - 7 freq
barbar (1) - 29 freq
barbera (1) - 1 freq
barbed (2) - 4 freq
barbara (2) - 22 freq
barbour (2) - 17 freq
barbra (2) - 1 freq
barter (2) - 3 freq
bamber (2) - 2 freq
barbers (2) - 3 freq
barker (2) - 1 freq
brer (3) - 2 freq
bairbed (3) - 1 freq
boarder (3) - 5 freq
bearer (3) - 7 freq
brier (3) - 5 freq
barbies (3) - 3 freq
hairber (3) - 35 freq
barberin (3) - 1 freq
braer (3) - 1 freq
barr (3) - 15 freq
i'barber (3) - 1 freq
burbury (3) - 2 freq
barbt (3) - 1 freq
border (3) - 145 freq
SoundEx code - B616
barbour - 17 freq
'barbara - 1 freq
barber's - 3 freq
barberin - 1 freq
barbara - 22 freq
bravery - 6 freq
berry-broune - 1 freq
bravehairt - 1 freq
braveheart - 19 freq
barbarity - 1 freq
barbarous - 3 freq
barbarities - 1 freq
bravehearts - 4 freq
'braveheart' - 1 freq
bravehearted - 1 freq
barber - 7 freq
barbers - 3 freq
barbarian - 2 freq
barbarian-governor - 1 freq
byre-brush - 1 freq
burberry - 2 freq
bribery - 6 freq
barbarismno - 1 freq
burbury - 2 freq
'burberry' - 1 freq
barbarically - 2 freq
barbwire - 1 freq
brave-heartit - 1 freq
braver - 4 freq
barbour's - 1 freq
barbar - 29 freq
barbar's - 5 freq
barbra - 1 freq
barbarians - 2 freq
breevery - 1 freq
burrafirt - 1 freq
bravehert - 2 freq
breviary - 1 freq
‘braveheart - 1 freq
bravura - 1 freq
barbera - 1 freq
barbarella - 1 freq
burrafirth - 3 freq
barbarik - 2 freq
barbaradickson - 3 freq
barbering - 1 freq
barbaramcmahon - 1 freq
barbaranairn - 3 freq
barbarajmar - 6 freq
burrabears - 1 freq
barbarossaoz - 1 freq
MetaPhone code - BRBR
barbour - 17 freq
'barbara - 1 freq
barbara - 22 freq
barber - 7 freq
burberry - 2 freq
bribery - 6 freq
burbury - 2 freq
'burberry' - 1 freq
barbar - 29 freq
barbra - 1 freq
barbera - 1 freq
Manually curated - BARBER
Time to execute Levenshtein function - 0.176856 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
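As an illustration, the distance can be computed with the usual dynamic-programming table, kept one row at a time. This is a minimal sketch in Python, not the code this site runs; the function name levenshtein is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b, or keep it
            ))
        previous = current
    return previous[-1]


for word in ("barber", "barker", "barbers", "caber"):
    print(word, levenshtein("barber", word))
```

The four words printed come out at distances 0, 1, 1 and 2, matching the figures in parentheses in the Levenshtein list for barber.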
Time to execute Double Levenshtein function - 0.355129 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
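A rough sketch of that idea in Python follows. The names double_levenshtein and strip_vowels are made up for this example, and the vowel set is an assumption: treating y as a vowel alongside a, e, i, o and u is what reproduces the scores listed for barber (for instance burbury at 3), but the site's actual vowel list is not stated.

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one table row at a time."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and any other characters."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("barbar", "barbed", "barbara", "border"):
    print(word, double_levenshtein("barber", word))
```

Under these assumptions barbar scores 1 (one vowel changed, consonants identical) while barbed scores 2 (one consonant changed, counted in both passes), which is why barbed sits lower in the Double Levenshtein list than in the plain one.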
Time to execute SoundEx function - 0.027077 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
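The encoding can be sketched as below, following the standard American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad or cut to four characters. This is an illustrative Python version, not the function the site calls; dropping non-letters such as apostrophes and hyphens before encoding is an assumption made here.

```python
# Digit assigned to each consonant group in standard Soundex.
CODES = {}
for digit, letters in enumerate(("bfpv", "cgjkqsxz", "dt", "l", "mn", "r"), start=1):
    for letter in letters:
        CODES[letter] = str(digit)


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Assumption: anything that is not a-z (apostrophes, hyphens) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    encoded = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        code = CODES.get(ch, "")
        if code and code != previous:
            encoded += code
        previous = code  # a vowel resets this, so a repeat after it counts
    return (encoded + "000")[:4]


for word in ("barber", "bravery", "braveheart", "byre-brush"):
    print(word, soundex(word))
```

All four words encode to B616, which is why words as different as barber and braveheart share the SoundEx list.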
Time to execute MetaPhone function - 0.036610 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000791 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
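In code, a lookup of this kind amounts to a dictionary from word forms to lemmas. The sketch below is in Python and is only illustrative: apart from barber mapping to BARBER, which is the result shown for this search, the entries are hypothetical and are not taken from the site's lexicon, and the name lemma_for is invented for the example.

```python
from typing import Optional

# Word form -> lemma. Only the "barber" entry reflects the result above;
# the other entries are hypothetical examples of variants and typos.
LEXICON = {
    "barber": "BARBER",
    "barbers": "BARBER",  # hypothetical inflected form
    "barbar": "BARBER",   # hypothetical spelling variation
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Barber"))
print(lemma_for("warbler"))
```

Because the lookup is a single dictionary access, it is far quicker than the computed methods, at the price of returning nothing for words nobody has entered.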