A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to barber in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
barber (0) - 7 freq
barbers (1) - 3 freq
barbera (1) - 1 freq
barter (1) - 3 freq
barker (1) - 1 freq
barbar (1) - 29 freq
bamber (1) - 2 freq
barbed (1) - 4 freq
barbour (2) - 17 freq
farter (2) - 1 freq
barber's (2) - 3 freq
babes (2) - 6 freq
bardet (2) - 1 freq
burger (2) - 13 freq
barbie (2) - 3 freq
bared (2) - 6 freq
i'barber (2) - 1 freq
barbra (2) - 1 freq
aaber (2) - 22 freq
barred (2) - 15 freq
babe (2) - 11 freq
bare (2) - 175 freq
warner (2) - 1 freq
barges (2) - 2 freq
barge (2) - 15 freq
Double Levenshtein (distance in brackets)
barber (0) - 7 freq
barbar (1) - 29 freq
barbera (1) - 1 freq
barbra (2) - 1 freq
barbara (2) - 22 freq
barbour (2) - 17 freq
barbed (2) - 4 freq
barter (2) - 3 freq
barbers (2) - 3 freq
barker (2) - 1 freq
bamber (2) - 2 freq
braer (3) - 1 freq
brier (3) - 5 freq
burner (3) - 3 freq
barbies (3) - 3 freq
brer (3) - 2 freq
barbs (3) - 1 freq
bieber (3) - 1 freq
hairber (3) - 35 freq
birker (3) - 1 freq
boarder (3) - 5 freq
burker (3) - 2 freq
breer (3) - 11 freq
bribery (3) - 6 freq
burbury (3) - 2 freq
SoundEx code - B616
barbour - 17 freq
'barbara - 1 freq
barber's - 3 freq
barberin - 1 freq
barbara - 22 freq
bravery - 6 freq
berry-broune - 1 freq
bravehairt - 1 freq
braveheart - 19 freq
barbarity - 1 freq
barbarous - 3 freq
barbarities - 1 freq
bravehearts - 4 freq
'braveheart' - 1 freq
bravehearted - 1 freq
barber - 7 freq
barbers - 3 freq
burberry - 3 freq
barbarian - 2 freq
barbarian-governor - 1 freq
byre-brush - 1 freq
bribery - 6 freq
barbarismno - 1 freq
burbury - 2 freq
'burberry' - 1 freq
barbarically - 2 freq
barbwire - 1 freq
brave-heartit - 1 freq
braver - 4 freq
barbour's - 1 freq
barbar - 29 freq
barbar's - 5 freq
barbra - 1 freq
barbarians - 2 freq
breevery - 1 freq
burrafirt - 1 freq
bravehert - 2 freq
breviary - 1 freq
‘braveheart - 1 freq
bravura - 1 freq
barbera - 1 freq
barbarella - 1 freq
burrafirth - 3 freq
barbarik - 2 freq
barbaradickson - 3 freq
barbering - 1 freq
barbaramcmahon - 1 freq
barbaranairn - 3 freq
barbarajmar - 6 freq
burrabears - 1 freq
barbarossaoz - 1 freq
MetaPhone code - BRBR
barbour - 17 freq
'barbara - 1 freq
barbara - 22 freq
barber - 7 freq
burberry - 3 freq
bribery - 6 freq
burbury - 2 freq
'burberry' - 1 freq
barbar - 29 freq
barbra - 1 freq
barbera - 1 freq
Manually curated - BARBER
Time to execute Levenshtein function - 0.184810 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
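As a rough illustration of the definition above, here is a minimal Python sketch of the distance and of a nearest-neighbour lookup. The site's own code is not shown on this page, so the function names `levenshtein` and `nearest`, the `max_distance` cut-off and the tiny sample word list are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # 'previous' is the row of distances between the first i-1 characters
    # of a and every prefix of b; it starts as the row for the empty prefix.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, free if equal
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, max_distance=2):
    """Return (distance, word) pairs within max_distance, closest first."""
    scored = [(levenshtein(word, w), w) for w in vocabulary]
    return sorted((d, w) for d, w in scored if d <= max_distance)


# Hypothetical sample vocabulary, using words that appear in the listing.
sample = ["barbers", "barter", "barbour", "bare", "braveheart"]
print(nearest("barber", sample))
```

This version keeps only two rows of the table in memory, which is enough when only the distance (not the edit path) is wanted.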
Time to execute Double Levenshtein function - 0.347026 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
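A minimal Python sketch of this two-pass idea follows. The names `strip_vowels` and `double_levenshtein` are made up for the example, and the vowel set is an assumption: the page does not say which letters count as vowels, and `aeiouy` is used here only because it reproduces the distances in the listing above (for instance bribery at 3).

```python
VOWELS = set("aeiouy")  # assumption: the site's actual vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one table row at a time."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for other in ["barbar", "barbers", "barbara", "bribery"]:
    print(other, double_levenshtein("barber", other))
```

A vowel-only change such as barber to barbar costs 1 in the first pass and 0 in the second, while a consonant change such as barber to barbers costs 1 in both, which is where the double weighting comes from.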
Time to execute SoundEx function - 0.027151 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
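For illustration, here is a short Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad or cut to four characters. Whether the site uses exactly this variant is an assumption, and the function name `soundex` is chosen for the example only.

```python
def soundex(word: str) -> str:
    """Return a four-character Soundex code such as B616, or "" if no letters."""
    # Letters that sound alike share a digit; vowels, h, w and y have no digit.
    codes = {}
    for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in group:
            codes[ch] = digit

    # Ignore apostrophes, hyphens and other non-letters.
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same digit
        digit = codes.get(ch, "")
        if digit and digit != previous:
            result += digit
        previous = digit  # a vowel resets this, so a repeat after it is kept
    return (result + "000")[:4]


for w in ["barber", "barbour", "bravery", "braveheart"]:
    print(w, soundex(w))
```

Because the code is cut to four characters, long words such as braveheart land in the same bucket as barber, which explains the loose matches in the SoundEx list.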
Time to execute MetaPhone function - 0.044754 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000877 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
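A lookup of this kind can be sketched as a plain dictionary. The real lexicon is not shown on this page, so the entries below are hypothetical apart from barber mapping to BARBER, which is the result displayed above; the names `LEXICON` and `lemma_for` are likewise assumptions.

```python
from typing import Optional

# Hypothetical hand-made lexicon: surface form -> lemma.
LEXICON = {
    "barber": "BARBER",
    "barbers": "BARBER",   # illustrative entry, not taken from the site
    "barber's": "BARBER",  # illustrative entry, not taken from the site
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("barber"))
print(lemma_for("sonsie"))
```

A dictionary lookup does no string comparison beyond hashing the key, which fits with this method having the shortest execution time of the five.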