A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to barbour in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
barbour (0) - 17 freq
harbour (1) - 49 freq
laubour (2) - 2 freq
barroun (2) - 1 freq
balfour (2) - 50 freq
ardour (2) - 1 freq
bordour (2) - 1 freq
labour (2) - 276 freq
herbour (2) - 23 freq
barber (2) - 7 freq
harbours (2) - 2 freq
barbar (2) - 29 freq
balmour (2) - 1 freq
wardour (2) - 1 freq
hairbour (2) - 7 freq
harbour' (2) - 1 freq
parlour (2) - 17 freq
armour (2) - 10 freq
barbour's (2) - 1 freq
lawbour (2) - 5 freq
faavour (3) - 1 freq
harboured (3) - 1 freq
barton (3) - 3 freq
careous (3) - 1 freq
barbie (3) - 3 freq
Double Levenshtein
barbour (0) - 17 freq
barber (2) - 7 freq
harbour (2) - 49 freq
barbar (2) - 29 freq
hairbour (3) - 7 freq
barbara (3) - 22 freq
barbra (3) - 1 freq
burbury (3) - 2 freq
barbera (3) - 1 freq
herbour (3) - 23 freq
bordour (3) - 1 freq
barr (4) - 16 freq
herbeur (4) - 1 freq
barbed (4) - 4 freq
barbt (4) - 1 freq
herbor (4) - 1 freq
barbecue (4) - 8 freq
belabour (4) - 1 freq
barrier (4) - 13 freq
barbies (4) - 3 freq
barter (4) - 3 freq
barker (4) - 1 freq
barbers (4) - 3 freq
baronboar (4) - 1 freq
brur (4) - 23 freq
SoundEx code - B616
barbour - 17 freq
'barbara - 1 freq
barber's - 3 freq
barberin - 1 freq
barbara - 22 freq
bravery - 6 freq
berry-broune - 1 freq
bravehairt - 1 freq
braveheart - 19 freq
barbarity - 1 freq
barbarous - 3 freq
barbarities - 1 freq
bravehearts - 4 freq
'braveheart' - 1 freq
bravehearted - 1 freq
barber - 7 freq
barbers - 3 freq
burberry - 3 freq
barbarian - 2 freq
barbarian-governor - 1 freq
byre-brush - 1 freq
bribery - 6 freq
barbarismno - 1 freq
burbury - 2 freq
'burberry' - 1 freq
barbarically - 2 freq
barbwire - 1 freq
brave-heartit - 1 freq
braver - 4 freq
barbour's - 1 freq
barbar - 29 freq
barbar's - 5 freq
barbra - 1 freq
barbarians - 2 freq
breevery - 1 freq
burrafirt - 1 freq
bravehert - 2 freq
breviary - 1 freq
‘braveheart - 1 freq
bravura - 1 freq
barbera - 1 freq
barbarella - 1 freq
burrafirth - 3 freq
barbarik - 2 freq
barbaradickson - 3 freq
barbering - 1 freq
barbaramcmahon - 1 freq
barbaranairn - 3 freq
barbarajmar - 6 freq
burrabears - 1 freq
barbarossaoz - 1 freq
MetaPhone code - BRBR
barbour - 17 freq
'barbara - 1 freq
barbara - 22 freq
barber - 7 freq
burberry - 3 freq
bribery - 6 freq
burbury - 2 freq
'burberry' - 1 freq
barbar - 29 freq
barbra - 1 freq
barbera - 1 freq
Manually curated
BARBOUR
Time to execute Levenshtein function - 0.184248 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
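The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach with unit costs, not necessarily the routine this site runs; the function name `levenshtein` is an assumption chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insert, delete and replace."""
    # previous[j] is the distance between the prefix of a handled so far and b[:j].
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep a match)
            ))
        previous = current
    return previous[-1]


print(levenshtein("barbour", "harbour"))  # 1
print(levenshtein("barbour", "barber"))   # 2
```

Both values agree with the distances shown in brackets in the Levenshtein results for barbour.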
Time to execute Double Levenshtein function - 0.348450 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
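A rough sketch of that idea follows, assuming the vowels are the five letters a, e, i, o and u; the page does not say exactly which letters are stripped, so `VOWELS`, `strip_vowels` and `double_levenshtein` are hypothetical names and the site's scores may differ for words containing letters such as y.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Unit-cost edit distance between two strings."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep every character that is not in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("barbour", "barber"))    # 2 (2 + 0)
print(double_levenshtein("barbour", "harbour"))   # 2 (1 + 1)
print(double_levenshtein("barbour", "hairbour"))  # 3 (2 + 1)
```

Because barber keeps the consonant skeleton "brbr", it scores the same as harbour here even though its plain Levenshtein distance is larger, which matches the ordering in the Double Levenshtein results.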
Time to execute SoundEx function - 0.027350 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
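As an illustration, here is a minimal sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent letters with the same digit, and pad or truncate to four characters. It is an assumption that the site uses this exact variant; the names `CODES` and `soundex` are chosen for the example.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets this, so a repeated digit counts again
    return code.ljust(4, "0")


print(soundex("barbour"))  # B616
print(soundex("bravery"))  # B616
print(soundex("bribery"))  # B616
```

All three words share the code B616 shown in the SoundEx results, which is why they are listed as neighbours despite quite different spellings.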
Time to execute MetaPhone function - 0.037087 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000893 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
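A lookup of this kind could be sketched as a plain dictionary from spelling to lemma. The entries below are illustrative only and are not taken from the site's actual lexicon; `LEXICON`, `lemma_for` and `variants_of` are hypothetical names.

```python
from typing import List, Optional

# Hypothetical hand-made entries: each spelling maps to an upper-case lemma.
LEXICON = {
    "barbour": "BARBOUR",
    "barbour's": "BARBOUR",
    "harbour": "HARBOUR",
    "herbour": "HARBOUR",
    "hairbour": "HARBOUR",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the lemma for a word, or None when the lexicon has no entry."""
    return LEXICON.get(word.lower())


def variants_of(lemma: str) -> List[str]:
    """Return every spelling in the lexicon that shares the given lemma."""
    return sorted(w for w, target in LEXICON.items() if target == lemma)


print(lemma_for("Barbour's"))  # BARBOUR
print(lemma_for("ablow"))      # None, because this sample table does not cover it
print(variants_of("HARBOUR"))  # ['hairbour', 'harbour', 'herbour']
```

A dictionary lookup takes constant time on average, which is consistent with this method being the fastest of the five, but its coverage is limited to whatever has been entered by hand.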