A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to barbara in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
barbara (0) - 22 freq
'barbara (1) - 1 freq
barbar (1) - 29 freq
barbra (1) - 1 freq
barbera (1) - 1 freq
bandara (2) - 11 freq
barbar's (2) - 5 freq
barra (2) - 27 freq
barber (2) - 7 freq
barbers (2) - 3 freq
barbarian (2) - 2 freq
babaza (2) - 1 freq
barbarik (2) - 2 freq
barras (3) - 13 freq
braar (3) - 1 freq
basebaa (3) - 1 freq
yardarm (3) - 1 freq
gabbana (3) - 1 freq
aroart (3) - 1 freq
barr (3) - 16 freq
baars (3) - 1 freq
barcardi (3) - 1 freq
narraa (3) - 1 freq
farrar (3) - 1 freq
bakward (3) - 2 freq
barbara (0) - 22 freq
barbra (1) - 1 freq
barbera (1) - 1 freq
barbar (1) - 29 freq
'barbara (2) - 1 freq
barber (2) - 7 freq
barbour (3) - 17 freq
burbury (3) - 2 freq
barbarian (3) - 2 freq
barbarik (3) - 2 freq
barbers (3) - 3 freq
barra (3) - 27 freq
burra (4) - 62 freq
briar (4) - 1 freq
barter (4) - 3 freq
bamber (4) - 2 freq
brora (4) - 15 freq
barbed (4) - 4 freq
bribery (4) - 6 freq
barberin (4) - 1 freq
barbarous (4) - 3 freq
birra (4) - 9 freq
barbies (4) - 3 freq
barbarity (4) - 1 freq
barbwire (4) - 1 freq
SoundEx code - B616
barbour - 17 freq
'barbara - 1 freq
barber's - 3 freq
barberin - 1 freq
barbara - 22 freq
bravery - 6 freq
berry-broune - 1 freq
bravehairt - 1 freq
braveheart - 19 freq
barbarity - 1 freq
barbarous - 3 freq
barbarities - 1 freq
bravehearts - 4 freq
'braveheart' - 1 freq
bravehearted - 1 freq
barber - 7 freq
barbers - 3 freq
burberry - 3 freq
barbarian - 2 freq
barbarian-governor - 1 freq
byre-brush - 1 freq
bribery - 6 freq
barbarismno - 1 freq
burbury - 2 freq
'burberry' - 1 freq
barbarically - 2 freq
barbwire - 1 freq
brave-heartit - 1 freq
braver - 4 freq
barbour's - 1 freq
barbar - 29 freq
barbar's - 5 freq
barbra - 1 freq
barbarians - 2 freq
breevery - 1 freq
burrafirt - 1 freq
bravehert - 2 freq
breviary - 1 freq
€˜braveheart - 1 freq
bravura - 1 freq
barbera - 1 freq
barbarella - 1 freq
burrafirth - 3 freq
barbarik - 2 freq
barbaradickson - 3 freq
barbering - 1 freq
barbaramcmahon - 1 freq
barbaranairn - 3 freq
barbarajmar - 6 freq
burrabears - 1 freq
barbarossaoz - 1 freq
MetaPhone code - BRBR
barbour - 17 freq
'barbara - 1 freq
barbara - 22 freq
barber - 7 freq
burberry - 3 freq
bribery - 6 freq
burbury - 2 freq
'burberry' - 1 freq
barbar - 29 freq
barbra - 1 freq
barbera - 1 freq
BARBARA
Time to execute Levenshtein function - 0.284274 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.527605 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027910 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.072880 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000904 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.