A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to barbar in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
barbar (0) - 29 freq
barbara (1) - 22 freq
barber (1) - 7 freq
barter (2) - 3 freq
farrar (2) - 1 freq
barbers (2) - 3 freq
wardar (2) - 1 freq
tarbat (2) - 1 freq
barman (2) - 25 freq
barbt (2) - 1 freq
barbra (2) - 1 freq
'barbara (2) - 1 freq
barca (2) - 1 freq
barbarik (2) - 2 freq
baar (2) - 6 freq
farar (2) - 1 freq
baba (2) - 2 freq
barbar's (2) - 5 freq
sarwar (2) - 3 freq
barbed (2) - 4 freq
braar (2) - 1 freq
baobab (2) - 5 freq
barbs (2) - 1 freq
barbie (2) - 3 freq
barker (2) - 1 freq
barbar (0) - 29 freq
barber (1) - 7 freq
barbara (1) - 22 freq
barbera (2) - 1 freq
barbra (2) - 1 freq
barbour (2) - 17 freq
barker (3) - 1 freq
braar (3) - 1 freq
barbed (3) - 4 freq
barbs (3) - 1 freq
barbie (3) - 3 freq
bruar (3) - 5 freq
burbury (3) - 2 freq
barra (3) - 27 freq
briar (3) - 1 freq
bamber (3) - 2 freq
barrae (3) - 2 freq
barr (3) - 15 freq
barbt (3) - 1 freq
barbers (3) - 3 freq
barter (3) - 3 freq
barbarik (3) - 2 freq
'barbara (3) - 1 freq
bomber (4) - 3 freq
bearer (4) - 7 freq
SoundEx code - B616
barbour - 17 freq
'barbara - 1 freq
barber's - 3 freq
barberin - 1 freq
barbara - 22 freq
bravery - 6 freq
berry-broune - 1 freq
bravehairt - 1 freq
braveheart - 19 freq
barbarity - 1 freq
barbarous - 3 freq
barbarities - 1 freq
bravehearts - 4 freq
'braveheart' - 1 freq
bravehearted - 1 freq
barber - 7 freq
barbers - 3 freq
barbarian - 2 freq
barbarian-governor - 1 freq
byre-brush - 1 freq
burberry - 2 freq
bribery - 6 freq
barbarismno - 1 freq
burbury - 2 freq
'burberry' - 1 freq
barbarically - 2 freq
barbwire - 1 freq
brave-heartit - 1 freq
braver - 4 freq
barbour's - 1 freq
barbar - 29 freq
barbar's - 5 freq
barbra - 1 freq
barbarians - 2 freq
breevery - 1 freq
burrafirt - 1 freq
bravehert - 2 freq
breviary - 1 freq
€˜braveheart - 1 freq
bravura - 1 freq
barbera - 1 freq
barbarella - 1 freq
burrafirth - 3 freq
barbarik - 2 freq
barbaradickson - 3 freq
barbering - 1 freq
barbaramcmahon - 1 freq
barbaranairn - 3 freq
barbarajmar - 6 freq
burrabears - 1 freq
barbarossaoz - 1 freq
MetaPhone code - BRBR
barbour - 17 freq
'barbara - 1 freq
barbara - 22 freq
barber - 7 freq
burberry - 2 freq
bribery - 6 freq
burbury - 2 freq
'burberry' - 1 freq
barbar - 29 freq
barbra - 1 freq
barbera - 1 freq
BARBAR
Time to execute Levenshtein function - 0.212209 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.352114 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027255 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037237 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000804 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.