A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bar in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bar (0) - 493 freq
bab (1) - 13 freq
bap (1) - 16 freq
ban (1) - 48 freq
aar (1) - 5 freq
iar (1) - 2 freq
bag (1) - 334 freq
bay (1) - 82 freq
bad (1) - 949 freq
bas (1) - 9 freq
bat (1) - 50 freq
baer (1) - 2 freq
bare (1) - 175 freq
rar (1) - 1 freq
brr (1) - 5 freq
bax (1) - 2 freq
bark (1) - 34 freq
bah (1) - 4 freq
gar (1) - 162 freq
car (1) - 413 freq
baq (1) - 1 freq
ear (1) - 143 freq
dar (1) - 86 freq
bak (1) - 376 freq
baf (1) - 1 freq
bar (0) - 493 freq
bor (1) - 4 freq
baar (1) - 6 freq
boar (1) - 20 freq
baur (1) - 67 freq
bear (1) - 217 freq
bir (1) - 2 freq
br (1) - 4 freq
bair (1) - 1 freq
bur (1) - 8 freq
bare (1) - 175 freq
ber (1) - 4 freq
baer (1) - 2 freq
boru (2) - 6 freq
byro (2) - 1 freq
bray (2) - 1 freq
ar (2) - 208 freq
yar (2) - 3 freq
ebr (2) - 1 freq
bai (2) - 10 freq
biro (2) - 6 freq
ba (2) - 140 freq
boer (2) - 3 freq
war (2) - 1446 freq
bury (2) - 24 freq
SoundEx code - B600
brae - 285 freq
braw - 1237 freq
bare - 175 freq
bury - 24 freq
bra - 31 freq
borrow - 16 freq
bar - 493 freq
barrae - 2 freq
bear - 217 freq
be'-or - 3 freq
bree - 58 freq
brou - 29 freq
bourie - 5 freq
buiry - 4 freq
boure - 1 freq
bewaur - 15 freq
bour - 1 freq
beir - 18 freq
bere - 13 freq
bheur - 1 freq
beerie - 7 freq
bore - 41 freq
byre - 111 freq
burrae - 1 freq
beer - 142 freq
brew - 31 freq
beery - 8 freq
'bear - 1 freq
bur - 8 freq
bor - 4 freq
bru - 28 freq
broo - 164 freq
buyer - 8 freq
bray - 1 freq
baur - 67 freq
baira - 1 freq
bree-ee - 8 freq
'braw - 12 freq
be'er - 1 freq
boer - 3 freq
buroo - 10 freq
boar - 20 freq
buir - 10 freq
burrow - 7 freq
berry - 17 freq
barra - 27 freq
braa - 60 freq
bir - 2 freq
barrie - 15 freq
boru - 6 freq
borrae - 10 freq
bair - 1 freq
bier - 3 freq
behere - 2 freq
bu'er - 2 freq
barry - 34 freq
'braw' - 1 freq
beware - 5 freq
bower - 10 freq
bouer - 18 freq
brow - 10 freq
burroo - 2 freq
bro - 7 freq
brah - 4 freq
baer - 2 freq
brey - 1 freq
birr - 24 freq
bear' - 1 freq
burie - 4 freq
borra - 2 freq
borra' - 1 freq
ber - 4 freq
burra - 62 freq
biro - 6 freq
böre - 1 freq
brö - 1 freq
broo' - 1 freq
beirie - 1 freq
bure - 3 freq
brø - 1 freq
bær - 1 freq
berr - 2 freq
burry - 1 freq
burro - 1 freq
bureau - 11 freq
brio - 1 freq
baar - 6 freq
bewaar - 1 freq
burou - 1 freq
borroo - 1 freq
brahe - 1 freq
brae' - 1 freq
barr - 16 freq
brie - 3 freq
burr - 2 freq
birra - 9 freq
über - 1 freq
€˜braw - 2 freq
brooie - 1 freq
birry - 2 freq
borr - 1 freq
brrrr - 2 freq
brrr - 2 freq
bu---r - 1 freq
€˜barrie - 1 freq
€žbrae' - 1 freq
€˜brew - 1 freq
bawhair - 1 freq
bah-yerr - 1 freq
byro - 1 freq
“bier” - 1 freq
bauer - 1 freq
br - 4 freq
brewÂ’ - 1 freq
brr - 5 freq
brrrrr - 1 freq
brrrrrrrr - 2 freq
brawwee - 1 freq
borh - 1 freq
bri - 2 freq
'bru - 1 freq
berry' - 1 freq
MetaPhone code - BR
brae - 285 freq
braw - 1237 freq
bare - 175 freq
bury - 24 freq
bra - 31 freq
borrow - 16 freq
bar - 493 freq
barrae - 2 freq
bear - 217 freq
be'-or - 3 freq
bree - 58 freq
brou - 29 freq
bourie - 5 freq
buiry - 4 freq
boure - 1 freq
bour - 1 freq
beir - 18 freq
bere - 13 freq
beerie - 7 freq
bore - 41 freq
byre - 111 freq
burrae - 1 freq
beer - 142 freq
brew - 31 freq
beery - 8 freq
'bear - 1 freq
burgh - 18 freq
bur - 8 freq
bor - 4 freq
bru - 28 freq
broo - 164 freq
bray - 1 freq
baur - 67 freq
baira - 1 freq
bree-ee - 8 freq
'braw - 12 freq
be'er - 1 freq
boer - 3 freq
buroo - 10 freq
boar - 20 freq
buir - 10 freq
burrow - 7 freq
berry - 17 freq
barra - 27 freq
braa - 60 freq
bir - 2 freq
barrie - 15 freq
boru - 6 freq
borrae - 10 freq
bair - 1 freq
bier - 3 freq
bu'er - 2 freq
barry - 34 freq
'braw' - 1 freq
bouer - 18 freq
brow - 10 freq
burroo - 2 freq
bro - 7 freq
brah - 4 freq
baer - 2 freq
brey - 1 freq
birr - 24 freq
bear' - 1 freq
burie - 4 freq
borra - 2 freq
borra' - 1 freq
ber - 4 freq
burra - 62 freq
biro - 6 freq
böre - 1 freq
brö - 1 freq
broo' - 1 freq
beirie - 1 freq
bure - 3 freq
brø - 1 freq
bær - 1 freq
berr - 2 freq
burry - 1 freq
burro - 1 freq
bureau - 11 freq
brio - 1 freq
baar - 6 freq
burou - 1 freq
borroo - 1 freq
brae' - 1 freq
barr - 16 freq
brie - 3 freq
brugh - 2 freq
burr - 2 freq
birra - 9 freq
über - 1 freq
€˜braw - 2 freq
brooie - 1 freq
birry - 2 freq
borr - 1 freq
brrrr - 2 freq
brrr - 2 freq
bu---r - 1 freq
€˜barrie - 1 freq
€žbrae' - 1 freq
€˜brew - 1 freq
byro - 1 freq
“bier” - 1 freq
bauer - 1 freq
br - 4 freq
brewÂ’ - 1 freq
brr - 5 freq
brrrrr - 1 freq
brrrrrrrr - 2 freq
brawwee - 1 freq
borh - 1 freq
bri - 2 freq
hburgh - 1 freq
'bru - 1 freq
berry' - 1 freq
BAR
bar - 493 freq
bars - 92 freq
barring - freq
barred - 15 freq
Time to execute Levenshtein function - 0.196573 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.515757 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027596 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.070103 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001100 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.