A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bill� in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bill's (3) - 22 freq
billets (3) - 3 freq
bills (3) - 18 freq
billy” (3) - 1 freq
bille (3) - 2 freq
billia (3) - 1 freq
bill (3) - 551 freq
bill'll (3) - 1 freq
billys (3) - 1 freq
billion (3) - 34 freq
billowy (3) - 1 freq
billed (3) - 2 freq
billy' (3) - 1 freq
billy-o (3) - 1 freq
billy's (3) - 12 freq
billie (3) - 33 freq
billows (3) - 1 freq
billet (3) - 1 freq
billies (3) - 14 freq
billy (3) - 261 freq
billo (3) - 1 freq
filly (4) - 8 freq
hillman (4) - 3 freq
sillie (4) - 2 freq
wille (4) - 1 freq
billie (6) - 33 freq
billy's (6) - 12 freq
billy-o (6) - 1 freq
billy' (6) - 1 freq
billows (6) - 1 freq
billet (6) - 1 freq
bill's (6) - 22 freq
billo (6) - 1 freq
billy (6) - 261 freq
billed (6) - 2 freq
billies (6) - 14 freq
bille (6) - 2 freq
bills (6) - 18 freq
billowy (6) - 1 freq
billets (6) - 3 freq
billia (6) - 1 freq
billy” (6) - 1 freq
bill (6) - 551 freq
billys (6) - 1 freq
billion (6) - 34 freq
bill'll (6) - 1 freq
bell-it (7) - 1 freq
byllie (7) - 1 freq
bullyin (7) - 17 freq
billie's (7) - 1 freq
SoundEx code - B400
blae - 56 freq
blue - 567 freq
below - 86 freq
bell - 215 freq
blaw - 200 freq
blew - 117 freq
belly - 163 freq
bile - 56 freq
bul - 3 freq
bowl - 73 freq
billy - 261 freq
bill - 551 freq
bull - 92 freq
bawl - 4 freq
bale - 8 freq
beal - 3 freq
bail - 14 freq
blaa - 24 freq
bol - 2 freq
byle - 16 freq
bully - 24 freq
bellow - 3 freq
bailey - 6 freq
belle' - 1 freq
blo - 2 freq
bole - 4 freq
billie - 33 freq
'bella' - 1 freq
bella - 96 freq
bella' - 1 freq
bela - 1 freq
bleeeooo - 1 freq
ball - 62 freq
bowhill - 5 freq
bowlie - 7 freq
'blaw - 2 freq
'blue - 3 freq
bowel - 16 freq
belle - 17 freq
belaw - 1 freq
belie - 2 freq
buile - 1 freq
boyle - 17 freq
blow - 51 freq
bellie - 6 freq
blah - 29 freq
bloo - 11 freq
beyl - 2 freq
boil - 19 freq
baal - 14 freq
bheil - 2 freq
bailie - 7 freq
bul' - 1 freq
bool - 17 freq
billy' - 1 freq
beliey - 2 freq
billo - 1 freq
baelow - 10 freq
bille - 2 freq
ballo - 1 freq
bleu - 16 freq
boll - 1 freq
blew' - 1 freq
boul - 15 freq
behoul - 4 freq
bïll - 2 freq
behol - 1 freq
buhl - 17 freq
bal' - 1 freq
'blue' - 1 freq
'bowly' - 1 freq
bowly - 3 freq
'bile - 2 freq
bøl - 1 freq
bowil - 1 freq
bal - 6 freq
bill'll - 1 freq
bowle - 2 freq
bel' - 1 freq
bali - 1 freq
be-all - 1 freq
blaaw - 2 freq
buull - 1 freq
böl - 2 freq
'bully - 1 freq
billia - 1 freq
boule - 3 freq
blyue - 1 freq
bial - 1 freq
blye - 2 freq
baillie - 17 freq
boay'll - 1 freq
bla - 13 freq
'bale - 2 freq
blee - 2 freq
billy-o - 1 freq
blui - 1 freq
blue' - 1 freq
bl - 2 freq
bylie - 1 freq
blowy - 1 freq
€˜blah - 2 freq
€œblue - 1 freq
bee'll - 2 freq
€œbilly - 8 freq
€˜blue - 1 freq
bylaw - 1 freq
blawy - 1 freq
€œblaaaah - 1 freq
€œbill - 10 freq
€œbilli - 1 freq
bally - 5 freq
bluey - 1 freq
bla' - 1 freq
beelo - 1 freq
byel - 1 freq
€˜bowel - 3 freq
byllie - 1 freq
bulla - 32 freq
baul - 2 freq
blueÂ’ - 1 freq
ballÂ’ - 1 freq
biÂ’el - 1 freq
boily - 1 freq
blwh - 1 freq
billy” - 1 freq
bayley - 1 freq
bil - 1 freq
bailly - 1 freq
billowy - 1 freq
MetaPhone code - BL
blae - 56 freq
blue - 567 freq
below - 86 freq
bell - 215 freq
blaw - 200 freq
blew - 117 freq
belly - 163 freq
bile - 56 freq
bul - 3 freq
bowl - 73 freq
billy - 261 freq
bill - 551 freq
bull - 92 freq
bawl - 4 freq
bale - 8 freq
beal - 3 freq
bail - 14 freq
blaa - 24 freq
bol - 2 freq
byle - 16 freq
bully - 24 freq
bellow - 3 freq
bailey - 6 freq
belle' - 1 freq
blo - 2 freq
bole - 4 freq
billie - 33 freq
'bella' - 1 freq
bella - 96 freq
bella' - 1 freq
bela - 1 freq
bleeeooo - 1 freq
ball - 62 freq
bowlie - 7 freq
'blaw - 2 freq
'blue - 3 freq
belle - 17 freq
belaw - 1 freq
belie - 2 freq
buile - 1 freq
boyle - 17 freq
blow - 51 freq
bellie - 6 freq
blah - 29 freq
bloo - 11 freq
beyl - 2 freq
boil - 19 freq
baal - 14 freq
bailie - 7 freq
bul' - 1 freq
bool - 17 freq
billy' - 1 freq
beliey - 2 freq
billo - 1 freq
baelow - 10 freq
bille - 2 freq
ballo - 1 freq
bleu - 16 freq
boll - 1 freq
blew' - 1 freq
boul - 15 freq
bïll - 2 freq
buhl - 17 freq
bal' - 1 freq
'blue' - 1 freq
'bowly' - 1 freq
bowly - 3 freq
'bile - 2 freq
bøl - 1 freq
bal - 6 freq
bowle - 2 freq
bel' - 1 freq
bali - 1 freq
be-all - 1 freq
blaaw - 2 freq
buull - 1 freq
böl - 2 freq
'bully - 1 freq
billia - 1 freq
boule - 3 freq
bial - 1 freq
baillie - 17 freq
boay'll - 1 freq
bla - 13 freq
'bale - 2 freq
blee - 2 freq
billy-o - 1 freq
blui - 1 freq
blue' - 1 freq
bl - 2 freq
bylie - 1 freq
blowy - 1 freq
€˜blah - 2 freq
€œblue - 1 freq
bee'll - 2 freq
€œbilly - 8 freq
€˜blue - 1 freq
bylaw - 1 freq
blawy - 1 freq
€œblaaaah - 1 freq
€œbill - 10 freq
€œbilli - 1 freq
bally - 5 freq
bluey - 1 freq
bla' - 1 freq
beelo - 1 freq
byllie - 1 freq
bulla - 32 freq
baul - 2 freq
blueÂ’ - 1 freq
ballÂ’ - 1 freq
biÂ’el - 1 freq
boily - 1 freq
blwh - 1 freq
billy” - 1 freq
bayley - 1 freq
bil - 1 freq
bailly - 1 freq
billowy - 1 freq
BILL�
Time to execute Levenshtein function - 0.186529 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.343594 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028299 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038116 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000953 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.