A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bap in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bap (0) - 16 freq
bat (1) - 50 freq
yap (1) - 17 freq
kap (1) - 1 freq
baa (1) - 73 freq
zap (1) - 1 freq
bcp (1) - 1 freq
bnp (1) - 1 freq
gap (1) - 49 freq
bam (1) - 38 freq
baw (1) - 130 freq
bhp (1) - 1 freq
sap (1) - 10 freq
map (1) - 90 freq
jap (1) - 2 freq
bdp (1) - 1 freq
wap (1) - 3 freq
bae (1) - 855 freq
ba (1) - 139 freq
bxp (1) - 1 freq
bad (1) - 925 freq
bai (1) - 10 freq
bay (1) - 80 freq
bas (1) - 9 freq
pap (1) - 20 freq
bap (0) - 16 freq
bapu (1) - 14 freq
bp (1) - 8 freq
tap (2) - 757 freq
bak (2) - 376 freq
baps (2) - 5 freq
bag (2) - 321 freq
baq (2) - 1 freq
hap (2) - 53 freq
rap (2) - 20 freq
bar (2) - 492 freq
nap (2) - 17 freq
btp (2) - 1 freq
aap (2) - 26 freq
lap (2) - 70 freq
bah (2) - 4 freq
bac (2) - 11 freq
beep (2) - 9 freq
obp (2) - 1 freq
dap (2) - 1 freq
ban (2) - 48 freq
bax (2) - 2 freq
baf (2) - 1 freq
bab (2) - 13 freq
bal (2) - 6 freq
SoundEx code - B100
bob - 213 freq
bowff - 1 freq
bobby - 45 freq
boab - 53 freq
baby - 117 freq
babe - 11 freq
babbie - 34 freq
bevie - 2 freq
boof - 3 freq
boeuf - 1 freq
beef - 79 freq
bobbie - 15 freq
bib - 8 freq
babby - 25 freq
bawbee - 23 freq
babie - 1 freq
bowf - 9 freq
boabby - 23 freq
befaw - 3 freq
beefie - 1 freq
bevvy - 18 freq
befa - 2 freq
baba - 2 freq
baffy - 3 freq
bye-bye - 1 freq
behave - 37 freq
befaa - 6 freq
bappie - 1 freq
bap - 16 freq
baff - 3 freq
bihaive - 1 freq
byfe - 1 freq
bab - 13 freq
baffie - 62 freq
beehive - 1 freq
buff - 15 freq
beef' - 1 freq
behauf - 17 freq
beeb' - 1 freq
bewvy - 2 freq
baabie - 21 freq
bubba - 1 freq
boaby - 5 freq
bafhe - 2 freq
baffe - 2 freq
bafhie - 2 freq
boob - 5 freq
bibby - 3 freq
bevy - 5 freq
bauf - 1 freq
baby' - 1 freq
booby - 9 freq
bobo - 9 freq
behaiv - 1 freq
baby-e - 1 freq
bebe - 3 freq
bevvie - 2 freq
bouff - 1 freq
behouf - 1 freq
'bib - 1 freq
beif - 2 freq
bappy - 27 freq
behufe - 1 freq
beep - 9 freq
'babe' - 1 freq
beve - 1 freq
beb - 1 freq
bawbee- - 2 freq
bapu - 14 freq
€œbapu - 1 freq
baw-bee - 1 freq
baef - 6 freq
boabie - 2 freq
boba - 2 freq
biffy - 1 freq
bev - 1 freq
baf - 1 freq
bebo - 1 freq
bhheypya - 1 freq
bhp - 1 freq
biffo - 2 freq
beeb - 1 freq
baif - 4 freq
bpub - 1 freq
bwv - 1 freq
befio - 1 freq
byf - 1 freq
bufw - 1 freq
bov - 1 freq
bovvy - 2 freq
MetaPhone code - BP
bappie - 1 freq
bap - 16 freq
bp - 8 freq
bappy - 27 freq
beep - 9 freq
bapu - 14 freq
€œbapu - 1 freq
bhp - 1 freq
BAP
Time to execute Levenshtein function - 0.216622 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.369065 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027596 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036723 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000803 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.