A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ab in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ab (0) - 25 freq
ar (1) - 208 freq
abz (1) - 17 freq
apb (1) - 1 freq
aib (1) - 2 freq
abo (1) - 2 freq
wb (1) - 4 freq
lab (1) - 29 freq
fab (1) - 18 freq
xb (1) - 3 freq
wab (1) - 126 freq
bab (1) - 13 freq
ub (1) - 3 freq
sab (1) - 9 freq
ay (1) - 2131 freq
ib (1) - 1 freq
aq (1) - 2 freq
dab (1) - 58 freq
ad (1) - 126 freq
aby (1) - 11 freq
gab (1) - 32 freq
ax (1) - 76 freq
ah (1) - 16973 freq
abg (1) - 1 freq
ag (1) - 12 freq
ab (0) - 25 freq
ob (1) - 2 freq
aab (1) - 3 freq
ub (1) - 3 freq
aby (1) - 11 freq
abe (1) - 3 freq
eb (1) - 6 freq
abu (1) - 2 freq
b (1) - 745 freq
ib (1) - 1 freq
aib (1) - 2 freq
abo (1) - 2 freq
abi (1) - 3 freq
yb (1) - 3 freq
mb (2) - 5 freq
sb (2) - 4 freq
-ab (2) - 1 freq
aj (2) - 6 freq
am (2) - 1527 freq
aa (2) - 7091 freq
as (2) - 17482 freq
al (2) - 237 freq
abs (2) - 1 freq
ae (2) - 5537 freq
gb (2) - 2 freq
SoundEx code - A100
aff - 4336 freq
awfie - 392 freq
ava - 487 freq
affa - 931 freq
awfae - 22 freq
a've - 235 freq
awfu - 128 freq
ah've - 858 freq
awfy - 449 freq
ave - 42 freq
app - 47 freq
avaa - 30 freq
ape - 5 freq
abbey - 33 freq
apö - 53 freq
'abi - 1 freq
'ave - 5 freq
auv - 2 freq
affy - 93 freq
'a've - 9 freq
aby - 11 freq
'ah've - 41 freq
abee - 3 freq
'affa - 2 freq
awfa - 90 freq
ah'v - 114 freq
aw've - 1 freq
ap - 94 freq
aa've - 2 freq
ab - 25 freq
auf - 13 freq
av - 161 freq
a'v - 48 freq
aaf - 23 freq
awf - 2 freq
aib - 2 freq
ahve - 10 freq
ah''ve - 2 freq
ahi've - 1 freq
awap - 1 freq
abooy - 1 freq
'aff - 4 freq
'awfie - 8 freq
ah'vy - 1 freq
affie - 6 freq
aap - 26 freq
a'af - 1 freq
aawfu - 1 freq
af - 14 freq
afa - 12 freq
abou - 1 freq
abo - 2 freq
ah'p - 1 freq
aif - 6 freq
awfe - 4 freq
'aefie - 1 freq
aff' - 1 freq
apy - 1 freq
abe - 3 freq
aip - 10 freq
abiah - 1 freq
affo - 5 freq
a''ve - 3 freq
a'va - 1 freq
a'foo - 1 freq
apo - 150 freq
ahv - 7 freq
'awfe - 1 freq
abie - 1 freq
avaw - 5 freq
apae - 4 freq
ah'vv - 1 freq
aaaaahhhhhhhbhh - 2 freq
affyi - 1 freq
awfou - 2 freq
apø - 14 freq
aafa - 14 freq
avou - 3 freq
abbie - 59 freq
avà - 1 freq
abow - 1 freq
awb - 6 freq
afu - 3 freq
ab- - 1 freq
aafu - 14 freq
awfu' - 1 freq
afffa - 1 freq
abbay - 1 freq
€œaffa - 1 freq
-ab - 1 freq
abbé - 1 freq
abby - 16 freq
abu - 2 freq
apa - 2 freq
aboo - 2 freq
aab - 3 freq
€œawfie - 1 freq
abbow - 1 freq
aoife - 5 freq
€˜av - 1 freq
ay'be - 1 freq
awefu - 1 freq
affae - 3 freq
€œave - 2 freq
€˜awfie - 1 freq
afo - 4 freq
awfi - 5 freq
avw - 2 freq
a'hve - 1 freq
appy - 1 freq
aÂ’ve - 7 freq
ahÂ’ve - 29 freq
abi - 3 freq
aafe - 1 freq
apao - 1 freq
aov - 1 freq
ah‘ve - 1 freq
apio - 1 freq
aph - 1 freq
ava' - 1 freq
aaib - 1 freq
avva - 1 freq
awffie - 1 freq
aÂ’v - 1 freq
“aff - 1 freq
aybba - 1 freq
avi - 1 freq
aff- - 1 freq
aup - 1 freq
aifb - 1 freq
apb - 1 freq
afpe - 1 freq
'affy' - 1 freq
awfv - 1 freq
aypa - 1 freq
aibbey - 1 freq
aafo - 8 freq
'av - 1 freq
awffy - 1 freq
MetaPhone code - AB
abbey - 33 freq
'abi - 1 freq
aby - 11 freq
abee - 3 freq
ab - 25 freq
aib - 2 freq
abooy - 1 freq
abou - 1 freq
abo - 2 freq
abe - 3 freq
abiah - 1 freq
abie - 1 freq
aaaaahhhhhhhbhh - 2 freq
abbie - 59 freq
abow - 1 freq
awb - 6 freq
ab- - 1 freq
abbay - 1 freq
-ab - 1 freq
abbé - 1 freq
abby - 16 freq
abu - 2 freq
aboo - 2 freq
aab - 3 freq
abbow - 1 freq
ay'be - 1 freq
abi - 3 freq
aaib - 1 freq
aybba - 1 freq
aibbey - 1 freq
AB
Time to execute Levenshtein function - 0.179646 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.377516 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.037145 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042774 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001202 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.