A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to a-while in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
a-while (0) - 2 freq
awhile (1) - 15 freq
umwhile (2) - 16 freq
awhie (2) - 2 freq
€˜while (2) - 1 freq
a while (2) - 1 freq
'while (2) - 1 freq
wwhile (2) - 1 freq
while (2) - 1436 freq
aftwhiles (3) - 3 freq
w-will (3) - 1 freq
whice (3) - 1 freq
agile (3) - 2 freq
aff-white (3) - 1 freq
whilie (3) - 85 freq
wile (3) - 164 freq
a'wie (3) - 1 freq
whiles (3) - 478 freq
whill (3) - 13 freq
a-wye (3) - 1 freq
-mile (3) - 1 freq
whele (3) - 1 freq
white (3) - 943 freq
awtie (3) - 1 freq
awhe (3) - 2 freq
a-while (0) - 2 freq
awhile (2) - 15 freq
while (3) - 1436 freq
umwhile (3) - 16 freq
'while (3) - 1 freq
wwhile (3) - 1 freq
whilie (4) - 85 freq
whele (4) - 1 freq
whole (4) - 179 freq
whale (4) - 31 freq
whyle (4) - 49 freq
€˜while (4) - 1 freq
awhie (4) - 2 freq
a while (4) - 1 freq
whiley (4) - 12 freq
asweill (5) - 2 freq
qwhilk (5) - 1 freq
awhyles (5) - 1 freq
'whiles (5) - 2 freq
atweill (5) - 27 freq
awhare (5) - 2 freq
-why (5) - 1 freq
a-wee (5) - 3 freq
whilk (5) - 193 freq
akchilee (5) - 1 freq
SoundEx code - A400
all - 1278 freq
aal - 755 freq
ah'll - 739 freq
alloo - 51 freq
a'll - 528 freq
aweill - 6 freq
a'l - 52 freq
ahll - 7 freq
'ah'll - 36 freq
allow - 32 freq
aul - 395 freq
a-hai-ll - 1 freq
'a'll - 37 freq
allou - 35 freq
al - 237 freq
all' - 2 freq
'all - 9 freq
ali - 162 freq
aloo - 8 freq
ah'l - 13 freq
a-lee - 3 freq
allie - 9 freq
ale - 51 freq
'ah'l - 1 freq
alow - 46 freq
aul' - 5 freq
alloa - 4 freq
aa'll - 3 freq
aalie - 1 freq
aly - 5 freq
aall - 3 freq
alowe - 13 freq
'a'l - 1 freq
awwuliy - 1 freq
awol - 9 freq
ah''ll - 8 freq
a-while - 2 freq
ah'il - 23 freq
awhile - 15 freq
'al - 3 freq
allah - 1 freq
'aul' - 1 freq
a'il - 3 freq
alley - 4 freq
ally - 309 freq
a''ll - 4 freq
a-low - 2 freq
al' - 24 freq
ah'lll - 1 freq
'aul - 2 freq
ail - 5 freq
alwa - 2 freq
a'li - 1 freq
'aal' - 1 freq
'a''ll - 1 freq
alll - 1 freq
'alley - 2 freq
aail - 1 freq
alloway - 3 freq
allyie - 1 freq
ael - 1 freq
ahill - 1 freq
ahil - 1 freq
aa'l - 1 freq
aa-heal - 1 freq
aweel - 4 freq
aale - 1 freq
aweliy - 1 freq
awely - 8 freq
aloe - 1 freq
alh - 1 freq
€˜aweel - 1 freq
€œallow - 1 freq
€œall - 10 freq
€˜all - 5 freq
€™allou - 1 freq
allye - 1 freq
ayl - 1 freq
€™all - 1 freq
€œa'll - 4 freq
alloy - 1 freq
€˜aal - 3 freq
allay - 1 freq
alo - 1 freq
€˜al - 1 freq
allo - 2 freq
€œaal - 9 freq
€™aal - 1 freq
al- - 2 freq
ailie - 1 freq
aule - 1 freq
aÂ’wl - 1 freq
a'yl - 1 freq
ahÂ’ll - 10 freq
aillie - 1 freq
ayel - 1 freq
aulÂ’ - 2 freq
aÂ’ll - 5 freq
a while - 1 freq
aloa - 1 freq
a'all - 1 freq
ahl - 1 freq
alya - 1 freq
aalia - 1 freq
alley' - 1 freq
alie - 1 freq
allowa - 1 freq
MetaPhone code - AHL
a-hai-ll - 1 freq
a-while - 2 freq
awhile - 15 freq
ahill - 1 freq
ahil - 1 freq
aa-heal - 1 freq
a while - 1 freq
A-WHILE
Time to execute Levenshtein function - 0.420698 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.750893 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031157 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.094648 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000794 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.