A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to awauk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
awauk (0) - 11 freq
wauk (1) - 40 freq
awaur (1) - 43 freq
awak (1) - 1 freq
'wauk (1) - 1 freq
waek (2) - 7 freq
awfu (2) - 128 freq
awan (2) - 1 freq
rauk (2) - 1 freq
waik (2) - 28 freq
aware (2) - 60 freq
waut (2) - 1 freq
swaak (2) - 2 freq
awae (2) - 22 freq
awork (2) - 1 freq
awud (2) - 1 freq
wakk (2) - 3 freq
'walk (2) - 1 freq
award (2) - 59 freq
watk (2) - 2 freq
alaek (2) - 2 freq
aaud (2) - 1 freq
waur (2) - 221 freq
waak (2) - 54 freq
walk (2) - 478 freq
awauk (0) - 11 freq
wauk (1) - 40 freq
awak (1) - 1 freq
waik (2) - 28 freq
waak (2) - 54 freq
awake (2) - 31 freq
awk (2) - 1 freq
wak (2) - 53 freq
awaur (2) - 43 freq
'wauk (2) - 1 freq
waek (2) - 7 freq
await (3) - 7 freq
wauks (3) - 5 freq
abak (3) - 1 freq
waky (3) - 2 freq
aways (3) - 58 freq
awaw (3) - 72 freq
awah (3) - 14 freq
cauk (3) - 5 freq
abaak (3) - 1 freq
awaey (3) - 4 freq
wauy (3) - 1 freq
wauf (3) - 1 freq
qwak (3) - 1 freq
awayr (3) - 1 freq
SoundEx code - A200
as - 17602 freq
ach - 313 freq
'as - 15 freq
aiks - 3 freq
ajee - 21 freq
ash - 32 freq
ask - 526 freq
ago - 394 freq
aes - 4 freq
ayewis - 156 freq
ack - 42 freq
aige - 11 freq
age - 421 freq
ak - 14 freq
ayeweys - 87 freq
aw's - 3 freq
awauk - 11 freq
ache - 14 freq
aik - 55 freq
aggie - 49 freq
ayweys - 25 freq
ags - 1 freq
ask-' - 1 freq
ashe - 1 freq
asks - 214 freq
awake - 31 freq
aix - 42 freq
aisy - 27 freq
aigs - 1 freq
ax - 76 freq
assay - 5 freq
'ach - 27 freq
azzy - 25 freq
ahs - 1 freq
'azzy - 3 freq
ac - 24 freq
'awk - 1 freq
ask' - 1 freq
aywiss - 7 freq
'awchhh - 1 freq
aa-weys - 1 freq
axe - 22 freq
aaweys - 5 freq
aik's - 2 freq
acks - 11 freq
aywis - 14 freq
ayewiz - 44 freq
aywiz - 1 freq
ayewes - 10 freq
aqua - 12 freq
ace - 19 freq
ahoj - 2 freq
ag - 13 freq
auche - 1 freq
awoke - 10 freq
aish - 2 freq
awash - 2 freq
aise - 14 freq
aisie - 11 freq
a''ways - 1 freq
a'ways - 3 freq
ass - 26 freq
ayweis - 2 freq
ask's - 2 freq
ayeweis - 4 freq
aij - 2 freq
auch - 3 freq
ayeways - 24 freq
askew - 2 freq
ahhhhs - 1 freq
asa - 6 freq
aks - 18 freq
'ask - 1 freq
aas - 6 freq
akh - 1 freq
aussie - 1 freq
asia - 12 freq
aa's - 4 freq
ayewyes - 9 freq
agai - 2 freq
'ach' - 1 freq
agi - 1 freq
ahice - 1 freq
ahk - 1 freq
ahhce - 1 freq
aka - 12 freq
aye-wiz - 20 freq
aese - 8 freq
agh - 11 freq
aiwyes - 1 freq
'age - 4 freq
'agh - 2 freq
ayes - 2 freq
ah'g - 1 freq
a's - 2 freq
ahj - 1 freq
ahaz - 4 freq
aq - 2 freq
ax' - 1 freq
ais - 18 freq
agae - 2 freq
aga - 8 freq
a-z - 13 freq
'ayewis - 3 freq
assie - 1 freq
aiggs - 4 freq
¬‚ask - 1 freq
aesy - 57 freq
aigg - 6 freq
'aussie - 1 freq
ayewss - 1 freq
auga - 1 freq
as-she - 1 freq
ayoch - 1 freq
a'ess - 1 freq
acce - 1 freq
aisk - 4 freq
ayesha - 1 freq
age' - 1 freq
aich - 2 freq
asky - 2 freq
akse - 1 freq
aiss - 3 freq
aaagh - 1 freq
ase - 1 freq
aisse - 4 freq
aus - 4 freq
agg - 6 freq
agg's - 1 freq
aach - 1 freq
€œach - 16 freq
€˜ach - 3 freq
€˜as - 8 freq
aagh - 1 freq
€œas - 14 freq
aze - 2 freq
ake - 7 freq
aywise - 1 freq
aywyes - 1 freq
aawyes - 1 freq
ajie - 3 freq
awyes - 3 freq
ayways - 2 freq
aikey - 4 freq
aw-weys - 1 freq
aesie - 2 freq
awk - 1 freq
awak - 1 freq
aug - 6 freq
aeyways - 1 freq
ay'ways - 1 freq
aeways - 3 freq
auss - 2 freq
€™as - 1 freq
aawise - 1 freq
awis - 3 freq
€˜ago - 2 freq
aig - 1 freq
ayewys - 1 freq
agee - 1 freq
awch - 2 freq
aj - 6 freq
akio - 1 freq
aiko - 1 freq
akxjz - 1 freq
aÂ’s - 1 freq
az - 1 freq
ayj - 1 freq
aezzc - 1 freq
aocau - 1 freq
aecc - 5 freq
ahg - 1 freq
asjjx - 1 freq
ayz - 1 freq
awayego - 1 freq
assq - 1 freq
ajya - 1 freq
aaywiz - 1 freq
aghw - 1 freq
axae - 1 freq
axj - 1 freq
aggggh - 1 freq
agaaaaaa - 1 freq
aÂ’wik - 1 freq
agcc - 5 freq
ajay - 2 freq
a'yese - 1 freq
ahgxo - 1 freq
aaach - 1 freq
aws - 1 freq
aghhhhhhhhh - 1 freq
aghhhhhhhhhh - 1 freq
“as - 1 freq
aqs - 1 freq
ayÂ’s - 1 freq
aikawa - 1 freq
aox - 1 freq
aoxckse - 1 freq
aic - 2 freq
aoagaw - 1 freq
aways - 58 freq
ayechihuahua - 1 freq
aussie' - 1 freq
auk - 1 freq
aye's - 1 freq
a'sae - 1 freq
awsae - 3 freq
aaaaagh - 1 freq
aok - 1 freq
ayxz - 1 freq
awjh - 1 freq
aqsc - 1 freq
akk - 1 freq
agyw - 1 freq
agj - 1 freq
awys - 1 freq
aque - 1 freq
ayq - 1 freq
auqi - 1 freq
aaways - 1 freq
'ack - 2 freq
MetaPhone code - AWK
awauk - 11 freq
awake - 31 freq
awoke - 10 freq
awak - 1 freq
aÂ’wik - 1 freq
AWAUK
Time to execute Levenshtein function - 0.179071 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.364514 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028396 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037378 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000860 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.