A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to avoid in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
avoid (0) - 57 freq
ovoid (1) - 1 freq
avodd (1) - 1 freq
avoids (1) - 5 freq
void (1) - 17 freq
hoid (2) - 10 freq
acrid (2) - 1 freq
auid (2) - 1 freq
vod (2) - 1 freq
vodd (2) - 2 freq
vid (2) - 4 freq
aheid (2) - 194 freq
awid (2) - 2 freq
avoided (2) - 6 freq
devoid (2) - 7 freq
oid (2) - 1 freq
aid (2) - 37 freq
droid (2) - 1 freq
avi (2) - 1 freq
avoidin (2) - 17 freq
voyd (2) - 2 freq
arid (2) - 5 freq
avows (2) - 1 freq
avail (2) - 12 freq
avoidit (2) - 8 freq
avoid (0) - 57 freq
void (1) - 17 freq
ovoid (1) - 1 freq
ovid (2) - 1 freq
vid (2) - 4 freq
voyd (2) - 2 freq
vod (2) - 1 freq
vood (2) - 2 freq
avoids (2) - 5 freq
avodd (2) - 1 freq
ajod (3) - 1 freq
i'void (3) - 1 freq
ahid (3) - 2 freq
avowed (3) - 1 freq
avoidet (3) - 1 freq
avin (3) - 2 freq
aloud (3) - 22 freq
voil (3) - 1 freq
avou (3) - 3 freq
vidi (3) - 1 freq
vido (3) - 1 freq
ovd (3) - 3 freq
vd (3) - 2 freq
voodo (3) - 1 freq
voyied (3) - 1 freq
SoundEx code - A130
aboot - 10010 freq
'awbody - 4 freq
awbody - 266 freq
about - 686 freq
'aboot - 10 freq
aabody - 342 freq
avoid - 57 freq
aboit - 1 freq
aboot' - 3 freq
aabodie - 33 freq
aft - 69 freq
a'body - 24 freq
apt - 45 freq
abooot - 2 freq
awb'dy - 2 freq
aabiddy - 1 freq
abody - 90 freq
abide - 15 freq
abodie - 2 freq
abdy - 31 freq
aabdy - 9 freq
abate - 3 freq
abid - 1 freq
abbot - 5 freq
aabeit - 3 freq
aboat - 2 freq
aift - 14 freq
abudee - 10 freq
abudy - 11 freq
abuddee - 1 freq
awbuddee - 1 freq
awbudee - 5 freq
awbudy - 6 freq
abyde - 2 freq
awbeit - 9 freq
abdie - 1 freq
awbodie - 37 freq
ab'd - 1 freq
awbuddie - 15 freq
awboady - 1 freq
ahbuddie - 3 freq
afuit - 3 freq
abet - 1 freq
awboadie - 3 freq
abiud - 4 freq
abot - 1 freq
'aabody - 1 freq
afte - 1 freq
avodd - 1 freq
awfte - 1 freq
aff-white - 1 freq
afta - 1 freq
af't - 1 freq
abiddy - 28 freq
a'biddy - 2 freq
aff-pit - 6 freq
apathy - 8 freq
aabody' - 1 freq
awfid - 1 freq
abode' - 1 freq
a'bdy - 3 freq
aa-but - 1 freq
awbidy - 2 freq
abit - 1 freq
abaot - 1 freq
€˜abide - 1 freq
abott - 2 freq
afoot - 1 freq
abaat - 1 freq
abaaht - 1 freq
awbdy - 8 freq
abidy - 36 freq
aabudy - 1 freq
abd - 2 freq
awbuddy - 5 freq
abittie - 1 freq
avowed - 1 freq
aebody - 1 freq
afouth - 1 freq
€œaabody - 2 freq
€œaboot - 1 freq
aff-the-wa - 1 freq
apd - 2 freq
afft - 1 freq
apoot - 1 freq
abaht - 2 freq
abeit - 1 freq
a'bidy - 8 freq
€œabdee - 1 freq
abdee - 1 freq
aabidy - 2 freq
aÂ’body - 8 freq
aÂ’bidy - 6 freq
aywuypvbwd - 1 freq
ab'dy - 3 freq
ayepad - 1 freq
ahbody - 6 freq
abut - 1 freq
abootÂ’ - 1 freq
a'aboot - 1 freq
aab'dy - 1 freq
awebody - 10 freq
appt - 1 freq
awebuddy - 1 freq
MetaPhone code - AFT
avoid - 57 freq
aft - 69 freq
aift - 14 freq
aught - 4 freq
afuit - 3 freq
afte - 1 freq
avodd - 1 freq
awfte - 1 freq
afta - 1 freq
af't - 1 freq
awfid - 1 freq
€˜aught - 1 freq
afoot - 1 freq
afft - 1 freq
AVOID
Time to execute Levenshtein function - 0.502218 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.720357 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.075992 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042346 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000874 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.