A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to annoying in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
annoying (0) - 10 freq
annoyin (1) - 25 freq
annoyince (2) - 2 freq
enjoying (2) - 24 freq
annoyinly (2) - 1 freq
annoyingly (2) - 2 freq
annoyan (2) - 1 freq
denoting (3) - 1 freq
annone (3) - 1 freq
inning (3) - 1 freq
enjoyin' (3) - 1 freq
unknoting (3) - 1 freq
knowing (3) - 5 freq
sannyin (3) - 1 freq
banning (3) - 2 freq
tanning (3) - 3 freq
anchoring (3) - 1 freq
injoyin (3) - 2 freq
annexin (3) - 1 freq
anoint (3) - 1 freq
incoming (3) - 1 freq
snowing (3) - 4 freq
allowing (3) - 6 freq
unifying (3) - 1 freq
manning (3) - 2 freq
annoying (0) - 10 freq
annoyin (2) - 25 freq
inning (3) - 1 freq
annoyan (3) - 1 freq
annoyingly (3) - 2 freq
annoyince (3) - 2 freq
annoyinly (3) - 1 freq
enjoying (3) - 24 freq
annint (4) - 1 freq
nohing (4) - 3 freq
manning (4) - 2 freq
atoning (4) - 1 freq
ongoing (4) - 2 freq
unifying (4) - 1 freq
denying (4) - 3 freq
anyhing (4) - 33 freq
annoyance (4) - 12 freq
banning (4) - 2 freq
tanning (4) - 3 freq
annone (4) - 1 freq
needing (5) - 6 freq
yin-yang (5) - 1 freq
naming (5) - 2 freq
annoonce (5) - 4 freq
cunning (5) - 1 freq
SoundEx code - A552
amang - 690 freq
amangst - 40 freq
among - 128 freq
amongst - 62 freq
anyhing - 33 freq
announcement - 10 freq
announces - 12 freq
announced - 32 freq
announcer's - 1 freq
aming - 3 freq
anyone's - 2 freq
annoyance - 12 freq
announcements - 5 freq
annuncee - 1 freq
amaing - 1 freq
amencg - 1 freq
annoyingly - 2 freq
amung - 3 freq
annunciation - 1 freq
announcer - 1 freq
aaaamang - 1 freq
annoying - 10 freq
annoonce - 4 freq
'among - 1 freq
annoyince - 2 freq
amungst - 2 freq
anooncements - 2 freq
announce - 7 freq
annoonced - 9 freq
announcing - 4 freq
anoonced - 2 freq
ananias - 2 freq
amangits - 1 freq
annunced - 2 freq
annooncement - 2 freq
anyhin's - 1 freq
annooncet - 1 freq
animacy - 1 freq
announcers - 1 freq
€˜among - 1 freq
amangat - 1 freq
annuncement - 1 freq
annuncit - 1 freq
announcin - 4 freq
annoonces - 1 freq
annooncements - 1 freq
annooncit - 2 freq
anenst - 52 freq
-anenst - 1 freq
anaemic - 1 freq
annooncin - 3 freq
anoonce - 1 freq
amunsgt - 1 freq
€˜amang - 1 freq
€™anyhing - 1 freq
amymacleanpod - 1 freq
annemclaughlin - 8 freq
anumqaisarjaved - 1 freq
MetaPhone code - ANYNK
annoying - 10 freq
ANNOYING
Time to execute Levenshtein function - 0.249505 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.428206 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029567 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040884 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001004 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.