A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to annoying in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
annoying (0) - 11 freq
annoyin (1) - 25 freq
annoyingly (2) - 2 freq
annoyinly (2) - 1 freq
annoyan (2) - 1 freq
enjoying (2) - 24 freq
annoyince (2) - 2 freq
annoys (3) - 4 freq
manning (3) - 2 freq
antonine (3) - 2 freq
unknoting (3) - 1 freq
annoyance (3) - 12 freq
ongoing (3) - 2 freq
njoyin (3) - 1 freq
banning (3) - 2 freq
anything (3) - 138 freq
ignoring (3) - 2 freq
nohing (3) - 3 freq
annoy (3) - 12 freq
applying (3) - 8 freq
annint (3) - 1 freq
allowing (3) - 6 freq
annexin (3) - 1 freq
inning (3) - 1 freq
anchoring (3) - 1 freq
annoying (0) - 11 freq
annoyin (2) - 25 freq
annoyince (3) - 2 freq
enjoying (3) - 24 freq
inning (3) - 1 freq
annoyan (3) - 1 freq
annoyingly (3) - 2 freq
annoyinly (3) - 1 freq
annint (4) - 1 freq
anyhing (4) - 33 freq
unifying (4) - 1 freq
nohing (4) - 3 freq
tanning (4) - 3 freq
denying (4) - 3 freq
atoning (4) - 1 freq
annone (4) - 1 freq
ongoing (4) - 2 freq
annoyance (4) - 12 freq
banning (4) - 2 freq
manning (4) - 2 freq
announce (5) - 7 freq
caining (5) - 1 freq
dunning (5) - 1 freq
onding (5) - 9 freq
oniehing (5) - 1 freq
SoundEx code - A552
amang - 699 freq
amangst - 40 freq
among - 128 freq
amongst - 62 freq
anyhing - 33 freq
announcement - 10 freq
announces - 12 freq
announced - 33 freq
announcer's - 1 freq
aming - 3 freq
anyone's - 2 freq
annoyance - 12 freq
announcements - 5 freq
annuncee - 1 freq
amaing - 1 freq
amencg - 1 freq
annoyingly - 2 freq
amung - 3 freq
annunciation - 1 freq
announcer - 1 freq
aaaamang - 1 freq
annoying - 11 freq
annoonce - 4 freq
'among - 1 freq
annoyince - 2 freq
amungst - 2 freq
anooncements - 2 freq
announce - 7 freq
annoonced - 9 freq
announcing - 4 freq
anoonced - 2 freq
ananias - 2 freq
amangits - 1 freq
annunced - 2 freq
annooncement - 2 freq
anyhin's - 1 freq
annooncet - 1 freq
animacy - 1 freq
announcers - 1 freq
€˜among - 1 freq
amangat - 1 freq
annuncement - 1 freq
annuncit - 1 freq
announcin - 4 freq
annoonces - 1 freq
annooncements - 1 freq
annooncit - 2 freq
anenst - 52 freq
-anenst - 1 freq
anaemic - 1 freq
annooncin - 3 freq
anoonce - 1 freq
amunsgt - 1 freq
€˜amang - 1 freq
€™anyhing - 1 freq
amymacleanpod - 1 freq
annemclaughlin - 8 freq
anumqaisarjaved - 1 freq
MetaPhone code - ANYNK
annoying - 11 freq
ANNOYING
Time to execute Levenshtein function - 0.235333 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.387272 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027532 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.045014 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001237 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.