A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to amongst in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
amongst (0) - 62 freq
amangst (1) - 40 freq
amungst (1) - 2 freq
amost (2) - 1 freq
amangat (2) - 1 freq
mangst (2) - 2 freq
among (2) - 128 freq
angst (2) - 4 freq
plongs (3) - 10 freq
mons (3) - 12 freq
'mont (3) - 1 freq
moggit (3) - 1 freq
const (3) - 1 freq
moness (3) - 2 freq
amang (3) - 699 freq
mongw (3) - 1 freq
amung (3) - 3 freq
amends (3) - 3 freq
moght (3) - 1 freq
monged (3) - 1 freq
amoont (3) - 44 freq
monet (3) - 2 freq
amaist (3) - 24 freq
aginst (3) - 19 freq
admoneist (3) - 1 freq
amongst (0) - 62 freq
amungst (1) - 2 freq
amangst (1) - 40 freq
mangst (2) - 2 freq
angst (3) - 4 freq
amangat (3) - 1 freq
amnesty (4) - 4 freq
longest (4) - 4 freq
mings (4) - 1 freq
among (4) - 128 freq
amost (4) - 1 freq
amoonts (5) - 10 freq
honest (5) - 195 freq
aganest (5) - 2 freq
amount (5) - 50 freq
amidst (5) - 7 freq
monts (5) - 5 freq
ameest (5) - 1 freq
ainst (5) - 2 freq
amunsgt (5) - 1 freq
amendit (5) - 2 freq
monks' (5) - 1 freq
monoglot (5) - 3 freq
aming (5) - 3 freq
moist (5) - 9 freq
SoundEx code - A552
amang - 699 freq
amangst - 40 freq
among - 128 freq
amongst - 62 freq
anyhing - 33 freq
announcement - 10 freq
announces - 12 freq
announced - 33 freq
announcer's - 1 freq
aming - 3 freq
anyone's - 2 freq
annoyance - 12 freq
announcements - 5 freq
annuncee - 1 freq
amaing - 1 freq
amencg - 1 freq
annoyingly - 2 freq
amung - 3 freq
annunciation - 1 freq
announcer - 1 freq
aaaamang - 1 freq
annoying - 11 freq
annoonce - 4 freq
'among - 1 freq
annoyince - 2 freq
amungst - 2 freq
anooncements - 2 freq
announce - 7 freq
annoonced - 9 freq
announcing - 4 freq
anoonced - 2 freq
ananias - 2 freq
amangits - 1 freq
annunced - 2 freq
annooncement - 2 freq
anyhin's - 1 freq
annooncet - 1 freq
animacy - 1 freq
announcers - 1 freq
€˜among - 1 freq
amangat - 1 freq
annuncement - 1 freq
annuncit - 1 freq
announcin - 4 freq
annoonces - 1 freq
annooncements - 1 freq
annooncit - 2 freq
anenst - 52 freq
-anenst - 1 freq
anaemic - 1 freq
annooncin - 3 freq
anoonce - 1 freq
amunsgt - 1 freq
€˜amang - 1 freq
€™anyhing - 1 freq
amymacleanpod - 1 freq
annemclaughlin - 8 freq
anumqaisarjaved - 1 freq
MetaPhone code - AMNKST
amangst - 40 freq
amongst - 62 freq
amungst - 2 freq
AMONGST
amang - 699 freq
among - 128 freq
amongst - 62 freq
Time to execute Levenshtein function - 0.192771 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.357002 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029798 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038789 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000972 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.