A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to armies in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
armies (0) - 4 freq
argies (1) - 9 freq
airmies (1) - 16 freq
aries (1) - 3 freq
gardies (2) - 1 freq
eries (2) - 1 freq
mammies (2) - 8 freq
aedies (2) - 1 freq
armrest (2) - 2 freq
carnies (2) - 2 freq
ariel (2) - 5 freq
argied (2) - 22 freq
abies (2) - 3 freq
arses (2) - 7 freq
arins (2) - 1 freq
rammies (2) - 4 freq
admits (2) - 2 freq
army's (2) - 3 freq
parties (2) - 20 freq
arches (2) - 17 freq
sarnies (2) - 1 freq
lammies (2) - 5 freq
orgies (2) - 2 freq
armit (2) - 4 freq
hardies (2) - 1 freq
armies (0) - 4 freq
airmies (1) - 16 freq
argies (2) - 9 freq
armous (2) - 1 freq
aries (2) - 3 freq
armys (2) - 1 freq
arms (2) - 67 freq
ruims (3) - 10 freq
ares (3) - 3 freq
gamies (3) - 11 freq
erms (3) - 48 freq
airies (3) - 1 freq
airmie (3) - 15 freq
yarries (3) - 1 freq
aromas (3) - 1 freq
armed (3) - 18 freq
rymes (3) - 1 freq
arises (3) - 5 freq
rims (3) - 3 freq
arraes (3) - 1 freq
airms (3) - 451 freq
arles (3) - 5 freq
ermie (3) - 1 freq
damies (3) - 1 freq
arins (3) - 1 freq
SoundEx code - A652
airms - 451 freq
airm-chyne - 1 freq
airmies - 16 freq
arrangin - 2 freq
airm's - 8 freq
arrange - 14 freq
arms - 67 freq
arrangement - 17 freq
arrangements - 18 freq
airm-chair - 4 freq
arran's - 1 freq
arreengit - 1 freq
aaarms - 1 freq
airnest - 2 freq
airnestlie - 1 freq
airmchair - 8 freq
arins - 1 freq
arraynged - 1 freq
armous - 1 freq
awareness - 44 freq
arranged - 33 freq
armies - 4 freq
armchair - 6 freq
airms-sales - 1 freq
arms-dealers - 1 freq
armstrong - 15 freq
aurms - 1 freq
arrangers - 1 freq
aroon's - 1 freq
armageddon - 2 freq
armistice - 1 freq
airnstane - 1 freq
airm-shair - 1 freq
arreinged - 4 freq
arreingin - 2 freq
arrangemint - 1 freq
arnicht - 1 freq
arrangemints - 1 freq
armagh - 5 freq
airn-stith - 1 freq
arranges - 1 freq
arrainjin - 1 freq
army's - 3 freq
airmchairs - 1 freq
armstrong' - 1 freq
airmistice - 1 freq
airmstrangs - 1 freq
aranese - 1 freq
armstrang - 1 freq
awaurness - 7 freq
arreinge - 1 freq
armys - 1 freq
airmstrong - 5 freq
aromas - 1 freq
arraingit - 1 freq
aramaic - 3 freq
arreengement - 1 freq
arreenged - 1 freq
arrynge - 1 freq
awareness-raisin - 1 freq
arrengment - 1 freq
arnage - 1 freq
airing - 1 freq
awormsstory - 1 freq
arnistonrangers - 3 freq
arniston - 2 freq
aaronmackiee - 2 freq
arranging - 1 freq
arianagrande - 1 freq
MetaPhone code - ARMS
airms - 451 freq
airmies - 16 freq
airm's - 8 freq
arms - 67 freq
aaarms - 1 freq
armous - 1 freq
armies - 4 freq
aurms - 1 freq
army's - 3 freq
armys - 1 freq
aromas - 1 freq
ARMIES
Time to execute Levenshtein function - 0.202379 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.403259 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032191 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041260 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001182 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.