A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to armagh in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
armagh (0) - 5 freq
aagh (2) - 1 freq
armani (2) - 1 freq
argh (2) - 4 freq
omagh (2) - 2 freq
arnage (2) - 1 freq
anagh (2) - 1 freq
armand (2) - 1 freq
aaagh (2) - 1 freq
r-agh (2) - 1 freq
armata (2) - 1 freq
armys (3) - 1 freq
rags (3) - 12 freq
brags (3) - 3 freq
are-ach (3) - 1 freq
almas' (3) - 1 freq
alma (3) - 1 freq
frag (3) - 1 freq
ahah (3) - 1 freq
crag (3) - 8 freq
ramage (3) - 2 freq
warman (3) - 2 freq
doagh (3) - 1 freq
ragu (3) - 1 freq
maga (3) - 4 freq
armagh (0) - 5 freq
r-agh (3) - 1 freq
omagh (3) - 2 freq
argh (3) - 4 freq
aireamh (4) - 2 freq
ramah (4) - 1 freq
ramage (4) - 2 freq
urgh (4) - 2 freq
rugh (4) - 2 freq
a-muigh (4) - 1 freq
rough (4) - 48 freq
aaaargh (4) - 1 freq
reugh (4) - 1 freq
righ (4) - 1 freq
armand (4) - 1 freq
aaagh (4) - 1 freq
arnage (4) - 1 freq
armani (4) - 1 freq
aagh (4) - 1 freq
armata (4) - 1 freq
anagh (4) - 1 freq
aramaic (5) - 3 freq
homage (5) - 5 freq
mah (5) - 379 freq
army's (5) - 3 freq
SoundEx code - A652
airms - 447 freq
airm-chyne - 1 freq
airmies - 16 freq
arrangin - 2 freq
airm's - 8 freq
arrange - 14 freq
arms - 64 freq
arrangement - 17 freq
arrangements - 18 freq
airm-chair - 4 freq
arran's - 1 freq
arreengit - 1 freq
aaarms - 1 freq
airnest - 2 freq
airnestlie - 1 freq
airmchair - 8 freq
arins - 1 freq
arraynged - 1 freq
armous - 1 freq
armies - 4 freq
arranged - 32 freq
armchair - 6 freq
awareness - 41 freq
airms-sales - 1 freq
arms-dealers - 1 freq
armstrong - 15 freq
aurms - 1 freq
arrangers - 1 freq
aroon's - 1 freq
armageddon - 2 freq
armistice - 1 freq
airnstane - 1 freq
airm-shair - 1 freq
arreinged - 4 freq
arreingin - 2 freq
arrangemint - 1 freq
arnicht - 1 freq
arrangemints - 1 freq
armagh - 5 freq
airn-stith - 1 freq
arranges - 1 freq
arrainjin - 1 freq
army's - 3 freq
airmchairs - 1 freq
armstrong' - 1 freq
airmistice - 1 freq
airmstrangs - 1 freq
aranese - 1 freq
armstrang - 1 freq
awaurness - 7 freq
arreinge - 1 freq
armys - 1 freq
airmstrong - 5 freq
aromas - 1 freq
arraingit - 1 freq
aramaic - 3 freq
arreengement - 1 freq
arreenged - 1 freq
arrynge - 1 freq
awareness-raisin - 1 freq
arrengment - 1 freq
arnage - 1 freq
airing - 1 freq
awormsstory - 1 freq
arnistonrangers - 3 freq
arniston - 2 freq
aaronmackiee - 2 freq
arranging - 1 freq
arianagrande - 1 freq
MetaPhone code - ARMF
armagh - 5 freq
ARMAGH
Time to execute Levenshtein function - 0.400104 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.714139 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.074352 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037091 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000871 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.