A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to agrees in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
agrees (0) - 13 freq
agreet (1) - 17 freq
agreed (1) - 106 freq
grees (1) - 18 freq
'grees (1) - 2 freq
agree (1) - 156 freq
ageest (2) - 2 freq
trees (2) - 430 freq
grebes (2) - 2 freq
degrees (2) - 29 freq
airses (2) - 3 freq
agness (2) - 2 freq
age's (2) - 2 freq
ages (2) - 151 freq
greens (2) - 55 freq
agnes (2) - 29 freq
agreean (2) - 1 freq
ares (2) - 3 freq
drees (2) - 6 freq
greys (2) - 5 freq
agee (2) - 1 freq
greek (2) - 75 freq
threes (2) - 11 freq
greet (2) - 283 freq
gress (2) - 199 freq
agrees (0) - 13 freq
grees (1) - 18 freq
greys (2) - 5 freq
agreet (2) - 17 freq
agree (2) - 156 freq
grues (2) - 1 freq
agreed (2) - 106 freq
'grees (2) - 2 freq
ayres (3) - 1 freq
gregs (3) - 1 freq
rees (3) - 6 freq
gers (3) - 8 freq
green (3) - 612 freq
aries (3) - 3 freq
greasy (3) - 23 freq
greer (3) - 6 freq
egreer (3) - 2 freq
gros (3) - 8 freq
agreeit (3) - 2 freq
prees (3) - 2 freq
agates (3) - 1 freq
agreein (3) - 8 freq
grease (3) - 12 freq
gree (3) - 102 freq
greesie (3) - 1 freq
SoundEx code - A262
across - 584 freq
acroass - 104 freq
ackers - 16 freq
acres - 24 freq
aggression - 6 freq
acoorse - 7 freq
acrostic - 2 freq
acreage - 1 freq
accursit - 1 freq
azores - 1 freq
age-wrocht - 1 freq
agrees - 13 freq
ascherson - 1 freq
'agricola' - 1 freq
agricola - 2 freq
accuracy - 7 freq
acroas - 4 freq
acroos - 1 freq
across' - 1 freq
akros - 3 freq
agricultural - 7 freq
acorss - 9 freq
ascraeus - 2 freq
aikeray's - 1 freq
acoorsh - 1 freq
acroess - 1 freq
agricola's - 1 freq
akross - 2 freq
ashores' - 1 freq
accressin - 1 freq
acourse - 2 freq
ascreus - 1 freq
assuires - 1 freq
acers - 1 freq
acrass - 1 freq
aggressive - 6 freq
aggrege - 1 freq
aggregit - 1 freq
agriculture - 5 freq
aggressively - 2 freq
agricultur - 1 freq
agrostis - 1 freq
aggregate - 1 freq
acrose - 1 freq
€˜aggressive - 1 freq
a'course - 1 freq
asorkbwy - 1 freq
aggir's - 1 freq
ajcorrigan - 4 freq
acrossparents - 1 freq
akrjbsnz - 1 freq
MetaPhone code - AKRS
across - 584 freq
acroass - 104 freq
ackers - 16 freq
acres - 24 freq
acoorse - 7 freq
agrees - 13 freq
acroas - 4 freq
acroos - 1 freq
across' - 1 freq
akros - 3 freq
acorss - 9 freq
aikeray's - 1 freq
acroess - 1 freq
akross - 2 freq
acourse - 2 freq
acrass - 1 freq
acrose - 1 freq
a'course - 1 freq
aggir's - 1 freq
AGREES
Time to execute Levenshtein function - 0.165655 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.314424 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027474 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038239 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001071 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.