A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to acrass in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
acrass (0) - 1 freq
crass (1) - 4 freq
acroass (1) - 104 freq
across (1) - 587 freq
crams (2) - 2 freq
crags (2) - 7 freq
acrose (2) - 1 freq
actress (2) - 24 freq
crabs (2) - 7 freq
craks (2) - 1 freq
scrans (2) - 3 freq
craws (2) - 60 freq
criss (2) - 5 freq
cass (2) - 14 freq
acroess (2) - 1 freq
arraws (2) - 1 freq
across' (2) - 1 freq
scraps (2) - 22 freq
croass (2) - 8 freq
arabs (2) - 4 freq
harass (2) - 2 freq
crans (2) - 1 freq
arase (2) - 1 freq
cross (2) - 260 freq
akross (2) - 2 freq
acrass (0) - 1 freq
acroass (1) - 104 freq
across (1) - 587 freq
crass (1) - 4 freq
acroess (2) - 1 freq
croass (2) - 8 freq
acorss (2) - 9 freq
cross (2) - 260 freq
criss (2) - 5 freq
actoss (3) - 1 freq
corss (3) - 9 freq
craps (3) - 16 freq
acres (3) - 24 freq
crisis (3) - 31 freq
cracs (3) - 1 freq
class (3) - 452 freq
creases (3) - 1 freq
acoss (3) - 1 freq
acroos (3) - 1 freq
access (3) - 74 freq
craas (3) - 6 freq
aroass (3) - 1 freq
crises (3) - 1 freq
brass (3) - 31 freq
grass (3) - 106 freq
SoundEx code - A262
across - 587 freq
acroass - 104 freq
ackers - 16 freq
acres - 24 freq
aggression - 6 freq
acoorse - 7 freq
acrostic - 2 freq
acreage - 1 freq
accursit - 1 freq
azores - 1 freq
aggressive - 9 freq
age-wrocht - 1 freq
agrees - 13 freq
ascherson - 1 freq
'agricola' - 1 freq
agricola - 2 freq
accuracy - 7 freq
acroas - 4 freq
acroos - 1 freq
across' - 1 freq
akros - 3 freq
agricultural - 7 freq
acorss - 9 freq
ascraeus - 2 freq
aikeray's - 1 freq
acoorsh - 1 freq
acroess - 1 freq
agricola's - 1 freq
akross - 2 freq
ashores' - 1 freq
accressin - 1 freq
acourse - 2 freq
ascreus - 1 freq
assuires - 1 freq
acers - 1 freq
acrass - 1 freq
aggrege - 1 freq
aggregit - 1 freq
agriculture - 5 freq
aggressively - 2 freq
agricultur - 1 freq
agrostis - 1 freq
aggregate - 1 freq
acrose - 1 freq
€˜aggressive - 1 freq
a'course - 1 freq
asorkbwy - 1 freq
aggir's - 1 freq
ajcorrigan - 4 freq
acrossparents - 1 freq
akrjbsnz - 1 freq
MetaPhone code - AKRS
across - 587 freq
acroass - 104 freq
ackers - 16 freq
acres - 24 freq
acoorse - 7 freq
agrees - 13 freq
acroas - 4 freq
acroos - 1 freq
across' - 1 freq
akros - 3 freq
acorss - 9 freq
aikeray's - 1 freq
acroess - 1 freq
akross - 2 freq
acourse - 2 freq
acrass - 1 freq
acrose - 1 freq
a'course - 1 freq
aggir's - 1 freq
ACRASS
Time to execute Levenshtein function - 0.252944 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.553355 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027572 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.070608 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000938 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.