A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to acroos in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
acroos (0) - 1 freq
acrood (1) - 1 freq
acroas (1) - 4 freq
across (1) - 587 freq
scroos (1) - 1 freq
acroo (1) - 1 freq
roos (2) - 2 freq
acres (2) - 24 freq
acoss (2) - 1 freq
lavroos (2) - 1 freq
acrose (2) - 1 freq
croys (2) - 1 freq
croo (2) - 7 freq
acos (2) - 2 freq
coos (2) - 74 freq
alloos (2) - 38 freq
aprons (2) - 1 freq
cross (2) - 260 freq
akross (2) - 2 freq
aloos (2) - 4 freq
crooks (2) - 3 freq
crows (2) - 4 freq
across' (2) - 1 freq
scroo (2) - 2 freq
acroass (2) - 104 freq
acroos (0) - 1 freq
acroas (1) - 4 freq
cros (2) - 1 freq
acrose (2) - 1 freq
croys (2) - 1 freq
croas (2) - 1 freq
acres (2) - 24 freq
croose (2) - 14 freq
acrood (2) - 1 freq
scroos (2) - 1 freq
across (2) - 587 freq
acroo (2) - 1 freq
crook (3) - 12 freq
croog (3) - 1 freq
coors (3) - 5 freq
crys (3) - 1 freq
croons (3) - 14 freq
cloos (3) - 1 freq
acers (3) - 1 freq
broos (3) - 46 freq
nacrous (3) - 1 freq
croon (3) - 76 freq
actors (3) - 31 freq
crood (3) - 111 freq
acoorse (3) - 7 freq
SoundEx code - A262
across - 587 freq
acroass - 104 freq
ackers - 16 freq
acres - 24 freq
aggression - 6 freq
acoorse - 7 freq
acrostic - 2 freq
acreage - 1 freq
accursit - 1 freq
azores - 1 freq
aggressive - 9 freq
age-wrocht - 1 freq
agrees - 13 freq
ascherson - 1 freq
'agricola' - 1 freq
agricola - 2 freq
accuracy - 7 freq
acroas - 4 freq
acroos - 1 freq
across' - 1 freq
akros - 3 freq
agricultural - 7 freq
acorss - 9 freq
ascraeus - 2 freq
aikeray's - 1 freq
acoorsh - 1 freq
acroess - 1 freq
agricola's - 1 freq
akross - 2 freq
ashores' - 1 freq
accressin - 1 freq
acourse - 2 freq
ascreus - 1 freq
assuires - 1 freq
acers - 1 freq
acrass - 1 freq
aggrege - 1 freq
aggregit - 1 freq
agriculture - 5 freq
aggressively - 2 freq
agricultur - 1 freq
agrostis - 1 freq
aggregate - 1 freq
acrose - 1 freq
€˜aggressive - 1 freq
a'course - 1 freq
asorkbwy - 1 freq
aggir's - 1 freq
ajcorrigan - 4 freq
acrossparents - 1 freq
akrjbsnz - 1 freq
MetaPhone code - AKRS
across - 587 freq
acroass - 104 freq
ackers - 16 freq
acres - 24 freq
acoorse - 7 freq
agrees - 13 freq
acroas - 4 freq
acroos - 1 freq
across' - 1 freq
akros - 3 freq
acorss - 9 freq
aikeray's - 1 freq
acroess - 1 freq
akross - 2 freq
acourse - 2 freq
acrass - 1 freq
acrose - 1 freq
a'course - 1 freq
aggir's - 1 freq
ACROOS
Time to execute Levenshtein function - 0.195819 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.333076 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027437 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037178 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000958 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.