A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to abstain in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
abstain (0) - 6 freq
abstains (1) - 2 freq
abstained (2) - 1 freq
obtain (2) - 2 freq
abatin (2) - 1 freq
astair (2) - 1 freq
abettin (2) - 2 freq
abstainit (2) - 3 freq
stain (2) - 26 freq
distain (2) - 1 freq
attain (2) - 2 freq
sustain (2) - 5 freq
abstainin (2) - 1 freq
astarn (2) - 1 freq
abain (2) - 23 freq
austin (2) - 4 freq
aistlin (2) - 1 freq
'spain (3) - 2 freq
askin (3) - 176 freq
altai (3) - 2 freq
testan (3) - 1 freq
upstair (3) - 5 freq
restin (3) - 29 freq
stair (3) - 173 freq
castan (3) - 1 freq
abstain (0) - 6 freq
abstains (2) - 2 freq
abstainin (3) - 1 freq
sustain (3) - 5 freq
austin (3) - 4 freq
bustan (3) - 1 freq
bastin (3) - 1 freq
stain (3) - 26 freq
distain (3) - 1 freq
abstained (3) - 1 freq
obtain (3) - 2 freq
abettin (3) - 2 freq
abstainit (3) - 3 freq
abatin (3) - 1 freq
abusin (4) - 3 freq
buttin (4) - 5 freq
costan (4) - 1 freq
wastan (4) - 1 freq
hostin (4) - 16 freq
wystin (4) - 2 freq
justin (4) - 6 freq
rostin (4) - 2 freq
styin (4) - 4 freq
staan (4) - 13 freq
costin (4) - 5 freq
SoundEx code - A123
apostrophes - 17 freq
apostrophe - 13 freq
affectionately - 5 freq
affection - 22 freq
affecting - 8 freq
awbesit - 2 freq
affectin - 1 freq
affected - 10 freq
affixed - 1 freq
aff-cut - 1 freq
abstraction - 3 freq
afaistane - 5 freq
affset - 2 freq
abstainit - 3 freq
affect - 10 freq
abstrack - 4 freq
affects - 9 freq
abjoot - 1 freq
abstain - 6 freq
affest - 1 freq
affections - 2 freq
apostles - 6 freq
apostrophe' - 2 freq
abasht - 1 freq
affstage - 1 freq
abused - 6 freq
affectionate - 3 freq
affside - 4 freq
affectation - 4 freq
aff-stage - 3 freq
aff-step - 1 freq
abstract - 12 freq
apostophe - 1 freq
abstractions - 1 freq
abstains - 2 freq
aufsteigt - 1 freq
apostolic - 1 freq
affectioun - 1 freq
affectit - 11 freq
avysit - 1 freq
avocado - 2 freq
apostolis - 1 freq
abuised - 1 freq
avast - 3 freq
apposeet - 1 freq
affshoots - 1 freq
aff-piste - 1 freq
affshuit - 1 freq
appeased - 1 freq
afektan - 1 freq
awfastraight - 1 freq
avacado - 1 freq
abcedminded - 1 freq
afcct - 3 freq
afecwuat - 1 freq
avochat - 1 freq
awbesits - 1 freq
afoust - 1 freq
avocados - 1 freq
aavzidhr - 1 freq
abstained - 1 freq
afctranent - 6 freq
abstainin - 1 freq
afcdunbar - 1 freq
abstaining - 1 freq
appggtr - 1 freq
abbeycottage - 1 freq
MetaPhone code - ABSTN
abstain - 6 freq
ABSTAIN
Time to execute Levenshtein function - 0.208353 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.361567 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029590 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.044417 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000842 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.