A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Words similar to "abstract" in the corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (edit distance in brackets)
abstract (0) - 12 freq
abstrack (1) - 4 freq
astrict (2) - 3 freq
attract (2) - 6 freq
distract (2) - 12 freq
detract (3) - 2 freq
absorbit (3) - 4 freq
extract (3) - 14 freq
restrict (3) - 1 freq
stracht (3) - 75 freq
atrack (3) - 1 freq
astray (3) - 3 freq
subtract (3) - 1 freq
abstain (3) - 6 freq
strait (3) - 6 freq
strack (3) - 9 freq
attrack (3) - 4 freq
attracts (3) - 2 freq
strict (3) - 16 freq
straet (3) - 1 freq
contract (3) - 26 freq
district (3) - 33 freq
tract (3) - 1 freq
instruct (3) - 4 freq
abstraction (3) - 3 freq

Double Levenshtein (combined distance in brackets)
abstract (0) - 12 freq
abstrack (2) - 4 freq
distract (3) - 12 freq
astrict (3) - 3 freq
restrict (4) - 1 freq
instruct (4) - 4 freq
strict (4) - 16 freq
district (4) - 33 freq
attract (4) - 6 freq
abstraction (4) - 3 freq
straet (5) - 1 freq
contract (5) - 26 freq
astricts (5) - 1 freq
strayt (5) - 1 freq
strach (5) - 1 freq
tract (5) - 1 freq
bastart (5) - 67 freq
stracht (5) - 75 freq
extract (5) - 14 freq
absorbit (5) - 4 freq
subtract (5) - 1 freq
detract (5) - 2 freq
strait (5) - 6 freq
strack (5) - 9 freq
districk (6) - 3 freq
SoundEx code - A123
apostrophes - 17 freq
apostrophe - 13 freq
affectionately - 5 freq
affection - 22 freq
affecting - 8 freq
awbesit - 2 freq
affectin - 1 freq
affected - 10 freq
affixed - 1 freq
aff-cut - 1 freq
abstraction - 3 freq
afaistane - 5 freq
affset - 2 freq
abstainit - 3 freq
affect - 10 freq
abstrack - 4 freq
affects - 9 freq
abjoot - 1 freq
abstain - 6 freq
affest - 1 freq
affections - 2 freq
apostles - 6 freq
apostrophe' - 2 freq
abasht - 1 freq
affstage - 1 freq
abused - 6 freq
affectionate - 3 freq
affside - 4 freq
affectation - 4 freq
aff-stage - 3 freq
aff-step - 1 freq
abstract - 12 freq
apostophe - 1 freq
abstractions - 1 freq
abstains - 2 freq
aufsteigt - 1 freq
apostolic - 1 freq
affectioun - 1 freq
affectit - 11 freq
avysit - 1 freq
avocado - 2 freq
apostolis - 1 freq
abuised - 1 freq
avast - 3 freq
apposeet - 1 freq
affshoots - 1 freq
aff-piste - 1 freq
affshuit - 1 freq
appeased - 1 freq
afektan - 1 freq
awfastraight - 1 freq
avacado - 1 freq
abcedminded - 1 freq
afcct - 3 freq
afecwuat - 1 freq
avochat - 1 freq
awbesits - 1 freq
afoust - 1 freq
avocados - 1 freq
aavzidhr - 1 freq
abstained - 1 freq
afctranent - 6 freq
abstainin - 1 freq
afcdunbar - 1 freq
abstaining - 1 freq
appggtr - 1 freq
abbeycottage - 1 freq
MetaPhone code - ABSTRKT
abstract - 12 freq
Manually curated - ABSTRACT
Time to execute Levenshtein function - 0.290060 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
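As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in b so each row stays as small as possible.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("abstract", "abstrack", "attract", "tract"):
        print(word, levenshtein("abstract", word))
```

Run against the query word, this reproduces the bracketed distances listed for the same words in the Levenshtein results.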
Time to execute Double Levenshtein function - 0.364224 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
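The doubled measure can be sketched in Python as follows. This is an illustrative reconstruction, not the site's own code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating only a, e, i, o, u as vowels is an assumption, since the page does not say which letters are removed.

```python
VOWELS = set("aeiou")  # assumption: the page does not list its vowel set


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("abstrack", "distract", "attract", "tract"):
        print(word, double_levenshtein("abstract", word))
```

Under these assumptions the sketch agrees with the bracketed Double Levenshtein scores shown for those four words.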
Time to execute SoundEx function - 0.027680 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
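To show how very different spellings end up under one code, here is a small Python sketch of the commonly described American Soundex rules. It is an assumption that the site uses this variant; the function name `soundex` and the handling of non-letters are choices made for the example.

```python
def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus three digits."""
    codes = {}
    for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in group:
            codes[ch] = digit

    # Assumption: hyphens, apostrophes and other non-letters are ignored.
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        digit = codes.get(ch, "")
        if digit and digit != previous:
            result += digit
        previous = digit  # vowels reset this, so a repeated code counts again
    return (result + "000")[:4]  # pad with zeros, keep four characters


if __name__ == "__main__":
    for word in ("abstract", "apostrophe", "affection", "avocado"):
        print(word, soundex(word))
```

Because only the first letter and the first three coded consonants survive, words as unlike as "abstract" and "avocado" share the code A123, which is why the SoundEx list is so much longer and noisier than the others.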
Time to execute MetaPhone function - 0.037670 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000974 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
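A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical: only the link from "abstract" to ABSTRACT appears on this page, and the "abstrack" entry, the `LEXICON` name and the `lookup_lemma` helper are invented for illustration.

```python
from typing import Optional

# Hypothetical hand-made lexicon: surface form -> lemma.
LEXICON = {
    "abstract": "ABSTRACT",  # shown in the results on this page
    "abstrack": "ABSTRACT",  # assumed spelling variant, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    for word in ("abstract", "Abstrack", "sonsie"):
        print(word, lookup_lemma(word))
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having by far the shortest execution time reported above.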