A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to answer in Corpus

Levenshtein
answer (0) - 443 freq
answeir (1) - 2 freq
answers (1) - 74 freq
'answer (1) - 2 freq
anser (1) - 5 freq
nswer (1) - 1 freq
ansuer (1) - 1 freq
unswer (1) - 6 freq
aunswer (1) - 2 freq
awnswer (1) - 2 freq
answer- (1) - 1 freq
answert (1) - 158 freq
niwer (2) - 28 freq
anwey (2) - 1 freq
aner (2) - 2 freq
awnser (2) - 5 freq
aunser (2) - 2 freq
ans'ers (2) - 1 freq
answerin (2) - 21 freq
anywey (2) - 36 freq
hansper (2) - 1 freq
aister (2) - 2 freq
ansir (2) - 1 freq
aunswert (2) - 1 freq
inower (2) - 10 freq
Double Levenshtein
answer (0) - 443 freq
aunswer (1) - 2 freq
unswer (1) - 6 freq
nswer (1) - 1 freq
answeir (1) - 2 freq
answer- (2) - 1 freq
awnswer (2) - 2 freq
answert (2) - 158 freq
ansuer (2) - 1 freq
answers (2) - 74 freq
'answer (2) - 2 freq
anser (2) - 5 freq
unser (3) - 3 freq
answeirt (3) - 1 freq
ainswert (3) - 1 freq
enster (3) - 2 freq
ansar (3) - 1 freq
unswers (3) - 1 freq
inower (3) - 10 freq
answerit (3) - 1 freq
newer (3) - 7 freq
answeran (3) - 1 freq
ainster (3) - 2 freq
aunswert (3) - 1 freq
aunser (3) - 2 freq
SoundEx code - A526
answer - 443 freq
answert - 158 freq
angry - 84 freq
angrily - 19 freq
answerin - 21 freq
ainswert - 1 freq
answers - 74 freq
answered - 115 freq
anchor - 28 freq
angert - 16 freq
angriest - 2 freq
anger - 71 freq
angert-like - 1 freq
aunswert - 1 freq
angrier - 6 freq
answerable - 2 freq
anchored - 10 freq
aunchorit - 1 freq
angered - 2 freq
awnswer - 2 freq
'answer - 2 freq
answer't - 18 freq
anachronistic - 2 freq
answeran - 1 freq
answer's - 3 freq
ang'r - 2 freq
ang'rie - 7 freq
ans'ers - 1 freq
anchort - 3 freq
anacreon - 1 freq
anggert - 1 freq
angirt-lik - 1 freq
awnser - 5 freq
annacker's - 2 freq
anocher - 1 freq
anchorin - 1 freq
angerow - 1 freq
ansuer - 1 freq
an-soor - 1 freq
anchors - 1 freq
ainger - 3 freq
angry' - 1 freq
aince-a-year - 1 freq
answer-na - 1 freq
angersome - 1 freq
answer''t - 1 freq
anchorage - 1 freq
aunser - 2 freq
answerit - 1 freq
answering - 4 freq
awnsert - 4 freq
awnsers - 1 freq
angirt - 3 freq
aunsert - 1 freq
anachronisms - 2 freq
anser - 5 freq
anchoring - 1 freq
aunswer - 2 freq
aunsir - 1 freq
ancrum - 1 freq
answer- - 1 freq
angeret - 1 freq
answeirt - 1 freq
angreen - 1 freq
answeir - 2 freq
ansir - 1 freq
anmgroup - 1 freq
angryscotland - 3 freq
answeryourphonr - 1 freq
anassarwar - 9 freq
angrya - 3 freq
amycharlton - 1 freq
anagram - 1 freq
amshru - 1 freq
angrysalmond - 3 freq
amygear - 4 freq
angrykid - 1 freq
ansar - 1 freq
amygarbett - 1 freq
MetaPhone code - ANSWR
answer - 443 freq
awnswer - 2 freq
'answer - 2 freq
aunswer - 2 freq
answer- - 1 freq
answeir - 2 freq
Manually curated lemma - ANSWER
answer - 443 freq
answers - 74 freq
answered - 115 freq
answering - 4 freq
answerin - 21 freq
Time to execute Levenshtein function - 0.258963 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
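As a rough illustration of the definition above, here is a minimal Python sketch of the edit distance using the standard dynamic-programming approach. The function name `levenshtein` is chosen for this example only; it is not the code this page runs.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where insert, delete and replace each cost 1."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the first i-1 chars of a and first j chars of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("answer", "anser", "awnser", "inower"):
        print(word, levenshtein("answer", word))
```

The distances printed for these words agree with the bracketed figures in the Levenshtein list for "answer".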
Time to execute Double Levenshtein function - 0.377406 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
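The idea can be sketched in a few lines of Python. This is an illustration under assumptions, not the site's own code: the names `strip_vowels` and `double_levenshtein` are made up here, and the vowel set is assumed to be a, e, i, o, u (the page does not say how "y" or accented letters are treated).

```python
VOWELS = set("aeiou")  # assumption: the page does not list its vowel set


def levenshtein(a: str, b: str) -> int:
    """Edit distance where insert, delete and replace each cost 1."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every assumed vowel, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("aunswer", "answers", "unser", "inower"):
        print(word, double_levenshtein("answer", word))
```

Under these assumptions a vowel-only variant such as "aunswer" scores 1 (its consonant skeleton is unchanged), while "answers" scores 2 because the extra consonant is counted in both passes, which matches the figures in the Double Levenshtein list.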
Time to execute SoundEx function - 0.027308 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
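To show how words as different as "answer", "angry" and "anchor" end up with the same code, here is a minimal sketch of the common American Soundex rules in Python. It is an illustration only: the function name `soundex` and the handling of non-letters (they are skipped) are assumptions, and the implementation behind this page may differ in detail.

```python
# Consonant groups of standard Soundex; vowels, y, h and w have no code.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the first letter plus three digits, e.g. 'A526'."""
    # Assumption: anything that is not a-z (apostrophes, hyphens) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated code counts again
        if len(result) == 4:
            break
    return result.ljust(4, "0")  # pad short codes with zeros


if __name__ == "__main__":
    for word in ("answer", "angry", "anchor", "annacker's"):
        print(word, soundex(word))
```

Because only the first letter and the next three consonant classes survive, the code is very coarse, which is why the SoundEx list is so much longer and noisier than the others.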
Time to execute MetaPhone function - 0.037359 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001043 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
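A lookup of this kind might be sketched as follows in Python. The table layout and the names `LEMMA_OF` and `curated_neighbours` are hypothetical; only the five forms listed under ANSWER above are taken from the page.

```python
# Hypothetical hand-built lexicon: each word form points to its lemma.
LEMMA_OF = {
    "answer": "ANSWER",
    "answers": "ANSWER",
    "answered": "ANSWER",
    "answering": "ANSWER",
    "answerin": "ANSWER",
}


def curated_neighbours(word: str) -> list[str]:
    """Return every form sharing the word's lemma, or [] if it is not covered."""
    lemma = LEMMA_OF.get(word.lower())
    if lemma is None:
        return []  # not all words are covered by the lexicon
    return [form for form, other in LEMMA_OF.items() if other == lemma]


if __name__ == "__main__":
    print(curated_neighbours("answerin"))
    print(curated_neighbours("ablow"))
```

A plain dictionary lookup involves no string comparison at all, which is consistent with this method having the shortest execution time reported above.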