A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Words similar to "answers" in the Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
answers (0) - 74 freq
answer's (1) - 3 freq
answer- (1) - 1 freq
answert (1) - 158 freq
answer (1) - 444 freq
ans'ers (1) - 1 freq
unswers (1) - 1 freq
answered (2) - 116 freq
anglers (2) - 1 freq
answerit (2) - 1 freq
answeir (2) - 2 freq
unswer (2) - 6 freq
pansers (2) - 2 freq
awnsers (2) - 1 freq
anyweys (2) - 1 freq
aunswer (2) - 2 freq
antlers (2) - 11 freq
anklers (2) - 1 freq
ainswert (2) - 1 freq
aunswert (2) - 1 freq
awnswer (2) - 2 freq
answerin (2) - 21 freq
ansuer (2) - 1 freq
answeirt (2) - 1 freq
answeran (2) - 1 freq
Double Levenshtein (distance in brackets)
answers (0) - 74 freq
unswers (1) - 1 freq
answer's (2) - 3 freq
answer- (2) - 1 freq
ans'ers (2) - 1 freq
answert (2) - 158 freq
answer (2) - 444 freq
answeirt (3) - 1 freq
aunswer (3) - 2 freq
answerin (3) - 21 freq
aunswert (3) - 1 freq
answeran (3) - 1 freq
ainswert (3) - 1 freq
answerit (3) - 1 freq
answeir (3) - 2 freq
nswer (3) - 1 freq
unswer (3) - 6 freq
answered (3) - 116 freq
anters (4) - 1 freq
anders (4) - 1 freq
swears (4) - 5 freq
unawars (4) - 1 freq
sewers (4) - 5 freq
'answer (4) - 2 freq
sweirs (4) - 2 freq
SoundEx code - A526
answer - 444 freq
answert - 158 freq
angry - 84 freq
angrily - 19 freq
answerin - 21 freq
ainswert - 1 freq
answers - 74 freq
answered - 116 freq
anchor - 28 freq
angert - 16 freq
angriest - 2 freq
anger - 73 freq
angert-like - 1 freq
aunswert - 1 freq
angrier - 6 freq
answerable - 2 freq
anchored - 10 freq
aunchorit - 1 freq
angered - 2 freq
awnswer - 2 freq
'answer - 2 freq
answer't - 18 freq
anachronistic - 2 freq
answeran - 1 freq
answer's - 3 freq
ang'r - 2 freq
ang'rie - 7 freq
ans'ers - 1 freq
anchort - 3 freq
anacreon - 1 freq
anggert - 1 freq
angirt-lik - 1 freq
awnser - 5 freq
annacker's - 2 freq
anocher - 1 freq
anchorin - 1 freq
angerow - 1 freq
ansuer - 1 freq
an-soor - 1 freq
anchors - 1 freq
ainger - 3 freq
angry' - 1 freq
aince-a-year - 1 freq
answer-na - 1 freq
angersome - 1 freq
answer''t - 1 freq
anchorage - 1 freq
aunser - 2 freq
answerit - 1 freq
answering - 4 freq
awnsert - 4 freq
awnsers - 1 freq
angirt - 3 freq
aunsert - 1 freq
anachronisms - 2 freq
anser - 5 freq
anchoring - 1 freq
aunswer - 2 freq
aunsir - 1 freq
ancrum - 1 freq
answer- - 1 freq
angeret - 1 freq
answeirt - 1 freq
angreen - 1 freq
answeir - 2 freq
ansir - 1 freq
anmgroup - 1 freq
angryscotland - 3 freq
answeryourphonr - 1 freq
anassarwar - 9 freq
angrya - 3 freq
amycharlton - 1 freq
anagram - 1 freq
amshru - 1 freq
angrysalmond - 3 freq
amygear - 4 freq
angrykid - 1 freq
ansar - 1 freq
amygarbett - 1 freq
MetaPhone code - ANSWRS
answers - 74 freq
answer's - 3 freq
Manually curated
ANSWERS
answer - 444 freq
answers - 74 freq
answered - 116 freq
answering - 4 freq
answerin - 21 freq
Time to execute Levenshtein function - 0.256096 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
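As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming edit distance. It is not the code this site runs; the function name `levenshtein` is chosen for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions
    or deletions needed to turn a into b."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("answers", "answer", "answered", "anglers"):
        print(word, levenshtein("answers", word))
```

Run against a few words from the Levenshtein list on this page, the sketch gives the same distances as the figures shown in brackets there.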
Time to execute Double Levenshtein function - 0.454785 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
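The idea can be sketched in Python as follows. This is an illustration under assumptions, not the site's own code: the vowel set (a, e, i, o, u only, with y treated as a consonant) is a guess, and the names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


if __name__ == "__main__":
    for word in ("unswers", "answer", "answered", "swears"):
        print(word, double_levenshtein("answers", word))
```

Under these assumptions a vowel-only change such as unswers costs 1, while a consonant change such as answer costs 2, which is the consonant weighting described above.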
Time to execute SoundEx function - 0.032852 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
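For readers who want to see how such a code is built, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad or cut to four characters. It is an illustration only; the site's own implementation may differ in edge cases, and dropping non-letters such as apostrophes and hyphens before encoding is an assumption.

```python
# Consonant groups and their Soundex digits
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no letters."""
    # Assumption: anything that is not a-z (apostrophes, hyphens) is ignored
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
        previous = digit  # a vowel resets this, so repeats across vowels count
    return (code + "000")[:4]


if __name__ == "__main__":
    for word in ("answers", "angry", "anchor", "amygear"):
        print(word, soundex(word))
```

All four sample words reduce to the same code, which is why unrelated words such as angry and anchor appear in the SoundEx list alongside genuine spelling variants of answers.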
Time to execute MetaPhone function - 0.043590 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001110 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
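A lookup of this kind can be sketched in Python as a plain dictionary. Everything here is illustrative: the table holds only the five forms listed under ANSWERS on this page, the real lexicon's format is not known, and the names `LEXICON` and `related_forms` are hypothetical.

```python
# Hypothetical hand-built table: word form -> group heading (lemma)
LEXICON = {
    "answer": "ANSWERS",
    "answers": "ANSWERS",
    "answered": "ANSWERS",
    "answering": "ANSWERS",
    "answerin": "ANSWERS",
}


def related_forms(word: str) -> list[str]:
    """Return every form that shares a heading with word.

    Words missing from the table return an empty list, since
    not all words are covered.
    """
    heading = LEXICON.get(word.lower())
    if heading is None:
        return []
    return [form for form, h in LEXICON.items() if h == heading]


if __name__ == "__main__":
    print(related_forms("answerin"))
    print(related_forms("sonsie"))
```

Because the lookup is a single dictionary access rather than a comparison against every word in the corpus, it is plausible that this method is the fastest of the five, as the timings above suggest.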