A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to question in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
question (0) - 410 freq
quastion (1) - 2 freq
questions (1) - 193 freq
quistion (1) - 8 freq
questiont (1) - 1 freq
questin (1) - 11 freq
'question (1) - 4 freq
queestion (1) - 17 freq
questionÂ’ (2) - 1 freq
questiones (2) - 1 freq
quaistion (2) - 21 freq
quastioun (2) - 10 freq
quistions (2) - 1 freq
queetin (2) - 1 freq
queestions (2) - 4 freq
question's (2) - 1 freq
questiouns (2) - 1 freq
questionin (2) - 11 freq
questins (2) - 1 freq
quentin (2) - 6 freq
questioned (2) - 10 freq
cushion (3) - 28 freq
function (3) - 32 freq
election (3) - 87 freq
quotin (3) - 9 freq
question (0) - 410 freq
quastion (1) - 2 freq
questin (1) - 11 freq
queestion (1) - 17 freq
quistion (1) - 8 freq
quaistion (2) - 21 freq
quastioun (2) - 10 freq
questiont (2) - 1 freq
questions (2) - 193 freq
'question (2) - 4 freq
questins (3) - 1 freq
questionin (3) - 11 freq
questioned (3) - 10 freq
quaesten (3) - 2 freq
quentin (3) - 6 freq
questiouns (3) - 1 freq
quistions (3) - 1 freq
queetin (3) - 1 freq
questiones (3) - 1 freq
queestions (3) - 4 freq
justin (4) - 6 freq
nestin (4) - 3 freq
peston (4) - 2 freq
quoitin (4) - 1 freq
quiltin (4) - 1 freq
SoundEx code - Q235
question - 410 freq
qweestions - 1 freq
questions - 193 freq
quaisten - 23 freq
questionnaire - 6 freq
quaistens - 13 freq
questionin - 11 freq
questin - 11 freq
'question - 4 freq
questenin - 1 freq
question's - 1 freq
questioned - 10 freq
questins - 1 freq
questionable - 3 freq
quaistion - 21 freq
quaistions - 5 freq
question-ask-everyunionist-gers - 1 freq
queestion - 17 freq
queestions - 4 freq
questionsas - 1 freq
quastioun - 10 freq
question-like - 1 freq
qestyins - 1 freq
quistion - 8 freq
quistions - 1 freq
questiouns - 1 freq
quaistiouns - 2 freq
€˜quaisten - 1 freq
quaistenin - 1 freq
quaistent - 1 freq
questiones - 1 freq
quastiouned - 1 freq
quastiouns - 4 freq
quaistins - 1 freq
quastion - 2 freq
quaisteen - 1 freq
quick-thinking - 1 freq
questiont - 1 freq
questioninly - 1 freq
questioning - 1 freq
quaesten - 2 freq
questionÂ’ - 1 freq
MetaPhone code - KSXN
question - 410 freq
'question - 4 freq
quaistion - 21 freq
queestion - 17 freq
quastioun - 10 freq
quistion - 8 freq
quastion - 2 freq
questionÂ’ - 1 freq
QUESTION
question - 410 freq
questions - 193 freq
questioned - 10 freq
questioning - 1 freq
questionin - 11 freq
questioned - 10 freq
questionnaire - 6 freq
Time to execute Levenshtein function - 0.218118 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.373487 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028113 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.046288 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001286 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.