A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to misheard in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
misheard (0) - 2 freq
misread (2) - 2 freq
misleart (2) - 1 freq
mislead (2) - 3 freq
disregard (3) - 1 freq
misers (3) - 1 freq
pithead (3) - 1 freq
miscaad (3) - 2 freq
shears (3) - 11 freq
missed (3) - 192 freq
mither' (3) - 4 freq
fisher (3) - 41 freq
mither (3) - 1376 freq
mishearing (3) - 1 freq
mashed (3) - 5 freq
fishers (3) - 13 freq
pished (3) - 31 freq
shard (3) - 2 freq
mulhearn (3) - 1 freq
misted (3) - 2 freq
pisher (3) - 1 freq
wishart (3) - 19 freq
miscaa'd (3) - 1 freq
mistery (3) - 1 freq
iainheard (3) - 1 freq
Double Levenshtein (combined distance in brackets)
misheard (0) - 2 freq
misread (3) - 2 freq
shard (4) - 2 freq
mushed (4) - 1 freq
shoard (4) - 4 freq
mashed (4) - 5 freq
sheared (4) - 6 freq
mustard (4) - 18 freq
mislead (4) - 3 freq
misleart (4) - 1 freq
meshed (4) - 1 freq
mishell (5) - 2 freq
muirhead (5) - 1 freq
miser (5) - 4 freq
unheard (5) - 7 freq
wished (5) - 61 freq
shear (5) - 9 freq
moshean (5) - 1 freq
richard (5) - 71 freq
dished (5) - 10 freq
mithert (5) - 1 freq
mister (5) - 83 freq
mishap (5) - 3 freq
mishaps (5) - 2 freq
ushered (5) - 5 freq
SoundEx code - M263
musardrie - 14 freq
miscried - 1 freq
meisured - 5 freq
moagered - 1 freq
maugered - 3 freq
migration - 10 freq
majority - 70 freq
measured - 6 freq
mccready - 6 freq
mccready's - 1 freq
mccartney's - 1 freq
misured - 4 freq
mozart - 12 freq
majoritie - 3 freq
mchardy - 8 freq
majorities - 4 freq
masqueradin - 2 freq
mccartney - 1 freq
mizzered - 2 freq
misread - 2 freq
mizhurt - 1 freq
micro-waith - 1 freq
macgordon - 1 freq
migrates - 1 freq
migrations - 1 freq
mcarthur - 11 freq
mæsjirt - 1 freq
mizzert - 1 freq
migratin - 1 freq
mccarthy - 3 freq
measur't - 1 freq
musardries - 1 freq
migrating - 1 freq
magret - 2 freq
magaret - 2 freq
misheard - 2 freq
mcgrath - 1 freq
maserati - 1 freq
micro-dialect - 1 freq
micro-dialects - 2 freq
macro-dialect - 1 freq
migrate - 1 freq
meisurt - 1 freq
migratory - 1 freq
macroidart - 1 freq
mcward - 3 freq
mcredie - 1 freq
mccreadiejoanna - 4 freq
m'carty's - 1 freq
mikerid - 1 freq
mckirdy - 1 freq
maisiewrites - 1 freq
majorette - 2 freq
magrit - 1 freq
mogert - 1 freq
macarthur - 1 freq
microdod - 1 freq
masquerading - 1 freq
mezuwrddsl - 1 freq
majored - 1 freq
MetaPhone code - MXRT
mchardy - 8 freq
misheard - 2 freq
Manually curated
MISHEARD
Time to execute Levenshtein function - 0.262190 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
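As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is just a label chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits (replace, insert, delete) turning a into b."""
    # previous[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("misheard", "misread", "mislead", "shard"):
        print(word, levenshtein("misheard", word))
```

For the query word misheard this gives 2 for misread and mislead and 3 for shard, which agrees with the figures in the Levenshtein list.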
Time to execute Double Levenshtein function - 0.802487 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
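The idea can be sketched in a few lines of Python. This is an illustration under assumptions, not the site's own code: the names `levenshtein`, `strip_vowels` and `double_levenshtein` are invented for the example, and the vowel set is assumed to be a, e, i, o, u (how the site treats y, apostrophes or accented letters is not stated).

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Remove the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("misread", "shard", "mustard", "wished"):
        print(word, double_levenshtein("misheard", word))
```

Under these assumptions misread scores 2 + 1 = 3 and shard scores 3 + 1 = 4 against misheard, matching the Double Levenshtein list.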
Time to execute SoundEx function - 0.032448 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
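To show how a code such as M263 comes about, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse repeated digits, and pad to four characters. The site's own function may differ in edge cases; the name `soundex` and the choice to ignore non-ASCII letters and punctuation are assumptions made for this example.

```python
# Digit assigned to each consonant group in American Soundex.
_GROUPS = {"1": "bfpv", "2": "cgjkqsxz", "3": "dt", "4": "l", "5": "mn", "6": "r"}
_CODES = {letter: digit for digit, letters in _GROUPS.items() for letter in letters}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no ASCII letters."""
    # Assumption: hyphens, apostrophes and non-ASCII letters are simply ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        digit = _CODES.get(ch, "")
        if digit and digit != previous:
            result += digit
            if len(result) == 4:
                break
        previous = digit  # a vowel resets this, so a repeated digit can be coded again
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("misheard", "mozart", "majority", "mchardy"):
        print(word, soundex(word))  # each of these comes out as M263
```

Because only the first letter and the next three consonant digits survive, quite different words such as misheard, mozart and majority end up in the same bucket, which is why the SoundEx list is so much longer than the others.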
Time to execute MetaPhone function - 0.072361 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001105 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
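A lookup of this kind can be pictured as a simple dictionary from spelling to lemma. The Python sketch below is purely illustrative: the table name `LEXICON`, the helper names and the MISCAA entries are hypothetical, invented to show the shape of the data rather than copied from the site's lexicon.

```python
from typing import Optional

# Hypothetical entries: spelling -> lemma. The real lexicon is curated by hand
# and its contents are not reproduced here.
LEXICON = {
    "misheard": "MISHEARD",
    "miscaad": "MISCAA",   # hypothetical lemma for illustration
    "miscaa'd": "MISCAA",  # hypothetical lemma for illustration
}


def lemma_for(word: str) -> Optional[str]:
    """Return the lemma for a spelling, or None if the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(word: str) -> list:
    """List every spelling in the table that shares the word's lemma."""
    lemma = lemma_for(word)
    if lemma is None:
        return []
    return sorted(w for w, l in LEXICON.items() if l == lemma)


if __name__ == "__main__":
    print(lemma_for("misheard"))
    print(variants_of("miscaad"))
    print(lemma_for("ahint"))  # not in this toy table, so None
```

A dictionary lookup involves no distance calculation at all, which fits with this method having the shortest execution time listed above.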