A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to infodump in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
infodump (0) - 1 freq
inform (3) - 18 freq
informt (3) - 1 freq
infoarm (3) - 3 freq
informs (3) - 9 freq
inouth (4) - 1 freq
infur (4) - 1 freq
forum (4) - 37 freq
introduce (4) - 35 freq
info (4) - 36 freq
tfdmp (4) - 1 freq
infoarmin (4) - 1 freq
noup (4) - 2 freq
income (4) - 16 freq
infoarmed (4) - 2 freq
infirm (4) - 2 freq
influx (4) - 1 freq
frump (4) - 1 freq
infurms (4) - 1 freq
endup (4) - 1 freq
nondum (4) - 1 freq
infobox (4) - 1 freq
informal (4) - 18 freq
informed (4) - 19 freq
infor (4) - 1 freq
infodump (0) - 1 freq
tfdmp (5) - 1 freq
inform (5) - 18 freq
informs (5) - 9 freq
informt (5) - 1 freq
infoarm (5) - 3 freq
informed (6) - 19 freq
informal (6) - 18 freq
infamy (6) - 5 freq
endup (6) - 1 freq
dump (6) - 31 freq
infoarmal (6) - 1 freq
informit (6) - 1 freq
informin (6) - 4 freq
infurms (6) - 1 freq
indycamp (6) - 3 freq
nfum (6) - 2 freq
nondum (6) - 1 freq
infoarmed (6) - 2 freq
infoarmin (6) - 1 freq
frump (6) - 1 freq
infirm (6) - 2 freq
infamous (7) - 7 freq
nfudc (7) - 8 freq
damp (7) - 83 freq
SoundEx code - I513
impatiently - 20 freq
invitation - 27 freq
inbetween - 3 freq
inevitably - 10 freq
infatuation - 1 freq
impatient - 16 freq
inveetit - 4 freq
inobedience - 2 freq
inevitable - 29 freq
impediment - 10 freq
invite - 44 freq
invitin - 16 freq
inpit - 49 freq
invited - 38 freq
impident - 8 freq
invaders - 9 freq
invites - 11 freq
impudent - 2 freq
impidence - 8 freq
invitit - 39 freq
inpitten - 2 freq
inaboot - 35 freq
impty - 14 freq
invade - 5 freq
infatuated - 1 freq
invadit - 5 freq
invadin - 5 freq
invitatioun - 1 freq
invaded - 4 freq
'invited' - 2 freq
impidint - 1 freq
invitit's - 1 freq
impetus - 3 freq
impeedence - 1 freq
inhabitants - 8 freq
inhabit - 3 freq
input - 4 freq
invitet - 4 freq
invading - 1 freq
inaboots - 3 freq
innovations - 4 freq
inviterate - 1 freq
inpits - 5 freq
inpittin - 1 freq
inveit - 5 freq
invitaetion - 2 freq
infodump - 1 freq
invitin's - 1 freq
inviting - 3 freq
in-aboots - 1 freq
in-aboot - 1 freq
inybuddy - 1 freq
invitations - 4 freq
innovation - 6 freq
innovative - 9 freq
in-betweenies - 2 freq
impit - 1 freq
inapt - 1 freq
invye't - 1 freq
inbidin - 5 freq
inbye't - 1 freq
ineptly - 1 freq
inhibit - 2 freq
invyte - 1 freq
invied - 1 freq
inhabiters - 1 freq
inveterate - 1 freq
in-between - 1 freq
in-pittin - 1 freq
infit - 1 freq
inveits - 1 freq
inhabited - 2 freq
impediments - 2 freq
inevitability - 2 freq
innovator - 1 freq
impetuous - 2 freq
impudence - 2 freq
invitingly - 1 freq
inhabitit - 1 freq
impiddently - 1 freq
imbued - 1 freq
impotence - 1 freq
inhibition - 1 freq
inabootcomer - 1 freq
ianbhood - 2 freq
inhibitions - 1 freq
inept - 1 freq
ineptitude - 1 freq
infidelnumber - 5 freq
MetaPhone code - INFTMP
infodump - 1 freq
INFODUMP
Time to execute Levenshtein function - 0.255080 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.423862 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029197 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.043186 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001108 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.