A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to ideas in Corpus

Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
ideas (0) - 145 freq
idees (1) - 3 freq
iydeas (1) - 1 freq
idea (1) - 550 freq
ideal (1) - 34 freq
idea' (1) - 1 freq
deas (1) - 1 freq
ideals (1) - 6 freq
windeas (2) - 1 freq
adead (2) - 1 freq
idheal (2) - 1 freq
peas (2) - 52 freq
idaias (2) - 6 freq
iday (2) - 19 freq
rides (2) - 16 freq
deys (2) - 9 freq
das (2) - 23 freq
adews (2) - 1 freq
ie's (2) - 1 freq
'dear (2) - 4 freq
ikea (2) - 8 freq
ineae (2) - 1 freq
idols (2) - 4 freq
beas (2) - 3 freq
tides (2) - 22 freq
Double Levenshtein
ideas (0) - 145 freq
idees (1) - 3 freq
iydeas (1) - 1 freq
deas (1) - 1 freq
des (2) - 23 freq
deys (2) - 9 freq
idase (2) - 1 freq
das (2) - 23 freq
deus (2) - 17 freq
deis (2) - 1 freq
odes (2) - 5 freq
ids (2) - 63 freq
dees (2) - 41 freq
ideal (2) - 34 freq
ideals (2) - 6 freq
idea' (2) - 1 freq
idaias (2) - 6 freq
idea (2) - 550 freq
dies (3) - 5 freq
iden (3) - 2 freq
ida's (3) - 1 freq
deks (3) - 1 freq
hides (3) - 14 freq
debs (3) - 1 freq
dean (3) - 18 freq
SoundEx code - I320
'it's - 303 freq
it's - 5354 freq
its - 3299 freq
ideas - 145 freq
idaias - 6 freq
itch - 5 freq
itchy - 36 freq
'its - 8 freq
idees - 3 freq
id's - 58 freq
ids - 63 freq
'ides - 1 freq
ida's - 1 freq
i'tuck - 5 freq
itis - 1 freq
i'dugs - 2 freq
i'days - 2 freq
i'dough - 1 freq
i'deck - 1 freq
i'dock - 1 freq
iydeas - 1 freq
ithaca - 2 freq
‘itchy - 1 freq
i'ts - 2 freq
“its - 12 freq
‘its - 3 freq
itwes - 1 freq
“it's - 5 freq
idiocy - 2 freq
‘it's - 1 freq
itz - 2 freq
its' - 2 freq
itzjo - 1 freq
it’s - 256 freq
idjz - 1 freq
iddq - 1 freq
iatj - 1 freq
idk - 1 freq
idzg - 1 freq
itsa - 1 freq
itc - 1 freq
 'it’s - 1 freq
‘it’s - 1 freq
it's' - 2 freq
idiocy' - 1 freq
idase - 1 freq
MetaPhone code - ITS
'it's - 303 freq
it's - 5354 freq
its - 3299 freq
ideas - 145 freq
idaias - 6 freq
'its - 8 freq
idees - 3 freq
id's - 58 freq
ids - 63 freq
'ides - 1 freq
ida's - 1 freq
itis - 1 freq
i'days - 2 freq
iydeas - 1 freq
i'ts - 2 freq
“its - 12 freq
‘its - 3 freq
“it's - 5 freq
idiocy - 2 freq
‘it's - 1 freq
itz - 2 freq
its' - 2 freq
it’s - 256 freq
itsa - 1 freq
 'it’s - 1 freq
‘it’s - 1 freq
it's' - 2 freq
idiocy' - 1 freq
idase - 1 freq
Manually curated - IDEAS
idea - 550 freq
ideas - 145 freq
Time to execute Levenshtein function - 0.304195 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
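As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this page runs; the function name `levenshtein` is chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("ideas", "idees"))  # 1
print(levenshtein("ideas", "peas"))   # 2
```

Both results agree with the bracketed distances in the Levenshtein listing.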
Time to execute Double Levenshtein function - 0.525003 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
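The description can be sketched in Python as follows. Two things here are assumptions rather than facts from this page: the helper names, and the vowel set `aeiouy`. Treating y as a vowel is inferred from the listing, where iydeas scores 1, not 2.

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel (inferred from the listing)


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and any other characters in order."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the vowel-less forms."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("ideas", "idees"))  # 1: the consonants are identical
print(double_levenshtein("ideas", "idea"))   # 2: the missing s is counted twice
```

This is why idea drops from distance 1 in the Levenshtein listing to distance 2 here, while idees stays at 1.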
Time to execute SoundEx function - 0.066005 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
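A minimal Python sketch of the common American Soundex variant shows how such a code is built: keep the first letter, map the remaining consonants to digits, skip repeats of the same digit, and pad to four characters. Whether this page uses exactly this variant is an assumption, and so is the choice to ignore apostrophes and other non-letters.

```python
# Digit assigned to each consonant group; vowels, h, w and y get no digit.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if word has no letters."""
    letters = [c for c in word.upper() if "A" <= c <= "Z"]  # drop punctuation
    if not letters:
        return ""
    out = [letters[0]]
    prev = CODES.get(letters[0], "")
    for c in letters[1:]:
        if c in "HW":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(c, "")
        if code and code != prev:
            out.append(code)
        prev = code  # a vowel resets prev, so a repeated digit is kept after it
    return ("".join(out) + "000")[:4]


print(soundex("ideas"))  # I320
print(soundex("it's"))   # I320
```

Because only the first letter and the first three coded consonants survive, words as different as ideas, it's and ithaca share the code I320, which explains the length of the SoundEx listing.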
Time to execute MetaPhone function - 0.091955 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001109 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
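A lookup of this kind can be sketched in Python with a plain dictionary. The table below is hypothetical: it holds only the two forms shown in the curated listing, and the lemma label `idea` and the function name `curated_neighbours` are assumptions made for the example.

```python
# Hypothetical hand-built lexicon: surface form -> lemma.
LEXICON = {
    "idea": "idea",
    "ideas": "idea",
}


def curated_neighbours(word: str, lexicon: dict = LEXICON) -> list:
    """Return every form sharing the word's lemma, or [] if the word is not covered."""
    lemma = lexicon.get(word.lower())
    if lemma is None:
        return []  # not all words are covered
    return sorted(form for form, target in lexicon.items() if target == lemma)


print(curated_neighbours("ideas"))   # ['idea', 'ideas']
print(curated_neighbours("sonsie"))  # []
```

A dictionary lookup involves no distance calculation, which fits the very short execution time reported for this method.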