A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to orthographies in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
orthographies (0) - 9 freq
orthographie (1) - 3 freq
orthographic (2) - 6 freq
orthographical (3) - 1 freq
orthography (3) - 50 freq
autographs (5) - 2 freq
owthorities (5) - 2 freq
demographics (5) - 3 freq
lithograph (5) - 1 freq
biographie (5) - 3 freq
therapies (5) - 2 freq
irthografee (5) - 4 freq
cartographers (5) - 2 freq
orthodoxies (5) - 2 freq
geographie (5) - 2 freq
autographed (5) - 2 freq
orthographically (5) - 1 freq
aetgraphics (5) - 1 freq
thrapples (6) - 18 freq
photgraphic (6) - 1 freq
hagiographer (6) - 1 freq
authorities (6) - 37 freq
graphics (6) - 3 freq
boorachies (6) - 3 freq
stenographers (6) - 1 freq
orthographies (0) - 9 freq
orthographie (2) - 3 freq
orthographic (3) - 6 freq
orthography (4) - 50 freq
orthographical (5) - 1 freq
lithograph (7) - 1 freq
autographs (7) - 2 freq
autographed (8) - 2 freq
orthographically (8) - 1 freq
cartographers (8) - 2 freq
aetgraphics (8) - 1 freq
irthografee (8) - 4 freq
therapies (8) - 2 freq
trophies (9) - 5 freq
autograph (9) - 1 freq
stenographers (9) - 1 freq
monographs (9) - 2 freq
telegraphs (9) - 1 freq
photographs (9) - 6 freq
orthodoxies (9) - 2 freq
biographie (9) - 3 freq
demographics (9) - 3 freq
geographie (9) - 2 freq
owthorities (9) - 2 freq
geographic (10) - 4 freq
SoundEx code - O632
owre-watchfu - 1 freq
orthography - 50 freq
owerdose - 1 freq
owertuk - 1 freq
owertak - 2 freq
orthographic - 6 freq
owertakan - 2 freq
owreheids - 1 freq
owrtesettin - 1 freq
orts - 3 freq
owertook - 2 freq
orthographies - 9 freq
owretak - 1 freq
orthographie - 3 freq
orthographical - 1 freq
ower-ticht - 1 freq
owertaks - 1 freq
owertakken - 1 freq
ortygtp - 1 freq
orhotchkiss - 1 freq
orthographically - 1 freq
o’wurds - 2 freq
MetaPhone code - OR0KRFS
orthographies - 9 freq
ORTHOGRAPHIES
Time to execute Levenshtein function - 0.216672 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.445144 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027379 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037503 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000930 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.