A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to definition in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
definition (0) - 30 freq
definitions (1) - 5 freq
defineition (1) - 2 freq
defineetion (2) - 9 freq
definitive (2) - 8 freq
detention (3) - 10 freq
definiteive (3) - 1 freq
definite (3) - 20 freq
dentition (3) - 1 freq
definitely (3) - 120 freq
designation (3) - 1 freq
definin (3) - 4 freq
defineitiouns (3) - 1 freq
destination (3) - 19 freq
defineetions (3) - 1 freq
defemination (3) - 1 freq
depiction (3) - 3 freq
deflection (3) - 2 freq
definit (3) - 3 freq
derivation (3) - 2 freq
repitition (3) - 1 freq
demolition (3) - 2 freq
definitly (3) - 1 freq
domination (3) - 1 freq
dedication (3) - 8 freq
definition (0) - 30 freq
defineition (1) - 2 freq
defineetion (2) - 9 freq
definitions (2) - 5 freq
definitive (3) - 8 freq
defineetions (4) - 1 freq
defemination (4) - 1 freq
domination (4) - 1 freq
defineitiouns (4) - 1 freq
definitly (4) - 1 freq
definit (4) - 3 freq
definin (4) - 4 freq
definite (4) - 20 freq
deflation (4) - 1 freq
definiteive (4) - 1 freq
detention (4) - 10 freq
definitely (4) - 120 freq
definietely (5) - 1 freq
admonition (5) - 3 freq
dominatin (5) - 6 freq
defendin (5) - 9 freq
fintin (5) - 1 freq
donation (5) - 12 freq
definet (5) - 1 freq
damnation (5) - 3 freq
SoundEx code - D153
definitely - 120 freq
depends - 38 freq
definite - 20 freq
definiteive - 1 freq
defineition - 2 freq
defiant - 12 freq
defended - 3 freq
dependin - 26 freq
definition - 30 freq
divinity - 2 freq
depend - 16 freq
defendin - 9 freq
depended - 5 freq
defend - 23 freq
defending - 5 freq
defined - 21 freq
definitive - 8 freq
defiantly - 7 freq
dependency - 2 freq
defenders - 8 freq
defendit - 2 freq
defineetion - 9 freq
dependit - 10 freq
dabhand - 1 freq
'definately - 1 freq
definietely - 1 freq
defends - 1 freq
divn't - 4 freq
dependable - 1 freq
defamed - 1 freq
defineetions - 1 freq
devined - 1 freq
dependent - 5 freq
depeindin - 1 freq
depeinds - 1 freq
dowp-end - 2 freq
dowpend - 1 freq
defender - 5 freq
depending - 6 freq
defineitiouns - 1 freq
definitions - 5 freq
€”depend - 1 freq
deepened - 2 freq
definit - 3 freq
defamatory - 1 freq
defendouris - 1 freq
defendent - 1 freq
defin-ately - 1 freq
divent - 1 freq
deviant - 1 freq
daub-haund - 1 freq
definitly - 1 freq
deviants - 1 freq
definet - 1 freq
dependan - 2 freq
€˜deviant - 1 freq
div'nt - 1 freq
dependin' - 1 freq
davemitch - 6 freq
defintootly - 1 freq
dépends - 1 freq
dependinÂ’ - 1 freq
MetaPhone code - TFNXN
defineition - 2 freq
definition - 30 freq
defineetion - 9 freq
devensian - 1 freq
DEFINITION
Time to execute Levenshtein function - 0.190514 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.361727 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028515 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038433 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001150 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.