A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to defend in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
defend (0) - 23 freq
defends (1) - 1 freq
depend (1) - 16 freq
defen (1) - 1 freq
denend (1) - 1 freq
decent (2) - 112 freq
defined (2) - 21 freq
define (2) - 11 freq
defn (2) - 3 freq
refund (2) - 2 freq
legend (2) - 91 freq
defense (2) - 1 freq
deen' (2) - 1 freq
defeck (2) - 1 freq
depends (2) - 38 freq
deena (2) - 1 freq
deen- (2) - 1 freq
defendin (2) - 9 freq
peend (2) - 4 freq
resend (2) - 1 freq
defers (2) - 1 freq
deeid (2) - 15 freq
defied (2) - 5 freq
fend (2) - 28 freq
descend (2) - 14 freq
defend (0) - 23 freq
denend (2) - 1 freq
defen (2) - 1 freq
defined (2) - 21 freq
depend (2) - 16 freq
defends (2) - 1 freq
defreend (3) - 1 freq
defendit (3) - 2 freq
defended (3) - 3 freq
demand (3) - 51 freq
offend (3) - 12 freq
fend (3) - 28 freq
defendin (3) - 9 freq
defence (3) - 43 freq
define (3) - 11 freq
deafens (3) - 1 freq
defn (3) - 3 freq
refund (3) - 2 freq
defied (3) - 5 freq
defender (3) - 5 freq
defense (3) - 1 freq
definin (4) - 4 freq
defiant (4) - 11 freq
fund (4) - 563 freq
defyin (4) - 2 freq
SoundEx code - D153
definitely - 117 freq
depends - 38 freq
definite - 19 freq
definiteive - 1 freq
defineition - 2 freq
defiant - 11 freq
defended - 3 freq
dependin - 26 freq
definition - 30 freq
divinity - 2 freq
depend - 16 freq
defendin - 9 freq
depended - 5 freq
defend - 23 freq
defending - 5 freq
defined - 21 freq
definitive - 8 freq
defiantly - 7 freq
dependency - 2 freq
defenders - 8 freq
defendit - 2 freq
defineetion - 9 freq
dependit - 10 freq
dabhand - 1 freq
'definately - 1 freq
definietely - 1 freq
defends - 1 freq
divn't - 4 freq
dependable - 1 freq
defamed - 1 freq
defineetions - 1 freq
devined - 1 freq
dependent - 5 freq
depeindin - 1 freq
depeinds - 1 freq
dowp-end - 2 freq
dowpend - 1 freq
defender - 5 freq
depending - 6 freq
defineitiouns - 1 freq
definitions - 5 freq
€”depend - 1 freq
deepened - 2 freq
definit - 3 freq
defamatory - 1 freq
defendouris - 1 freq
defendent - 1 freq
defin-ately - 1 freq
divent - 1 freq
deviant - 1 freq
daub-haund - 1 freq
definitly - 1 freq
deviants - 1 freq
definet - 1 freq
dependan - 2 freq
€˜deviant - 1 freq
div'nt - 1 freq
dependin' - 1 freq
davemitch - 6 freq
defintootly - 1 freq
dépends - 1 freq
dependinÂ’ - 1 freq
MetaPhone code - TFNT
definite - 19 freq
defiant - 11 freq
divinity - 2 freq
defend - 23 freq
defined - 21 freq
divn't - 4 freq
devined - 1 freq
definit - 3 freq
divent - 1 freq
deviant - 1 freq
definet - 1 freq
€˜deviant - 1 freq
div'nt - 1 freq
DEFEND
Time to execute Levenshtein function - 0.180720 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.498478 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.063799 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037704 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000884 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.