A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to defendin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
defendin (0) - 9 freq
dependin (1) - 26 freq
defendit (1) - 2 freq
defending (1) - 5 freq
definin (2) - 4 freq
deafenin (2) - 3 freq
offendin (2) - 13 freq
defend (2) - 23 freq
demandin (2) - 11 freq
deefenin (2) - 2 freq
dependit (2) - 10 freq
depending (2) - 6 freq
defended (2) - 3 freq
fendin (2) - 5 freq
dependin' (2) - 1 freq
defends (2) - 1 freq
depeindin (2) - 1 freq
defender (2) - 5 freq
depennin (2) - 2 freq
descendin (2) - 11 freq
deifenin (2) - 1 freq
dependan (2) - 2 freq
defendent (2) - 1 freq
denend (3) - 1 freq
endin (3) - 43 freq
Double Levenshtein
defendin (0) - 9 freq
defendit (2) - 2 freq
defending (2) - 5 freq
dependin (2) - 26 freq
defends (3) - 1 freq
fendin (3) - 5 freq
depeindin (3) - 1 freq
defendent (3) - 1 freq
defended (3) - 3 freq
dependan (3) - 2 freq
defender (3) - 5 freq
deifenin (3) - 1 freq
defend (3) - 23 freq
definin (3) - 4 freq
deafenin (3) - 3 freq
demandin (3) - 11 freq
deefenin (3) - 2 freq
offendin (3) - 13 freq
refoondin (4) - 1 freq
fendan (4) - 1 freq
dunedin (4) - 3 freq
deefnin (4) - 1 freq
droondin (4) - 1 freq
depending (4) - 6 freq
'fundin (4) - 1 freq
SoundEx code - D153
definitely - 120 freq
depends - 38 freq
definite - 20 freq
definiteive - 1 freq
defineition - 2 freq
defiant - 12 freq
defended - 3 freq
dependin - 26 freq
definition - 30 freq
divinity - 2 freq
depend - 16 freq
defendin - 9 freq
depended - 5 freq
defend - 23 freq
defending - 5 freq
defined - 21 freq
definitive - 8 freq
defiantly - 7 freq
dependency - 2 freq
defenders - 8 freq
defendit - 2 freq
defineetion - 9 freq
dependit - 10 freq
dabhand - 1 freq
'definately - 1 freq
definietely - 1 freq
defends - 1 freq
divn't - 4 freq
dependable - 1 freq
defamed - 1 freq
defineetions - 1 freq
devined - 1 freq
dependent - 5 freq
depeindin - 1 freq
depeinds - 1 freq
dowp-end - 2 freq
dowpend - 1 freq
defender - 5 freq
depending - 6 freq
defineitiouns - 1 freq
definitions - 5 freq
—depend - 1 freq
deepened - 2 freq
definit - 3 freq
defamatory - 1 freq
defendouris - 1 freq
defendent - 1 freq
defin-ately - 1 freq
divent - 1 freq
deviant - 1 freq
daub-haund - 1 freq
definitly - 1 freq
deviants - 1 freq
definet - 1 freq
dependan - 2 freq
‘deviant - 1 freq
div'nt - 1 freq
dependin' - 1 freq
davemitch - 6 freq
defintootly - 1 freq
dépends - 1 freq
dependin’ - 1 freq
MetaPhone code - TFNTN
defendin - 9 freq
Manually curated - DEFENDIN
Time to execute Levenshtein function - 0.181439 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
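The definition above can be sketched in a few lines of Python. This is a minimal illustration and not the code behind this page: the function names levenshtein and nearest are made up for the example, and the short word list is a handful of entries copied from the results above.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn a into b."""
    # previous[j] is the distance between the characters of a seen so far
    # (before the current one) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, free if equal
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, limit=5):
    """Hypothetical helper: rank vocabulary by distance to word."""
    scored = [(w, levenshtein(word, w)) for w in vocabulary]
    return sorted(scored, key=lambda pair: (pair[1], pair[0]))[:limit]


# A few words taken from the Levenshtein column above.
words = ["dependin", "defendit", "defend", "endin", "definin"]
for candidate, distance in nearest("defendin", words):
    print(f"{candidate} ({distance})")
```

The row-by-row table keeps memory proportional to the length of the second word, which is plenty for single-word lookups like these.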
Time to execute Double Levenshtein function - 0.342605 milliseconds
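In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants. For example, defendit scores 1 with vowels and 1 without, for a total of 2.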
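A rough Python sketch of that idea follows. It assumes the vowels removed are a, e, i, o and u, which the description does not spell out but which matches the scores listed above. The names strip_vowels and double_levenshtein are invented for the example.

```python
VOWELS = set("aeiou")  # assumption: the page does not say which letters count


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, free if equal
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping consonants and any punctuation."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    skeleton = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + skeleton


for other in ["defendit", "defend", "depending", "dunedin"]:
    print(other, double_levenshtein("defendin", other))
```

Because a consonant change shows up in both passes and a vowel change only in the first, words that differ only in their vowels stay close while consonant differences push candidates further down the list.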
Time to execute SoundEx function - 0.027129 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
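As an illustration, the classic American Soundex rules can be written out as below. This is a sketch under the assumption that the standard rules apply; the page does not say which implementation it uses, and the function name soundex here is simply chosen for the example. Non-letters such as apostrophes and hyphens are skipped.

```python
def soundex(word: str) -> str:
    """Four-character American Soundex code: first letter plus three digits."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Keep only the plain letters a-z; everything else is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = codes.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")  # pad short codes with zeros


for word in ["defendin", "definitely", "dabhand", "divn't"]:
    print(word, soundex(word))
```

Since only the first three coded consonants after the initial letter survive, long and short words with the same opening consonants share a code, which is why the SoundEx list above is so much broader than the Levenshtein ones.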
Time to execute MetaPhone function - 0.037051 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000898 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
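In code, a lookup of this kind amounts to a dictionary from spellings to lemmas. The sketch below is only illustrative: the LEXICON name and the lookup_lemma function are hypothetical, the first entry mirrors the result shown above, and the second entry is an invented example of a spelling variant, not taken from the real table.

```python
from typing import Optional

# Hypothetical hand-made lexicon mapping spellings to lemmas.
LEXICON = {
    "defendin": "DEFENDIN",   # as shown in the results above
    "defendin'": "DEFENDIN",  # invented variant, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("defendin"))
print(lookup_lemma("ahint"))
```

A plain dictionary lookup does no computation on the word itself, which fits with this method being the fastest of the five timings listed.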