A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to encroach in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
encroach (0) - 1 freq
encroachin (2) - 2 freq
encroached (2) - 6 freq
coach (3) - 51 freq
crooch (3) - 1 freq
enyoch (3) - 36 freq
reproach (3) - 2 freq
eneoch (3) - 2 freq
enoch (3) - 17 freq
encroaching (3) - 2 freq
broach (3) - 6 freq
scronach (3) - 2 freq
crouch (3) - 8 freq
approach (3) - 105 freq
crotch (3) - 1 freq
enouch (3) - 4 freq
enrich (3) - 4 freq
scorch (4) - 1 freq
laroch (4) - 1 freq
broch (4) - 86 freq
brooch (4) - 9 freq
feerach (4) - 3 freq
crunch (4) - 18 freq
craack (4) - 1 freq
scootch (4) - 1 freq
encroach (0) - 1 freq
encroached (3) - 6 freq
encroachin (3) - 2 freq
crouch (4) - 8 freq
crooch (4) - 1 freq
enrich (4) - 4 freq
scorch (5) - 1 freq
enouch (5) - 4 freq
uncrc (5) - 1 freq
scriech (5) - 3 freq
screch (5) - 4 freq
crotch (5) - 1 freq
screech (5) - 5 freq
scraich (5) - 14 freq
screich (5) - 18 freq
enyoch (5) - 36 freq
coach (5) - 51 freq
enoch (5) - 17 freq
eneoch (5) - 2 freq
encroaching (5) - 2 freq
approach (5) - 105 freq
reproach (5) - 2 freq
broach (5) - 6 freq
scronach (5) - 2 freq
auroch (6) - 2 freq
SoundEx code - E526
engraved - 6 freq
encouraging - 8 freq
encouraginly - 2 freq
encouraged - 26 freq
encourage - 64 freq
enshrined - 2 freq
engrossed - 5 freq
enquire - 4 freq
enquiry - 7 freq
emigrants - 1 freq
ensure - 18 freq
encircled - 1 freq
encouragin' - 4 freq
enshore - 2 freq
encroaching - 2 freq
encore - 5 freq
encouragement - 28 freq
enquires - 5 freq
enquired - 8 freq
enchor't - 1 freq
'encouraged' - 1 freq
encouragin - 17 freq
engravins - 1 freq
ensurin - 3 freq
emigratet - 1 freq
encourages - 4 freq
emigrant - 2 freq
emigrant's - 1 freq
engorged - 1 freq
enquirt - 1 freq
enquiries - 6 freq
encouragemint - 1 freq
encroached - 6 freq
ensures - 3 freq
emissary - 2 freq
encouragan - 2 freq
emigrated - 10 freq
encrusted - 2 freq
ee-winkers - 2 freq
emigrate - 2 freq
eonger-eel - 1 freq
eimagerie - 1 freq
encouraget - 1 freq
encouragit - 4 freq
enshair - 1 freq
encroachin - 2 freq
encroach - 1 freq
enquirin - 3 freq
emigratit - 1 freq
enshuir - 1 freq
engert - 1 freq
encooraged - 1 freq
ensuring - 2 freq
engerlush - 4 freq
engurlesh - 1 freq
engurlish - 3 freq
engurland - 2 freq
engurlush - 2 freq
emmagraeauthor - 12 freq
enquiring - 1 freq
MetaPhone code - ENKRX
encroach - 1 freq
ENCROACH
Time to execute Levenshtein function - 0.309668 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.811310 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.088295 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.098006 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000877 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.