A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to sub-heidin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in brackets)
sub-heidin (0) - 1 freq
sub-headin (1) - 1 freq
sub-heidings (2) - 3 freq
subsidin (3) - 1 freq
beheidin (3) - 3 freq
dug-heidit (3) - 2 freq
beheidan (4) - 1 freq
sub-sheeds (4) - 1 freq
shepherdin (4) - 1 freq
non-herdin (4) - 1 freq
succeedin (4) - 2 freq
sel-guidin (4) - 4 freq
surpreisin (4) - 1 freq
heidin (4) - 58 freq
speidin (4) - 1 freq
rid-heidit (4) - 2 freq
sunshein (4) - 1 freq
sublettin (4) - 1 freq
shaidin (4) - 28 freq
sheilin (4) - 2 freq
steidin (4) - 10 freq
baa-heidit (4) - 1 freq
pig-heidit (4) - 1 freq
sheddin (4) - 5 freq
subsidit (4) - 1 freq
Double Levenshtein (combined distance in brackets)
sub-heidin (0) - 1 freq
sub-headin (1) - 1 freq
sub-heidings (4) - 3 freq
beheidin (5) - 3 freq
subsidin (5) - 1 freq
shaidin (6) - 28 freq
sel-guidin (6) - 4 freq
bee-heidit (6) - 1 freq
beheidan (6) - 1 freq
baa-heidit (6) - 1 freq
sub-sheeds (6) - 1 freq
dug-heidit (6) - 2 freq
hie-heidit (7) - 1 freq
het-heidit (7) - 1 freq
beheidit (7) - 6 freq
big-heidit (7) - 3 freq
spreidin (7) - 21 freq
spearheedin (7) - 1 freq
shadin (7) - 1 freq
seed-heids (7) - 1 freq
behaudin (7) - 3 freq
tow-heidit (7) - 1 freq
ba-heid (7) - 1 freq
subhuman (7) - 1 freq
sea-maiden (7) - 1 freq
SoundEx code - S135
saften - 7 freq
shiftin - 32 freq
saftness - 4 freq
subduing - 1 freq
saftent - 3 freq
september - 124 freq
spitten - 1 freq
spittin - 34 freq
softener - 1 freq
speidin - 1 freq
spoutin - 7 freq
saft-hingin - 1 freq
siftin - 2 freq
speedin - 4 freq
speedometer - 4 freq
saftened - 4 freq
septemmer - 9 freq
saftnin - 2 freq
spottin - 4 freq
saftens - 3 freq
spootin - 5 freq
sub-heidings - 3 freq
softened - 1 freq
spittan - 5 freq
shiftan - 4 freq
soften - 2 freq
spittin't - 1 freq
siftan - 1 freq
scabbitness - 1 freq
sputnik - 1 freq
saaften - 1 freq
spitin - 2 freq
spitting - 4 freq
sub-headin - 1 freq
swiftian - 1 freq
september's - 2 freq
spittoon - 1 freq
spouting - 3 freq
speed-mad - 1 freq
siptaimbur - 1 freq
sputum - 1 freq
shifting - 1 freq
spittins - 1 freq
softens - 1 freq
sub-heidin - 1 freq
softening - 1 freq
spoattin - 1 freq
showboating - 1 freq
speeding - 1 freq
swfdnvkrdu - 1 freq
spittingimage - 1 freq
svwdnuceov - 1 freq
skiptomyloulou - 3 freq
szvptdnlay - 1 freq
MetaPhone code - SBHTN
sub-headin - 1 freq
sub-heidin - 1 freq
Manually curated
SUB-HEIDIN
Time to execute Levenshtein function - 0.323486 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
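As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming computation. It is not the code behind this page (which isn't shown); the function name `levenshtein` is simply chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character inserts, deletes and replacements."""
    # previous[j] holds the distance between the prefix of a seen so far and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("sub-heidin", "sub-headin"))    # 1
print(levenshtein("sub-heidin", "sub-heidings"))  # 2
print(levenshtein("sub-heidin", "heidin"))        # 4
```

The three printed values agree with the distances shown in brackets in the Levenshtein listing for sub-heidin.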
Time to execute Double Levenshtein function - 0.591280 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
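The two-pass idea described above can be sketched in Python as follows. This is a reconstruction, not the site's own code: it assumes "vowels" means the letters a, e, i, o, u (leaving y and the hyphen in place), and the names `strip_vowels` and `double_levenshtein` are made up for the example.

```python
VOWELS = set("aeiou")  # assumption: y is not treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (inserts, deletes, replacements)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop a, e, i, o, u, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("sub-heidin", "sub-headin"))    # 1
print(double_levenshtein("sub-heidin", "sub-heidings"))  # 4
print(double_levenshtein("sub-heidin", "beheidin"))      # 5
```

Under that vowel assumption the results agree with the Double Levenshtein listing: a vowel-only change such as sub-headin still costs 1, while each consonant change is counted in both passes.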
Time to execute SoundEx function - 0.038354 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
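To show how such a code comes about, here is a minimal Python sketch of the usual American Soundex rules: keep the first letter, map the following consonants to digits, collapse adjacent letters with the same digit, and pad or cut to four characters. It assumes non-letters such as hyphens are ignored and that h and w do not separate letters with the same code; the implementation actually used by this page isn't shown.

```python
# Consonant groups and their Soundex digits; vowels, y, h and w have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code, e.g. 'S135'."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]  # skip hyphens etc.
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are skipped without resetting the previous code
        code = CODES.get(ch, "")  # vowels and y give "" and reset the previous code
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code
    return result.ljust(4, "0")


print(soundex("sub-heidin"))   # S135
print(soundex("september"))    # S135
print(soundex("scabbitness"))  # S135
```

Because only the first letter and the first three consonant codes are kept, quite unrelated words such as september and sub-heidin share the code S135, which explains the length and looseness of the SoundEx listing.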
Time to execute MetaPhone function - 0.048010 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001066 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas, including obvious typos and spelling variations. Not all words are covered.
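A lookup of this kind can be sketched in Python as a plain dictionary. The table below is hypothetical: only the sub-heidin entry reflects the result shown on this page, the second entry is invented for illustration, and the names `LEXICON` and `curated_lemma` are assumptions rather than anything from the site.

```python
from typing import Optional

# Hypothetical hand-built lexicon: surface form -> lemma.
LEXICON = {
    "sub-heidin": "SUB-HEIDIN",  # matches the curated result shown on this page
    "sub-headin": "SUB-HEIDIN",  # assumed entry, for illustration only
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("sub-heidin"))  # SUB-HEIDIN
print(curated_lemma("ahint"))       # None (not in this toy table)
```

A single table lookup does far less work than computing distances against every word in the corpus, which would be consistent with this method having the shortest execution time.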