A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to hunner-an-fowr-an-thritie in Corpus

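Results by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Numbers in brackets are distances from the search word.

Levenshtein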
hunner-an-fowr-an-thritie (0) - 1 freq
hunner-pund-a-time (12) - 1 freq
hunner-an-siventy-odd (13) - 1 freq
unner-growthe (14) - 1 freq
unner-appreciatit (14) - 1 freq
ane-an-twuntie (14) - 1 freq
heteronormative (15) - 2 freq
back-an-forrit (15) - 1 freq
unnerratit (15) - 2 freq
hunner-thoosan (15) - 1 freq
nae-use-fur-nothin (15) - 1 freq
unnernathe (15) - 1 freq
four-an'-thretty (15) - 1 freq
three-forty-three (15) - 1 freq
hunter-gaitherin (15) - 1 freq
haund-for-nieve (15) - 1 freq
echt-an-twuntie (16) - 1 freq
transformation (16) - 9 freq
singer-sangwriters (16) - 1 freq
horse-an-cairts (16) - 1 freq
black-an-white (16) - 2 freq
fower-an-twenty (16) - 1 freq
unnerrepresentit (16) - 2 freq
transformations (16) - 1 freq
five-an-echtie (16) - 1 freq

Double Levenshtein
hunner-an-fowr-an-thritie (0) - 1 freq
hunner-an-siventy-odd (21) - 1 freq
hunner-pund-a-time (21) - 1 freq
nae-use-fur-nothin (23) - 1 freq
three-forty-three (24) - 1 freq
ane-an-twuntie (24) - 1 freq
unner-appreciatit (24) - 1 freq
unner-growthe (24) - 1 freq
hunter-gaitherin (25) - 1 freq
haund-for-nieve (25) - 1 freq
threi-an-twuntiet (25) - 1 freq
four-an'-thretty (25) - 1 freq
fower-an-twenty (25) - 1 freq
back-an-forrit (25) - 1 freq
hunner-thoosan (25) - 1 freq
thirty-fower-inch (26) - 1 freq
pairtner-in-crime (26) - 1 freq
unnernathe (26) - 1 freq
hunnerwecht (26) - 2 freq
guid-for-naethin (26) - 1 freq
hereawa-thereawa (26) - 1 freq
holier-than-thou (26) - 1 freq
back-an-forth (26) - 1 freq
sair-affrontit (26) - 1 freq
singer-sangwriters (26) - 1 freq
SoundEx code - H565
haimmerin - 6 freq
hinner-en - 25 freq
hammerin - 18 freq
hinnerend - 28 freq
hinnereyn - 8 freq
hinnerin - 2 freq
honourin - 2 freq
hinner-ens - 1 freq
hemmerin - 7 freq
hinnie-wairm - 1 freq
hinner-end - 3 freq
hinnermaist - 39 freq
humourin - 1 freq
hinneren - 43 freq
hinnerein - 6 freq
hinnerein's - 1 freq
hinnèr-en - 1 freq
hinnèrin - 1 freq
hammeran - 1 freq
hammermuggley - 1 freq
haun-wringin - 2 freq
hinnermost - 1 freq
hinnerenn - 1 freq
hinner-eyn - 1 freq
hinneryen - 1 freq
hinnerance - 1 freq
hunner-an-siventy-odd - 1 freq
hennerin - 1 freq
hinnereine - 1 freq
hunner-an-fowr-an-thritie - 1 freq
han-ower-han - 1 freq
haimerin - 1 freq
homerange - 1 freq
MetaPhone code - HNRNFRN0
HUNNER-AN-FOWR-AN-THRITIE
Time to execute Levenshtein function - 0.242932 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
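
As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming method for the distance. The page does not say how its own function is implemented, so this is not necessarily the code behind these results. The helper `nearest` and its word list are hypothetical, added only to show how a ranked neighbour list could be produced.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, limit=5):
    """Hypothetical helper: rank vocabulary by distance from word."""
    scored = sorted((levenshtein(word, other), other) for other in vocabulary)
    return scored[:limit]


if __name__ == "__main__":
    # The vocabulary here is made up for the example, not taken from the corpus.
    for distance, other in nearest("sonsie", ["sonsy", "sonse", "hoose"]):
        print(f"{other} ({distance})")
```

Sorting on `(distance, word)` pairs breaks ties alphabetically. The listing above does not appear to do that, so the real tie-breaking rule is unknown.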
Time to execute Double Levenshtein function - 0.446427 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
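
The idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the vowel set (a, e, i, o, u) is a guess, since the page does not say whether y or accented vowels are stripped, and the function names `strip_vowels` and `double_levenshtein` are invented for the example.

```python
VOWELS = set("aeiou")  # assumption: the page does not list which letters count


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels and keep everything else, hyphens included."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the vowel-less forms."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    # A vowel-only difference is counted once, a consonant difference twice.
    print(double_levenshtein("hammerin", "hemmerin"))
    print(double_levenshtein("cat", "cab"))
```

Under this scheme a pair that differs only in a vowel scores the same as with plain Levenshtein, while a consonant difference is counted in both passes.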
Time to execute SoundEx function - 0.027558 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
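
To make the encoding concrete, here is a hedged Python sketch of the classic American Soundex rules (first letter kept, then up to three digits). It is not necessarily the routine this page calls. Dropping hyphens, apostrophes and accented letters before encoding is an assumption of this sketch, chosen because it reproduces the H565 code shown above.

```python
# Letter groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if there are no letters."""
    # Assumption: anything outside a-z (hyphens, accents) is simply ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit can recur
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("hunner-an-fowr-an-thritie", "hammerin", "homerange"):
        print(word, soundex(word))
```

Because only the first letter and three digits survive, a long compound such as hunner-an-fowr-an-thritie is encoded from its first few consonants alone, which is why it shares a code with much shorter words like hammerin.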
Time to execute MetaPhone function - 0.037530 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000851 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
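
A lookup of this kind might look like the following Python sketch. The table entries, the lemma chosen for them, and the names `LEMMA_TABLE`, `lookup_lemma` and `curated_variants` are all hypothetical, made up for illustration. The real lexicon and its format are not shown on this page.

```python
from typing import List, Optional

# Hypothetical hand-made entries: spelling variant -> lemma.
LEMMA_TABLE = {
    "hinneren": "hinnerend",
    "hinner-en": "hinnerend",
    "hinnereyn": "hinnerend",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEMMA_TABLE.get(word.lower())


def curated_variants(word: str) -> List[str]:
    """List the other table entries that share the word's lemma."""
    lemma = lookup_lemma(word)
    if lemma is None:
        return []  # not in the table, so no curated neighbours
    return sorted(variant for variant, target in LEMMA_TABLE.items()
                  if target == lemma and variant != word.lower())


if __name__ == "__main__":
    print(lookup_lemma("hinneren"))
    print(curated_variants("hinneren"))
    print(curated_variants("sonsie"))
```

A dictionary lookup does no string comparison against the rest of the corpus, which is consistent with this method being the fastest of the five timings reported above.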