A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to halls in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
halls (0) - 13 freq
dalls (1) - 1 freq
halle (1) - 3 freq
haels (1) - 1 freq
yalls (1) - 2 freq
hells (1) - 1 freq
hales (1) - 1 freq
halps (1) - 1 freq
hallis (1) - 1 freq
walls (1) - 29 freq
falls (1) - 15 freq
hall' (1) - 1 freq
balls (1) - 19 freq
hills (1) - 245 freq
hails (1) - 6 freq
halts (1) - 1 freq
hally (1) - 1 freq
haalls (1) - 1 freq
haals (1) - 9 freq
hauls (1) - 1 freq
calls (1) - 35 freq
hulls (1) - 9 freq
shalls (1) - 6 freq
hall (1) - 190 freq
hallo (1) - 5 freq
Double Levenshtein
halls (0) - 13 freq
hells (1) - 1 freq
hallis (1) - 1 freq
hulls (1) - 9 freq
hills (1) - 245 freq
haalls (1) - 1 freq
calls (2) - 35 freq
haals (2) - 9 freq
hauls (2) - 1 freq
shalls (2) - 6 freq
hallies (2) - 1 freq
huills (2) - 1 freq
palls (2) - 5 freq
hall (2) - 190 freq
hally (2) - 1 freq
hallo (2) - 5 freq
yalls (2) - 2 freq
hales (2) - 1 freq
haels (2) - 1 freq
halts (2) - 1 freq
dalls (2) - 1 freq
halps (2) - 1 freq
halle (2) - 3 freq
walls (2) - 29 freq
hall' (2) - 1 freq
SoundEx code - H420
hills - 245 freq
huils - 5 freq
heels - 77 freq
hulls - 9 freq
holes - 73 freq
hill's - 3 freq
hellish - 20 freq
hails - 6 freq
'hellish - 1 freq
hillock - 8 freq
hulks - 4 freq
hillocks - 3 freq
hallies - 1 freq
halls - 13 freq
hole's - 2 freq
howls - 15 freq
hallows - 2 freq
halse - 1 freq
hulk - 4 freq
heel's - 1 freq
hauls - 1 freq
heals - 5 freq
hollows - 5 freq
houls - 10 freq
hell's - 16 freq
hïlls - 4 freq
halies - 2 freq
hillies - 11 freq
hillies' - 5 freq
haals - 9 freq
helix - 7 freq
huls - 6 freq
haalage - 1 freq
hilligo - 1 freq
haelga - 2 freq
haalls - 1 freq
hols - 6 freq
haels - 1 freq
hallow's - 1 freq
huills - 1 freq
hools - 2 freq
'hallaig' - 1 freq
halo's - 1 freq
helios - 5 freq
helga - 1 freq
hailes - 3 freq
howells - 1 freq
heles - 1 freq
hail-hek - 1 freq
hallis - 1 freq
‘hallos - 1 freq
helja - 1 freq
hells - 1 freq
hales - 1 freq
hollyoaks - 1 freq
hollies - 2 freq
hulgzhh - 1 freq
hail’s - 1 freq
hls - 1 freq
hollyjo - 1 freq
MetaPhone code - HLS
hills - 245 freq
huils - 5 freq
heels - 77 freq
hulls - 9 freq
holes - 73 freq
hill's - 3 freq
hails - 6 freq
hallies - 1 freq
halls - 13 freq
hole's - 2 freq
howls - 15 freq
hallows - 2 freq
halse - 1 freq
heel's - 1 freq
hauls - 1 freq
heals - 5 freq
hollows - 5 freq
houls - 10 freq
hell's - 16 freq
halies - 2 freq
hillies - 11 freq
hillies' - 5 freq
haals - 9 freq
huls - 6 freq
haalls - 1 freq
hols - 6 freq
haels - 1 freq
hallow's - 1 freq
huills - 1 freq
hools - 2 freq
halo's - 1 freq
helios - 5 freq
hailes - 3 freq
heles - 1 freq
hallis - 1 freq
‘hallos - 1 freq
hells - 1 freq
hales - 1 freq
hollies - 2 freq
hail’s - 1 freq
Manually curated
HALLS
Time to execute Levenshtein function - 0.289718 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
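The definition can be illustrated with a minimal Python sketch. It uses the standard dynamic-programming approach and is not necessarily how this site computes its figures; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    # previous[j] holds the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


for word in ["halls", "hills", "hall", "haalls"]:
    print(word, levenshtein("halls", word))
```

The sketch compares Unicode characters one by one. A byte-based implementation may score accented spellings differently, because an accented letter can occupy more than one byte.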
Time to execute Double Levenshtein function - 0.491602 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
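A rough Python sketch of this idea follows. The names `strip_vowels` and `double_levenshtein` are hypothetical, and treating only a, e, i, o and u as vowels (so that y counts as a consonant) is an assumption; the site's own vowel list is not stated.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["hells", "hills", "calls", "hall", "hallies"]:
    print(word, double_levenshtein("halls", word))
```

Under these assumptions a vowel-only change such as hells costs 1, while a consonant change such as calls costs 2, which is consistent with the scores listed above.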
Time to execute SoundEx function - 0.034017 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
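As an illustration, here is a minimal Python sketch of the classic American Soundex rules (keep the first letter, encode the remaining consonants as digits, collapse repeated codes, pad to four characters). It is an assumption that the site follows these rules exactly; the function name `soundex` and the decision to ignore every character outside a-z are choices made for this sketch.

```python
# Digit assigned to each consonant group; vowels, h, w and y have no code.
CODES = {}
for digit, letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                       "4": "l", "5": "mn", "6": "r"}.items():
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no letters."""
    # Assumption: apostrophes, hyphens and accented letters are simply ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of equal codes
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets the run, so a repeat after it is kept
    return (result + "000")[:4]


for word in ["halls", "hills", "hillock", "helix"]:
    print(word, soundex(word))
```

All four sample words come out as H420, the code shown for halls above, which is why such different spellings end up in the same list.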
Time to execute MetaPhone function - 0.079986 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000870 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
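A lookup of this kind might be sketched in Python as below. The table contents and the names `LEXICON` and `curated_lemma` are hypothetical and purely illustrative; they are not taken from the site's actual lexicon.

```python
# Hypothetical hand-built lexicon: spelling variant -> lemma.
# The entries are illustrative only, not the site's real data.
LEXICON = {
    "halls": "HALLS",
    "haals": "HALLS",
    "haalls": "HALLS",
}


def curated_lemma(word):
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("Haals"))
print(curated_lemma("ablow"))  # not in this toy table, so None
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having the shortest execution time of the five.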