A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hawaii in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
hawaii (0) - 5 freq
hawai'i (1) - 1 freq
hawdin (2) - 8 freq
haain (2) - 1 freq
gawain (2) - 1 freq
hawkit (2) - 1 freq
hawaiian (2) - 1 freq
hawkie (2) - 1 freq
hawtin (2) - 1 freq
hawie (2) - 1 freq
hawkin (2) - 3 freq
halabi (2) - 1 freq
hawin (2) - 2 freq
await (2) - 7 freq
awaiy (2) - 4 freq
hashin (3) - 2 freq
wali (3) - 1 freq
haipit (3) - 1 freq
pawin (3) - 3 freq
shaain (3) - 15 freq
hahaha (3) - 10 freq
gawin (3) - 25 freq
owain (3) - 1 freq
ha'in (3) - 9 freq
hardie (3) - 26 freq
hawaii (0) - 5 freq
hawai'i (2) - 1 freq
hawie (2) - 1 freq
howie (3) - 52 freq
hawin (3) - 2 freq
haw (3) - 106 freq
awaiy (3) - 4 freq
hawkie (3) - 1 freq
haaw (3) - 2 freq
howay (3) - 4 freq
hawaiian (3) - 1 freq
haain (3) - 1 freq
howpie (4) - 1 freq
how (4) - 1607 freq
haaf (4) - 34 freq
haik (4) - 4 freq
howdie (4) - 10 freq
haap (4) - 1 freq
hewvie (4) - 1 freq
haan (4) - 95 freq
awaw (4) - 72 freq
haaed (4) - 1 freq
haig (4) - 1 freq
hait (4) - 16 freq
haalie (4) - 1 freq
SoundEx code - H000
he - 25273 freq
'he - 77 freq
hae - 8019 freq
hoo - 1051 freq
how - 1607 freq
'hoo - 79 freq
hea - 2 freq
'hae - 31 freq
'haw - 29 freq
ha - 180 freq
huh - 15 freq
hou - 956 freq
haw - 106 freq
'how - 52 freq
hey-you - 1 freq
hee-haw - 17 freq
hue - 24 freq
howie - 52 freq
hie - 113 freq
howe - 77 freq
huey - 31 freq
'huey - 1 freq
'ha - 8 freq
hey - 157 freq
'hey - 8 freq
'hou - 21 freq
hy - 5 freq
'hi - 6 freq
hay - 124 freq
haa - 93 freq
h - 257 freq
-hou - 1 freq
hau - 1 freq
'hay - 1 freq
hoo' - 2 freq
'howie - 1 freq
'howieeee - 1 freq
hee - 289 freq
heehaw - 194 freq
hoy - 27 freq
hoo-oo - 1 freq
ho - 56 freq
hew - 8 freq
huw - 3 freq
hi - 67 freq
haaaa - 2 freq
haha - 65 freq
haaw - 2 freq
hiya - 27 freq
'hiya - 11 freq
hai - 4 freq
huy - 3 freq
hye - 6 freq
he' - 8 freq
hoi - 8 freq
hei - 257 freq
-how - 2 freq
'hahahaha - 1 freq
heh - 41 freq
ho-ho - 9 freq
hae' - 4 freq
'he' - 4 freq
hö - 1 freq
'hae' - 2 freq
'how' - 2 freq
hawaii - 5 freq
ha' - 3 freq
heehaw' - 4 freq
'hie - 1 freq
'ho- - 2 freq
'ho-ho - 4 freq
'haw-haw - 1 freq
'heehaw - 4 freq
hehaw - 1 freq
hawie - 1 freq
how-- - 1 freq
hæ - 3 freq
haey - 1 freq
heeeeeee - 1 freq
hah - 4 freq
hawai'i - 1 freq
hii - 1 freq
hoo-hoo-hoo-hoo - 1 freq
heiße - 1 freq
híe - 1 freq
hyow - 2 freq
€œhe - 47 freq
€œhow - 15 freq
haue - 1 freq
€˜how - 22 freq
€¦he - 2 freq
€œhoo - 24 freq
€˜hae - 2 freq
€™hui - 1 freq
€œhi - 2 freq
hoe - 2 freq
€˜hoo - 4 freq
€˜hiyie - 1 freq
€˜h-h-h-hi-yie - 1 freq
€™he - 9 freq
€˜he - 23 freq
€˜hoaw - 4 freq
€˜hey - 2 freq
€˜hauw - 1 freq
€˜hi - 1 freq
€œhi-ay - 1 freq
€œhou - 1 freq
€˜ha - 1 freq
€¦how - 1 freq
€œhae - 7 freq
€œhaw - 2 freq
hiewey - 1 freq
€œha - 1 freq
€œhey - 6 freq
hui - 1 freq
€œhuh - 1 freq
hoaw - 1 freq
€”he - 1 freq
€œhoi - 1 freq
hiy - 1 freq
€œhowe - 1 freq
€œho - 1 freq
€œhiya - 1 freq
heu - 1 freq
€™how - 4 freq
hw - 1 freq
haeÂ’ - 1 freq
“how - 2 freq
hh - 3 freq
hahahaha - 7 freq
hahaha - 10 freq
heeee - 1 freq
hohoho - 2 freq
‘he - 1 freq
heehee - 2 freq
hahahaaha - 1 freq
hahahahahhahahahha - 1 freq
hahahahahaha - 1 freq
hahahahahahahahahaha - 1 freq
ha'e - 1 freq
hwou - 1 freq
“he - 3 freq
hu - 2 freq
hehehe - 2 freq
hhy - 1 freq
hyu - 1 freq
‘hey - 1 freq
hehe - 3 freq
hhoh - 1 freq
'haha - 1 freq
hiyi - 2 freq
hyyh - 1 freq
hoey - 1 freq
“hae - 1 freq
'h' - 1 freq
hoho - 1 freq
hahhh - 1 freq
hihi - 6 freq
howay - 4 freq
hoh - 1 freq
MetaPhone code - HW
howie - 52 freq
howe - 77 freq
'howie - 1 freq
'howieeee - 1 freq
hawaii - 5 freq
hawie - 1 freq
hawai'i - 1 freq
hiewey - 1 freq
€œhowe - 1 freq
howay - 4 freq
HAWAII
Time to execute Levenshtein function - 0.203320 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337241 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027070 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036940 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000825 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.