A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hawai in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
hawaii (1) - 5 freq
haaw (2) - 2 freq
hawd (2) - 12 freq
hasni (2) - 1 freq
hawin (2) - 2 freq
haha (2) - 66 freq
haean (2) - 12 freq
hadni (2) - 1 freq
gawa (2) - 5 freq
howan (2) - 1 freq
haal (2) - 42 freq
haav (2) - 1 freq
agai (2) - 2 freq
haws (2) - 10 freq
hawl (2) - 1 freq
wai (2) - 1 freq
hanoi (2) - 1 freq
awaw (2) - 72 freq
hamar (2) - 5 freq
haain (2) - 1 freq
'away (2) - 3 freq
awa (2) - 4217 freq
hawkit (2) - 1 freq
hawn' (2) - 2 freq
haffi (2) - 3 freq
hawaii (1) - 5 freq
howay (2) - 4 freq
hawie (2) - 1 freq
haw (2) - 106 freq
haaw (2) - 2 freq
awaiy (3) - 4 freq
hlai (3) - 1 freq
hai (3) - 4 freq
hawai'i (3) - 1 freq
away (3) - 766 freq
hari (3) - 1 freq
haaf (3) - 34 freq
haas (3) - 10 freq
kwai (3) - 1 freq
how (3) - 1649 freq
'awa (3) - 22 freq
qawa (3) - 1 freq
hawes (3) - 2 freq
hoaw (3) - 1 freq
hawin (3) - 2 freq
awae (3) - 22 freq
hawkie (3) - 1 freq
hw (3) - 1 freq
haar (3) - 110 freq
haap (3) - 1 freq
SoundEx code - H000
he - 25549 freq
'he - 78 freq
hae - 8068 freq
hoo - 1076 freq
how - 1649 freq
'hoo - 79 freq
hea - 2 freq
'hae - 31 freq
'haw - 29 freq
ha - 181 freq
huh - 16 freq
hou - 958 freq
haw - 106 freq
'how - 51 freq
hey-you - 1 freq
hee-haw - 18 freq
hue - 24 freq
howie - 52 freq
hie - 113 freq
howe - 77 freq
huey - 31 freq
'huey - 1 freq
'ha - 8 freq
hey - 158 freq
'hey - 11 freq
'hou - 21 freq
hy - 5 freq
'hi - 6 freq
hay - 124 freq
haa - 93 freq
h - 258 freq
-hou - 1 freq
hau - 1 freq
'hay - 1 freq
hoo' - 2 freq
'howie - 1 freq
'howieeee - 1 freq
hee - 289 freq
heehaw - 194 freq
hoy - 29 freq
hoo-oo - 1 freq
ho - 56 freq
hew - 8 freq
huw - 3 freq
hi - 67 freq
haaaa - 2 freq
haha - 66 freq
haaw - 2 freq
hiya - 28 freq
'hiya - 11 freq
hai - 4 freq
hah - 7 freq
hahahahahahah - 1 freq
heeeeeeeooooo - 1 freq
hoooooo - 1 freq
huy - 4 freq
ha'e - 3 freq
hye - 6 freq
he' - 8 freq
hoi - 8 freq
hei - 257 freq
-how - 2 freq
'hahahaha - 1 freq
heh - 41 freq
ho-ho - 9 freq
hae' - 4 freq
'he' - 4 freq
hö - 1 freq
'hae' - 2 freq
'how' - 2 freq
hawaii - 5 freq
ha' - 3 freq
heehaw' - 4 freq
'hie - 1 freq
'ho- - 2 freq
'ho-ho - 4 freq
'haw-haw - 1 freq
'heehaw - 4 freq
hehaw - 1 freq
hawie - 1 freq
how-- - 1 freq
hæ - 3 freq
haey - 1 freq
heeeeeee - 1 freq
hawai'i - 1 freq
hii - 1 freq
hoo-hoo-hoo-hoo - 1 freq
heiße - 1 freq
híe - 1 freq
hyow - 2 freq
€œhe - 48 freq
€œhow - 18 freq
haue - 1 freq
€˜how - 22 freq
€¦he - 2 freq
€œhoo - 24 freq
€˜hae - 2 freq
€™hui - 1 freq
€œhi - 2 freq
hoe - 2 freq
€œhaw - 6 freq
€˜hoo - 4 freq
€˜hiyie - 1 freq
€˜h-h-h-hi-yie - 1 freq
€™he - 9 freq
€˜he - 23 freq
€˜hoaw - 4 freq
€˜hey - 2 freq
€˜hauw - 1 freq
€˜hi - 1 freq
€œhi-ay - 1 freq
€œhou - 1 freq
€˜ha - 1 freq
€¦how - 1 freq
€œhae - 7 freq
hiewey - 1 freq
€œha - 1 freq
€œhey - 6 freq
hui - 1 freq
€œhuh - 1 freq
hoaw - 1 freq
€”he - 1 freq
€œhoi - 1 freq
hiy - 1 freq
€œhowe - 1 freq
€œho - 1 freq
€œhiya - 1 freq
heu - 1 freq
€™how - 4 freq
hw - 1 freq
haeÂ’ - 1 freq
“how - 2 freq
hh - 3 freq
hahahaha - 7 freq
hahaha - 10 freq
heeee - 1 freq
hohoho - 2 freq
‘he - 1 freq
heehee - 2 freq
hahahaaha - 1 freq
hahahahahhahahahha - 1 freq
hahahahahaha - 1 freq
hahahahahahahahahaha - 1 freq
hwou - 1 freq
“he - 3 freq
hu - 2 freq
hehehe - 2 freq
hhy - 1 freq
hyu - 1 freq
‘hey - 1 freq
hehe - 3 freq
hhoh - 1 freq
'haha - 1 freq
hiyi - 2 freq
hyyh - 1 freq
hoey - 1 freq
“hae - 1 freq
'h' - 1 freq
hoho - 1 freq
hahhh - 1 freq
hihi - 6 freq
howay - 4 freq
hoh - 1 freq
MetaPhone code - HW
howie - 52 freq
howe - 77 freq
'howie - 1 freq
'howieeee - 1 freq
hawaii - 5 freq
hawie - 1 freq
hawai'i - 1 freq
hiewey - 1 freq
€œhowe - 1 freq
howay - 4 freq
HAWAI
Time to execute Levenshtein function - 0.250610 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.484649 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028982 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041516 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001002 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.