A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hostels in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
hostels (0) - 1 freq
hotels (1) - 9 freq
hostess (1) - 2 freq
hostel (1) - 14 freq
castels (2) - 2 freq
hosted (2) - 3 freq
possels (2) - 8 freq
gospels (2) - 9 freq
hotel (2) - 91 freq
boatels (2) - 1 freq
hosts (2) - 13 freq
ghosters (2) - 1 freq
tossels (2) - 1 freq
hostelry (2) - 7 freq
hovels (2) - 1 freq
hootel (2) - 3 freq
hotel's (2) - 4 freq
hostile (2) - 13 freq
hoses (2) - 1 freq
posters (2) - 28 freq
hooters (2) - 3 freq
hostit (3) - 7 freq
hoss (3) - 1 freq
hysted (3) - 3 freq
dowters (3) - 1 freq
hostels (0) - 1 freq
hostel (2) - 14 freq
hostess (2) - 2 freq
hotels (2) - 9 freq
hostelry (3) - 7 freq
hostile (3) - 13 freq
castels (3) - 2 freq
hosts (3) - 13 freq
hastily (4) - 21 freq
hysts (4) - 1 freq
bastils (4) - 1 freq
steals (4) - 2 freq
instils (4) - 1 freq
pistols (4) - 18 freq
hostages (4) - 1 freq
hostelrie (4) - 2 freq
heistely (4) - 1 freq
distils (4) - 2 freq
hoasts (4) - 8 freq
hospitals (4) - 7 freq
hostlery (4) - 1 freq
hustle (4) - 1 freq
staels (4) - 2 freq
pistils (4) - 4 freq
hostler (4) - 18 freq
SoundEx code - H234
hastily - 21 freq
hostile - 13 freq
hastylike - 1 freq
hostielities - 1 freq
hostel - 14 freq
hostelry - 7 freq
hostelrie - 2 freq
haistelly - 1 freq
hustle - 1 freq
hostlery - 1 freq
hustled - 1 freq
heistie-like - 1 freq
heistilie - 1 freq
hostels - 1 freq
hosteility - 1 freq
heistie-lik - 1 freq
heistely - 1 freq
hosteelity - 2 freq
hostler - 18 freq
hustlin - 1 freq
hzewdl - 1 freq
hightailed - 1 freq
hgdtlbqjrv - 1 freq
hcztlcqeca - 1 freq
hjetlandi - 16 freq
MetaPhone code - HSTLS
hostels - 1 freq
HOSTELS
Time to execute Levenshtein function - 0.209789 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.351233 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027351 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037112 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000828 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.