A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to prospectus in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
prospectus (0) - 2 freq
prospects (1) - 7 freq
prospect's (1) - 1 freq
prospecks (2) - 2 freq
prosperous (2) - 2 freq
prospects' (2) - 1 freq
prospect (2) - 23 freq
'prospects' (3) - 1 freq
projects (3) - 48 freq
protects (3) - 1 freq
respects (3) - 8 freq
prospective (3) - 5 freq
prospeck (3) - 6 freq
prospert (3) - 1 freq
projectin (4) - 4 freq
propped-up (4) - 1 freq
prophits (4) - 24 freq
projeck's (4) - 3 freq
respectfu (4) - 5 freq
prospered (4) - 1 freq
phosphorus (4) - 1 freq
projecks (4) - 16 freq
proaphit's (4) - 2 freq
present's (4) - 1 freq
proteus (4) - 1 freq
prospectus (0) - 2 freq
prospects (1) - 7 freq
prospect's (2) - 1 freq
prospects' (3) - 1 freq
prospect (3) - 23 freq
prospecks (3) - 2 freq
respects (4) - 8 freq
prospective (4) - 5 freq
prosperous (4) - 2 freq
prospert (5) - 1 freq
projects (5) - 48 freq
prospeck (5) - 6 freq
protects (5) - 1 freq
'prospects' (5) - 1 freq
respected (6) - 11 freq
prophets (6) - 2 freq
prosperity (6) - 9 freq
products (6) - 20 freq
proaphits (6) - 8 freq
prophetis (6) - 1 freq
prosecuter (6) - 1 freq
prospectally (6) - 1 freq
inspects (6) - 1 freq
respectit (6) - 12 freq
presents (6) - 48 freq
SoundEx code - P621
presbyterian - 19 freq
presby - 1 freq
perception - 11 freq
prospect - 23 freq
press-ups - 1 freq
prospeck - 6 freq
presbytery - 4 freq
perspective - 33 freq
pear-shaped - 2 freq
presbyterians - 9 freq
prospects - 7 freq
perceivin - 3 freq
perseveert - 1 freq
prescibe - 1 freq
precipitately - 1 freq
prospect's - 1 freq
prosperity - 9 freq
pork-pie - 2 freq
presbyterianism - 5 freq
perceive - 3 freq
perceptive - 3 freq
preoccupation - 2 freq
perceptions - 6 freq
perceived - 7 freq
prospert - 1 freq
perseverance - 1 freq
prospects' - 1 freq
persevere - 10 freq
precipice - 3 freq
persevered - 2 freq
prosper - 5 freq
perseverence - 1 freq
perishables - 1 freq
prospered - 1 freq
persevering - 1 freq
pre-covid - 1 freq
precovid - 1 freq
prawsper - 1 freq
pre-occupeet - 1 freq
'perceptual - 1 freq
perceivit - 3 freq
preoccupied - 3 freq
parkvall - 2 freq
prespositions - 1 freq
'prospects' - 1 freq
perceptiouns - 1 freq
prose-poems - 1 freq
prospecks - 2 freq
perspicacity - 2 freq
prospective - 5 freq
perceives - 3 freq
porcupine - 1 freq
persavit - 1 freq
periscope - 7 freq
perceptioun - 1 freq
perspiration - 1 freq
prisbetarian - 2 freq
perspectives - 6 freq
precipitous - 1 freq
perceptiveness - 1 freq
prospectus - 2 freq
pro-exploitation - 1 freq
peruasive - 1 freq
persepshins - 1 freq
prosperous - 2 freq
prospectally - 1 freq
precept - 1 freq
porgiepuddingandpie - 1 freq
percyvader - 1 freq
perspicatious - 1 freq
percieved - 1 freq
perseverin - 1 freq
'presbyterian - 1 freq
MetaPhone code - PRSPKTS
prospects - 7 freq
prospect's - 1 freq
prospects' - 1 freq
'prospects' - 1 freq
prospectus - 2 freq
PROSPECTUS
Time to execute Levenshtein function - 0.469692 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.996077 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029299 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.096940 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000993 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.