A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to world’ in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated

Levenshtein
world’ (0) - 1 freq
world' (2) - 3 freq
warld” (2) - 2 freq
world's (2) - 8 freq
worlds (2) - 13 freq
world (2) - 433 freq
worlde (2) - 1 freq
warlds (3) - 40 freq
old” (3) - 1 freq
wordin (3) - 3 freq
wouldn’t (3) - 2 freq
worls (3) - 5 freq
wurlds (3) - 1 freq
wordie (3) - 14 freq
word' (3) - 2 freq
worl (3) - 35 freq
wold (3) - 1 freq
would’ve (3) - 1 freq
words' (3) - 1 freq
fired’ (3) - 1 freq
would (3) - 690 freq
gold” (3) - 1 freq
warlds' (3) - 1 freq
worded (3) - 1 freq
woarld (3) - 7 freq
Double Levenshtein
world’ (0) - 1 freq
warld” (3) - 2 freq
world' (4) - 3 freq
world (4) - 433 freq
worlde (4) - 1 freq
world's (4) - 8 freq
worlds (4) - 13 freq
warld' (5) - 2 freq
woarld (5) - 7 freq
warlds' (5) - 1 freq
warld (5) - 820 freq
wirlds (5) - 3 freq
round’ (5) - 1 freq
wirld (5) - 73 freq
wirld's (5) - 5 freq
wurld's (5) - 2 freq
warld's (5) - 40 freq
wurld (5) - 55 freq
fired’ (5) - 1 freq
warlds (5) - 40 freq
would’ve (5) - 1 freq
warldly (5) - 4 freq
wurlds (5) - 1 freq
words (6) - 802 freq
wullie’ (6) - 1 freq
SoundEx code - W643
warld - 820 freq
world - 433 freq
warld's - 40 freq
whirlt - 3 freq
wirld - 73 freq
world's - 8 freq
warlds - 40 freq
world' - 3 freq
worldwide - 4 freq
'world - 1 freq
wurld's - 2 freq
wurld - 55 freq
wurlds - 1 freq
warlwide - 5 freq
warldly - 4 freq
world-famous - 1 freq
warlt's - 2 freq
worlds - 13 freq
warldlie - 1 freq
world-player - 1 freq
warld-renooned - 1 freq
'warld-player' - 1 freq
warld-wide - 6 freq
whirlit - 1 freq
warld-class - 1 freq
world'll - 2 freq
warld-record-cowpin - 1 freq
warld-record - 1 freq
warldly-wise - 1 freq
warldwide - 5 freq
warld-renouned - 1 freq
wirlds - 3 freq
warl-wide - 1 freq
warld-like - 1 freq
worl'wide - 1 freq
warld' - 2 freq
warldlike - 1 freq
wirldwide - 1 freq
wirld's - 5 freq
warld-blinnd - 2 freq
whurlit - 1 freq
warldlie-wyce - 1 freq
warlds' - 1 freq
world-class - 1 freq
worldbookday - 5 freq
warld-kent - 1 freq
whurled - 1 freq
warld-famous - 1 freq
woarld - 7 freq
‘world - 1 freq
waarld - 1 freq
worldsuicidepreventionday - 1 freq
wurruld - 4 freq
worldcuprussia - 1 freq
world’ - 1 freq
wurrild - 2 freq
warld” - 2 freq
worlde - 1 freq
worldceilidh - 6 freq
MetaPhone code - WRLT
warld - 820 freq
world - 433 freq
whirlt - 3 freq
wirld - 73 freq
world' - 3 freq
'world - 1 freq
wurld - 55 freq
whirlit - 1 freq
warld' - 2 freq
whurlit - 1 freq
whurled - 1 freq
woarld - 7 freq
‘world - 1 freq
waarld - 1 freq
wurruld - 4 freq
world’ - 1 freq
wurrild - 2 freq
warld” - 2 freq
worlde - 1 freq
Manually curated
WORLD’
Time to execute Levenshtein function - 0.209897 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
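As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code behind this page; the function name `levenshtein` is chosen for illustration. Python compares Unicode characters one by one, so the results for words containing typographic quotes may differ from the figures listed on this page.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # Keep the shorter word in b so the working rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for other in ("warld", "worlds", "would", "words"):
        print("world", other, levenshtein("world", other))
```

A nearest-neighbour search of the kind shown on this page could then compute this distance against every distinct word in the corpus and sort by the result.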
Time to execute Double Levenshtein function - 0.360428 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
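The idea can be sketched in Python as follows. This is a guess at the approach rather than the page's own code: the vowel set (`aeiou`) and the names `strip_vowels` and `double_levenshtein` are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (two-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiouAEIOU")  # assumption: the page may use a different vowel set


def strip_vowels(word: str) -> str:
    """Remove vowels so that only the consonant skeleton is left."""
    return "".join(ch for ch in word if ch not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for other in ("warld", "would", "words"):
        print("world", other, double_levenshtein("world", other))
```

Because a vowel change costs nothing in the second pass, a pair like world and warld scores lower than a pair like world and would, where a consonant is lost.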
Time to execute SoundEx function - 0.027129 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
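For readers who want to see how a code such as W643 comes about, here is a minimal Python sketch of the commonly described American Soundex rules. It is an illustration only and may differ in edge cases from the function this page calls. Dropping non-letters (apostrophes, hyphens, quotes) before encoding is an assumption of the example.

```python
# Consonant groups and their Soundex digits. Vowels, y, h and w have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code of word ('' if it has no letters)."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    first = letters[0]
    encoded = first.upper()
    prev = CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != prev:
            encoded += code
        prev = code  # a vowel resets prev, so a repeated digit is kept after it
    return (encoded + "000")[:4]  # pad with zeros or truncate to four characters


if __name__ == "__main__":
    for w in ("world", "warld", "whirlt", "worldwide", "wurruld"):
        print(w, soundex(w))
```

Since only the first letter and the next three consonant digits are kept, long compounds such as worldwide end up with the same code as world, which is why the SoundEx list above is the longest of the five.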
Time to execute MetaPhone function - 0.036781 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000861 milliseconds
Manual curation uses a lookup table (lexicon), created by hand, that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
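A lookup of this kind can be sketched in Python with a plain dictionary. The entries below, the choice of `warld` as the lemma, and the names `LEXICON`, `lemma_of` and `curated_neighbours` are all hypothetical, made up for the example; the real lexicon is not shown on this page.

```python
from typing import List, Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
LEXICON = {
    "warld": "warld",
    "world": "warld",
    "wirld": "warld",
    "wurld": "warld",
    "woarld": "warld",
}


def lemma_of(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


def curated_neighbours(word: str) -> List[str]:
    """Return the other variants that share a lemma with word, sorted."""
    lemma = lemma_of(word)
    if lemma is None:
        return []  # not every word is in the table
    key = word.lower()
    return sorted(v for v, l in LEXICON.items() if l == lemma and v != key)


if __name__ == "__main__":
    print(lemma_of("Wurld"))
    print(curated_neighbours("world"))
    print(curated_neighbours("ablow"))
```

A dictionary lookup does no string comparison against the rest of the corpus, which fits the very short execution time reported for this method.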