A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to original in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
original (0) - 117 freq
oreiginal (1) - 27 freq
orignal (1) - 1 freq
originals (1) - 3 freq
'original (1) - 1 freq
origin (2) - 24 freq
oreeginal (2) - 25 freq
oríginal (2) - 1 freq
criminal (2) - 38 freq
origins (2) - 20 freq
originally (2) - 26 freq
originalfm (2) - 16 freq
liminal (3) - 2 freq
trivial (3) - 3 freq
regina (3) - 1 freq
marginal (3) - 2 freq
originality (3) - 3 freq
criminals (3) - 12 freq
signal (3) - 30 freq
orygynale (3) - 1 freq
originates (3) - 2 freq
tribunal (3) - 4 freq
creiminal (3) - 1 freq
orygnale (3) - 1 freq
critical (3) - 19 freq
original (0) - 117 freq
orignal (1) - 1 freq
oreiginal (1) - 27 freq
oreeginal (2) - 25 freq
'original (2) - 1 freq
originals (2) - 3 freq
orygynale (3) - 1 freq
regional (3) - 106 freq
originally (3) - 26 freq
orygnale (3) - 1 freq
origin (3) - 24 freq
origins (3) - 20 freq
oreigin (4) - 1 freq
rinnal (4) - 1 freq
urinal (4) - 6 freq
oreigins (4) - 1 freq
oríginal (4) - 1 freq
retinal (4) - 1 freq
criminal (4) - 38 freq
signal (4) - 30 freq
rgnl (4) - 1 freq
originalfm (4) - 16 freq
virginal (4) - 2 freq
regina (4) - 1 freq
originality (4) - 3 freq
SoundEx code - O625
organist - 3 freq
original - 117 freq
owergien - 1 freq
owergangs - 2 freq
oregano - 2 freq
orkney - 230 freq
oarsman's - 1 freq
oarsmen - 2 freq
owercomers - 1 freq
owercome - 22 freq
organised - 35 freq
organ - 16 freq
origin - 24 freq
oreeginally - 4 freq
organisation's - 1 freq
organisin - 6 freq
owerseein - 5 freq
oreegins - 2 freq
oreeginal - 25 freq
organisers - 9 freq
ower-come - 1 freq
orkneys - 1 freq
orkney's - 3 freq
organized - 2 freq
owercum - 6 freq
organs - 12 freq
originals - 3 freq
originally - 26 freq
owergangin - 5 freq
organics - 1 freq
organisation - 29 freq
owerganged - 1 freq
origins - 20 freq
orginised - 1 freq
orginise - 1 freq
'original - 1 freq
owercomin - 2 freq
organisations - 59 freq
owregangs - 1 freq
organise - 14 freq
organeesin - 1 freq
oreeginality - 1 freq
originated - 2 freq
organize - 1 freq
originality - 3 freq
o'er-come - 1 freq
owregang - 1 freq
owrecome - 1 freq
owercomes - 8 freq
oreegin - 4 freq
owergyan - 1 freq
owercame - 2 freq
owerseen - 4 freq
owercums - 9 freq
owersmen - 2 freq
owergang - 5 freq
organdised - 3 freq
'organised - 1 freq
orgain - 1 freq
oreiginal - 27 freq
orisioun - 1 freq
owrecam - 1 freq
organisan - 1 freq
originates - 2 freq
owrecum - 3 freq
owresimplifies - 1 freq
owresaen - 3 freq
organisational - 2 freq
owresaein - 1 freq
ower-suin - 1 freq
orkneyinga - 1 freq
oorisome - 1 freq
oercome - 1 freq
owregien - 1 freq
oreiginalitie - 1 freq
oreigins - 1 freq
oreigin - 1 freq
oercam - 1 freq
o'ergane - 1 freq
oreiginallie - 1 freq
oreeginatin - 1 freq
owergyaan - 1 freq
organeest - 1 freq
orygynale - 1 freq
€˜originally - 1 freq
orra-kinno - 1 freq
owergaein - 1 freq
organ-broker - 3 freq
organ-brokerage - 1 freq
organeesations - 1 freq
organeeser - 1 freq
organeesed - 1 freq
organises - 1 freq
€œoreeginal - 1 freq
orygnale - 1 freq
organisms - 1 freq
organeized - 1 freq
organisatiouns - 1 freq
oreeginators - 1 freq
owercomer - 1 freq
owergaun - 1 freq
oríginal - 1 freq
organ-playin - 1 freq
orgon - 4 freq
orgon's - 2 freq
organically - 3 freq
owergaeng - 1 freq
owergaengs - 1 freq
organic - 3 freq
oreeginallie - 1 freq
ower-concentratit - 1 freq
organising - 4 freq
ower-simplified - 1 freq
owersman - 1 freq
organizations - 1 freq
organiser - 1 freq
owergeen - 2 freq
owergaen - 1 freq
owercam - 1 freq
orichins - 1 freq
originalfm - 16 freq
orignal - 1 freq
ourweecountry - 1 freq
organisationsÂ’ - 1 freq
orjnhhrrek - 1 freq
owercontentit - 1 freq
owercomes” - 1 freq
orkneywirds - 3 freq
orkneylibrary - 2 freq
owrecomes - 1 freq
orkneyrd - 5 freq
orkneycom - 1 freq
orknithology - 2 freq
orkneyvole - 1 freq
MetaPhone code - ORJNL
original - 117 freq
oreeginally - 4 freq
oreeginal - 25 freq
originally - 26 freq
'original - 1 freq
oreiginal - 27 freq
oreiginallie - 1 freq
orygynale - 1 freq
€˜originally - 1 freq
€œoreeginal - 1 freq
oríginal - 1 freq
oreeginallie - 1 freq
ORIGINAL
Time to execute Levenshtein function - 0.172218 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.338264 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027926 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037206 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000904 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.