A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dirt in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dirt (0) - 70 freq
airt (1) - 297 freq
dint (1) - 10 freq
dart (1) - 16 freq
durt (1) - 2 freq
dort (1) - 2 freq
birt (1) - 1 freq
wirt (1) - 27 freq
dirlt (1) - 4 freq
virt (1) - 1 freq
nirt (1) - 1 freq
dist (1) - 18 freq
dir (1) - 400 freq
dirty (1) - 107 freq
dairt (1) - 6 freq
diet (1) - 71 freq
irt (1) - 3 freq
pirt (1) - 1 freq
dirl (1) - 46 freq
dirk (1) - 29 freq
dirs (1) - 28 freq
dire (1) - 9 freq
dirt' (1) - 1 freq
dirts (1) - 1 freq
dird (1) - 2 freq
dirt (0) - 70 freq
dort (1) - 2 freq
dirty (1) - 107 freq
dairt (1) - 6 freq
durt (1) - 2 freq
dart (1) - 16 freq
dird (2) - 2 freq
dirs (2) - 28 freq
dirt' (2) - 1 freq
dirts (2) - 1 freq
daurt (2) - 6 freq
doort (2) - 1 freq
driet (2) - 2 freq
dorty (2) - 7 freq
drat (2) - 2 freq
dirtie (2) - 1 freq
dirk (2) - 29 freq
durty (2) - 11 freq
dire (2) - 9 freq
dirlt (2) - 4 freq
dirl (2) - 46 freq
wirt (2) - 27 freq
birt (2) - 1 freq
airt (2) - 297 freq
dint (2) - 10 freq
SoundEx code - D630
drouthy - 17 freq
dirt - 70 freq
dirty - 107 freq
driet - 2 freq
dreid - 51 freq
draaed - 1 freq
dried - 74 freq
durty - 11 freq
dairt - 6 freq
drouth - 49 freq
dorty - 7 freq
dread - 34 freq
doort - 1 freq
drooth - 20 freq
door-ti - 1 freq
daured - 24 freq
draw'd - 1 freq
dirt' - 1 freq
dorothy - 45 freq
dared - 11 freq
drouthie - 12 freq
droothie - 1 freq
dearth - 9 freq
dird - 2 freq
dreed - 13 freq
dryed - 4 freq
dirtie - 1 freq
droid - 1 freq
dart - 16 freq
dorito - 1 freq
drawed - 1 freq
drite - 1 freq
daurt - 6 freq
draa'd - 3 freq
durt - 2 freq
drat - 2 freq
dry't - 2 freq
daurd - 1 freq
droothy - 4 freq
druith - 1 freq
dorty-wye - 1 freq
daared - 2 freq
durtie - 2 freq
dort - 2 freq
derth - 2 freq
drowt - 1 freq
dredd - 6 freq
draed - 1 freq
darth - 3 freq
derrida - 1 freq
derd - 1 freq
dortie - 3 freq
dheireadh - 1 freq
druid - 2 freq
'dreid' - 1 freq
daar't - 1 freq
€˜dread - 1 freq
dairth - 1 freq
dorado - 1 freq
dryte - 1 freq
€œdorothy - 1 freq
dee-haird - 1 freq
dae-or-dee - 1 freq
€˜dirty - 1 freq
doherty - 1 freq
droit - 1 freq
drowth - 1 freq
MetaPhone code - TRT
troot - 55 freq
tried - 687 freq
dirt - 70 freq
dirty - 107 freq
driet - 2 freq
trade - 113 freq
tired - 102 freq
trot - 17 freq
treat - 141 freq
tread - 13 freq
trod - 14 freq
treid - 3 freq
dreid - 51 freq
draaed - 1 freq
tredd - 11 freq
tortie - 5 freq
dried - 74 freq
durty - 11 freq
dairt - 6 freq
dorty - 7 freq
dread - 34 freq
doort - 1 freq
treaty - 21 freq
tred - 33 freq
door-ti - 1 freq
daured - 24 freq
trait - 11 freq
draw'd - 1 freq
dirt' - 1 freq
toured - 3 freq
tarot - 5 freq
taured - 2 freq
dared - 11 freq
traid - 3 freq
tarty - 1 freq
dird - 2 freq
tirrt - 4 freq
tyred - 1 freq
try't - 23 freq
tart - 12 freq
tairt - 3 freq
treatie - 2 freq
dreed - 13 freq
treet - 6 freq
dirtie - 1 freq
tarred - 4 freq
trad - 12 freq
droid - 1 freq
dart - 16 freq
tirade - 2 freq
dorito - 1 freq
drite - 1 freq
daurt - 6 freq
draa'd - 3 freq
durt - 2 freq
tourit - 2 freq
trout - 6 freq
trehd - 1 freq
drat - 2 freq
tire't - 4 freq
dry't - 2 freq
daurd - 1 freq
trudy - 1 freq
turd - 3 freq
touered - 1 freq
trate - 2 freq
tirred - 2 freq
tardy - 2 freq
taurred - 1 freq
torrid - 1 freq
traet - 7 freq
daared - 2 freq
too'ered - 1 freq
durtie - 2 freq
teared - 1 freq
dort - 2 freq
tret - 3 freq
traed - 5 freq
tryd - 1 freq
drowt - 1 freq
tretty - 1 freq
traety - 1 freq
hytered - 3 freq
trootie - 3 freq
trott - 4 freq
truité - 1 freq
dredd - 6 freq
draed - 1 freq
drogheda - 1 freq
treed - 1 freq
triyd - 1 freq
derrida - 1 freq
derd - 1 freq
dortie - 3 freq
trou'd - 1 freq
hytert - 3 freq
druid - 2 freq
traute - 1 freq
tritt - 1 freq
'dreid' - 1 freq
daar't - 1 freq
€˜dread - 1 freq
dorado - 1 freq
tird - 1 freq
tooered - 1 freq
dryte - 1 freq
torty - 1 freq
tiret - 1 freq
dae-or-dee - 1 freq
troat - 3 freq
€˜dirty - 1 freq
droit - 1 freq
tìoraidh - 1 freq
DIRT
Time to execute Levenshtein function - 0.187922 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.330852 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029423 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039754 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000898 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.