A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to wumman in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
wumman (0) - 575 freq
wum-man (1) - 1 freq
wamman (1) - 1 freq
umman (1) - 15 freq
wuman (1) - 40 freq
wurmman (1) - 1 freq
wumman' (1) - 3 freq
womman (1) - 1 freq
wummin (1) - 231 freq
wummen (1) - 33 freq
wuamman (1) - 2 freq
cumman (1) - 11 freq
wummle (2) - 1 freq
wuhan (2) - 2 freq
cummen (2) - 26 freq
rumlan (2) - 1 freq
hummin (2) - 18 freq
wurkan (2) - 2 freq
comman (2) - 4 freq
wimmen (2) - 39 freq
wutnman (2) - 1 freq
wyman (2) - 1 freq
tummlan (2) - 1 freq
wumann (2) - 1 freq
summon (2) - 10 freq
wumman (0) - 575 freq
wummin (1) - 231 freq
wummen (1) - 33 freq
womman (1) - 1 freq
wuamman (1) - 2 freq
wamman (1) - 1 freq
wurmman (2) - 1 freq
wum-man (2) - 1 freq
wimmen (2) - 39 freq
cumman (2) - 11 freq
wumimin (2) - 1 freq
wumman' (2) - 3 freq
wuman (2) - 40 freq
wimmin (2) - 31 freq
umman (2) - 15 freq
bummin (3) - 20 freq
worman (3) - 1 freq
wumin (3) - 5 freq
cummin (3) - 30 freq
warman (3) - 2 freq
gummin (3) - 1 freq
wummer (3) - 1 freq
swimman (3) - 2 freq
wumen (3) - 7 freq
weiman (3) - 1 freq
SoundEx code - W550
wimmen - 39 freq
woman - 101 freq
wumman - 575 freq
weemin - 176 freq
wunnin - 10 freq
winnin - 88 freq
wummin - 231 freq
wuman - 40 freq
women - 77 freq
wemen - 16 freq
whinin - 8 freq
winnen - 1 freq
weimen - 12 freq
weemen - 176 freq
wanin - 2 freq
winnowin - 3 freq
wumann - 1 freq
wemeen - 1 freq
wummen - 33 freq
wumen - 7 freq
wamman - 1 freq
weemen' - 1 freq
wimmin - 31 freq
wumman' - 3 freq
wummin' - 1 freq
weeman - 9 freq
wuamman - 2 freq
wanun - 1 freq
weiman - 1 freq
womman - 1 freq
wan-man - 1 freq
weman - 1 freq
winnan - 1 freq
weimun - 1 freq
weimin - 1 freq
'woman' - 1 freq
whinneyin - 1 freq
wiemen - 1 freq
wyman - 1 freq
waamin - 1 freq
wumin - 5 freq
€œweemen - 2 freq
wum-man - 1 freq
wummim - 1 freq
wimen - 1 freq
wimin - 1 freq
‘women - 3 freq
women' - 1 freq
weewummin - 1 freq
MetaPhone code - WMN
wimmen - 39 freq
woman - 101 freq
wumman - 575 freq
weemin - 176 freq
wummin - 231 freq
wuman - 40 freq
women - 77 freq
wemen - 16 freq
weimen - 12 freq
weemen - 176 freq
wumann - 1 freq
wemeen - 1 freq
wummen - 33 freq
wumen - 7 freq
wamman - 1 freq
weemen' - 1 freq
wimmin - 31 freq
wumman' - 3 freq
wummin' - 1 freq
weeman - 9 freq
wuamman - 2 freq
weiman - 1 freq
womman - 1 freq
weman - 1 freq
weimun - 1 freq
weimin - 1 freq
'woman' - 1 freq
wiemen - 1 freq
waamin - 1 freq
wumin - 5 freq
€œweemen - 2 freq
wimen - 1 freq
wimin - 1 freq
‘women - 3 freq
women' - 1 freq
WUMMAN
wumman - 575 freq
woman - 101 freq
women - 77 freq
dug-wumman - 7 freq
wumman's - 28 freq
wuman - 40 freq
wummans - freq
wummin - 231 freq
wummin's - 10 freq
wummin-bodie - 4 freq
wummen - 33 freq
wumin - 5 freq
Time to execute Levenshtein function - 0.189233 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.330682 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027657 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036876 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001072 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.