Levenshtein | Double Levenshtein | SoundEx | MetaPhone | Manually curated |
---|---|---|---|---|
woman (0) - 100 freq wyman (1) - 1 freq wogan (1) - 1 freq womans (1) - 5 freq wuman (1) - 40 freq weman (1) - 1 freq oman (1) - 2 freq womman (1) - 1 freq women (1) - 77 freq worman (1) - 1 freq roman (1) - 76 freq coman (1) - 41 freq wimin (2) - 1 freq so-an (2) - 2 freq wona (2) - 1 freq warman (2) - 2 freq wumman (2) - 588 freq wooaa (2) - 4 freq coman' (2) - 1 freq jonan (2) - 2 freq hoban (2) - 5 freq tman (2) - 1 freq dosan (2) - 1 freq doan (2) - 9 freq 'coman (2) - 1 freq |
woman (0) - 100 freq women (1) - 77 freq wuman (1) - 40 freq weman (1) - 1 freq wyman (1) - 1 freq weeman (2) - 9 freq wumen (2) - 7 freq wemen (2) - 16 freq weiman (2) - 1 freq wimen (2) - 1 freq wumin (2) - 5 freq wimin (2) - 1 freq coman (2) - 41 freq womman (2) - 1 freq oman (2) - 2 freq womans (2) - 5 freq wogan (2) - 1 freq roman (2) - 76 freq worman (2) - 1 freq booman (3) - 2 freq whan (3) - 2757 freq womanly (3) - 2 freq comin (3) - 1066 freq somane (3) - 1 freq omen (3) - 5 freq |
SoundEx code - W550 wimmen - 39 freq woman - 100 freq wumman - 588 freq weemin - 178 freq wunnin - 10 freq winnin - 90 freq wummin - 233 freq wuman - 40 freq women - 77 freq wemen - 16 freq whinin - 8 freq winnen - 1 freq weimen - 12 freq weemen - 176 freq wanin - 2 freq winnowin - 3 freq wumann - 1 freq wemeen - 1 freq wummen - 36 freq wumen - 7 freq wamman - 1 freq weemen' - 1 freq wimmin - 31 freq wumman' - 3 freq wummin' - 1 freq weeman - 9 freq wuamman - 2 freq wanun - 1 freq weiman - 1 freq womman - 1 freq wan-man - 1 freq weman - 1 freq winnan - 1 freq weimun - 1 freq weimin - 1 freq 'woman' - 1 freq whinneyin - 1 freq wiemen - 1 freq wyman - 1 freq waamin - 1 freq wumin - 5 freq €œweemen - 2 freq wum-man - 1 freq wummim - 1 freq wimen - 1 freq wimin - 1 freq ‘women - 3 freq women' - 1 freq weewummin - 1 freq |
MetaPhone code - WMN wimmen - 39 freq woman - 100 freq wumman - 588 freq weemin - 178 freq wummin - 233 freq wuman - 40 freq women - 77 freq wemen - 16 freq weimen - 12 freq weemen - 176 freq wumann - 1 freq wemeen - 1 freq wummen - 36 freq wumen - 7 freq wamman - 1 freq weemen' - 1 freq wimmin - 31 freq wumman' - 3 freq wummin' - 1 freq weeman - 9 freq wuamman - 2 freq weiman - 1 freq womman - 1 freq weman - 1 freq weimun - 1 freq weimin - 1 freq 'woman' - 1 freq wiemen - 1 freq waamin - 1 freq wumin - 5 freq €œweemen - 2 freq wimen - 1 freq wimin - 1 freq ‘women - 3 freq women' - 1 freq |
WOMAN wumman - 588 freq woman - 100 freq women - 77 freq dug-wumman - 7 freq wumman's - 28 freq wuman - 40 freq wummans - freq wummin - 233 freq wummin's - 10 freq wummin-bodie - 4 freq wummen - 36 freq wumin - 5 freq |
Time to execute Levenshtein function - 0.187492 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings |
Time to execute Double Levenshtein function - 0.368139 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants. |
Time to execute SoundEx function - 0.028214 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling. |
Time to execute MetaPhone function - 0.037201 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar. |
Time to execute Manually curated function - 0.001072 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered. |