A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to polka in Corpus

Levenshtein - Double Levenshtein - SoundEx - MetaPhone - Manually curated
Levenshtein (distance in brackets)
polka (0) - 6 freq
pola (1) - 6 freq
polkas (1) - 2 freq
polks (1) - 1 freq
polis (2) - 264 freq
yolk (2) - 6 freq
pbpka (2) - 1 freq
pouk (2) - 5 freq
poloko (2) - 27 freq
parka (2) - 2 freq
wonka (2) - 297 freq
holla (2) - 1 freq
ponky (2) - 1 freq
yolks (2) - 2 freq
folky (2) - 1 freq
ilka (2) - 887 freq
pooks (2) - 1 freq
tonka (2) - 2 freq
pock (2) - 6 freq
-ilka (2) - 1 freq
pilla (2) - 11 freq
zola (2) - 1 freq
lokka (2) - 1 freq
'ilka (2) - 4 freq
moka (2) - 1 freq
Double Levenshtein (distance in brackets)
polka (0) - 6 freq
poloko (2) - 27 freq
polks (2) - 1 freq
pola (2) - 6 freq
polkas (2) - 2 freq
pocky (3) - 2 freq
poly (3) - 12 freq
porky (3) - 4 freq
palma (3) - 2 freq
plouk (3) - 11 freq
polar (3) - 24 freq
puka (3) - 1 freq
folke (3) - 1 freq
poky (3) - 8 freq
polos (3) - 1 freq
poles (3) - 30 freq
tolk (3) - 2 freq
pole (3) - 45 freq
packa (3) - 1 freq
plook (3) - 21 freq
powk (3) - 10 freq
polly (3) - 12 freq
pork (3) - 21 freq
pook (3) - 2 freq
folk (3) - 1713 freq
SoundEx code - P420
pools - 19 freq
please - 559 freq
place - 1701 freq
plays - 129 freq
ploys - 75 freq
pulls - 61 freq
poles - 30 freq
poloko - 27 freq
'poloko - 2 freq
polis - 264 freq
plus - 71 freq
pool-she - 3 freq
pals - 365 freq
plash - 6 freq
peals - 2 freq
plaque - 19 freq
polish - 70 freq
place' - 8 freq
'please - 35 freq
pal's - 15 freq
plush - 6 freq
playhous - 4 freq
plug - 28 freq
pillas - 3 freq
pills - 16 freq
police - 77 freq
pulse - 23 freq
paul's - 7 freq
pulys - 1 freq
pliss - 11 freq
ple-e-e-ase - 1 freq
puil-she - 1 freq
pleugh - 2 freq
pliskie - 11 freq
placey - 2 freq
pleece - 1 freq
pleys - 30 freq
palace - 98 freq
plack - 3 freq
plook - 21 freq
'polis - 1 freq
pals's - 1 freq
pleuch - 12 freq
pails - 14 freq
piles - 26 freq
plees - 6 freq
pleas - 9 freq
plague - 25 freq
pales - 2 freq
peels - 30 freq
plous - 3 freq
pluck - 12 freq
pillage - 4 freq
plaece - 11 freq
puuls - 1 freq
plaza - 3 freq
pillows - 4 freq
puils - 10 freq
plaice - 13 freq
peelce - 1 freq
plaise - 10 freq
pill's - 1 freq
paal's - 1 freq
plooks - 11 freq
plec - 1 freq
plaaaaaacceee - 1 freq
pollok - 8 freq
pals' - 3 freq
plukey - 7 freq
pluke - 1 freq
policy - 149 freq
placie - 14 freq
poliacs - 1 freq
palais - 2 freq
palsy - 2 freq
puhlease - 1 freq
pollis - 4 freq
plish - 2 freq
peleg - 3 freq
pillaes - 7 freq
plough - 7 freq
ploos - 6 freq
pleuch' - 1 freq
pauls - 1 freq
pallas - 2 freq
plaicie - 1 freq
plouks - 9 freq
pleg - 2 freq
plaese - 5 freq
'palais - 1 freq
'palácio - 1 freq
'palace - 1 freq
palace' - 2 freq
'please' - 1 freq
plece - 13 freq
palliasse - 11 freq
pleise - 2 freq
pailace - 8 freq
powls - 2 freq
plies - 3 freq
pulleys - 2 freq
plaags - 1 freq
polka - 6 freq
'place' - 1 freq
'plaece' - 2 freq
plicko - 1 freq
policie - 23 freq
playock - 1 freq
plashy - 1 freq
plags - 4 freq
polks - 1 freq
pollocks - 1 freq
poalis - 1 freq
palak - 1 freq
phyleus - 2 freq
plaess - 18 freq
placks - 3 freq
please' - 1 freq
plexie - 1 freq
palus - 1 freq
plouk - 11 freq
plaes - 1 freq
“please - 10 freq
‘policy - 1 freq
plisky - 1 freq
poliss - 8 freq
puls - 4 freq
pollock - 3 freq
polls - 20 freq
plsay - 1 freq
palazzo - 1 freq
plews - 2 freq
peills - 1 freq
pleese - 1 freq
phallus - 2 freq
“police - 1 freq
‘plays - 1 freq
plucks - 1 freq
plucky - 1 freq
“pleeze - 1 freq
‘please - 3 freq
play's - 2 freq
palls - 5 freq
phials - 1 freq
phil's - 4 freq
pòliss - 2 freq
phyllis - 1 freq
…polish - 1 freq
playhouse - 1 freq
plugs - 5 freq
playocks - 3 freq
—please - 1 freq
plaiks - 1 freq
’please - 2 freq
pleuchie - 1 freq
pls - 13 freq
pleuks - 1 freq
plz - 2 freq
place’ - 1 freq
paulza - 1 freq
pleeeeeaaase - 1 freq
pleasssssssssssssssssseeeeeee - 1 freq
pleeeeeeeeeease - 1 freq
pleeeeeeeease - 1 freq
pleeeeeease - 1 freq
pleeeeease - 6 freq
pleeeeeeeeeeease - 1 freq
pleeeease - 1 freq
pleeeees - 1 freq
pallas's - 1 freq
'plays - 1 freq
phils - 1 freq
pillls - 1 freq
'pleece' - 1 freq
polos - 1 freq
“pauls” - 1 freq
MetaPhone code - PLK
poloko - 27 freq
'poloko - 2 freq
plaque - 19 freq
plug - 28 freq
plack - 3 freq
plook - 21 freq
plague - 25 freq
pluck - 12 freq
plec - 1 freq
pollok - 8 freq
plukey - 7 freq
pluke - 1 freq
peleg - 3 freq
pleg - 2 freq
polka - 6 freq
plicko - 1 freq
palak - 1 freq
plouk - 11 freq
pollock - 3 freq
plucky - 1 freq
Manually curated - POLKA
Time to execute Levenshtein function - 0.181459 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
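The distance described above can be sketched in Python as follows. This is an illustration only: the site's own implementation is not shown, and the names `levenshtein`, `nearest` and `sample` are assumptions made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, max_distance=2):
    """Hypothetical helper: vocabulary words within max_distance, closest first."""
    scored = [(levenshtein(word, other), other) for other in vocabulary]
    return sorted(pair for pair in scored if pair[0] <= max_distance)


# A few words taken from the listing above.
sample = ["polka", "pola", "polkas", "polis", "pole"]
print(nearest("polka", sample))
```

Ties at the same distance are broken alphabetically here, which is a choice made for the sketch; the listing above does not appear to use that order.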
Time to execute Double Levenshtein function - 0.360778 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
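A minimal Python sketch of this two-pass idea follows. The vowel set is an assumption: the page does not say which letters are stripped, but treating `y` as a vowel alongside `a e i o u` reproduces scores in the listing such as pola (2), poloko (2) and pocky (3). The names `VOWELS`, `strip_vowels` and `double_levenshtein` are invented for the example.

```python
VOWELS = set("aeiouy")  # assumption: the site's actual vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("polka", "pola"))    # full words: 1, skeletons plk/pl: 1
print(double_levenshtein("polka", "poloko"))  # full words: 2, skeletons plk/plk: 0
```

Because a consonant change is counted in both passes while a vowel change is counted only in the first, words that share a consonant skeleton, such as poloko, rank closer than they do under the plain distance.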
Time to execute SoundEx function - 0.027812 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
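The encoding can be sketched in Python as below. This follows the commonly described American Soundex rules (keep the first letter, map consonants to digits, collapse adjacent duplicates, pad to four characters); whether the site's function handles every edge case the same way is an assumption. The names `CODES` and `soundex` are chosen for the example.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code; non a-z characters are ignored."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
            if len(code) == 4:
                break
        previous = digit  # a vowel resets this, so a repeated digit can follow
    return code.ljust(4, "0")


for word in ("polka", "please", "pool-she", "pliskie"):
    print(word, soundex(word))
```

All four sample words come from the listing above and encode to P420, which illustrates how coarse the key is: only the first letter and the first few consonant classes survive.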
Time to execute MetaPhone function - 0.037227 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000901 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
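A lookup of this kind can be sketched as a plain dictionary in Python. The structure and the names `LEXICON`, `curated_lemma` and `variants` are assumptions for illustration; only the polka to POLKA pairing comes from the result shown on this page, and the other entries are invented.

```python
# Hypothetical hand-built lexicon mapping surface forms to lemmas.
LEXICON = {
    "polka": "POLKA",   # pairing shown in the results above
    "polkas": "POLKA",  # invented plural entry
    "pokla": "POLKA",   # invented typo entry
}


def curated_lemma(word: str):
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants(lemma: str):
    """All surface forms recorded for a lemma, in alphabetical order."""
    return sorted(form for form, target in LEXICON.items() if target == lemma)


print(curated_lemma("polka"))
print(curated_lemma("ahint"))  # not in this toy table, so None
print(variants("POLKA"))
```

A dictionary lookup takes constant time regardless of corpus size, which is consistent with this method being the fastest of the five timings reported.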