A model that recognises the characters in images generated by the “Really Simple CAPTCHA” plugin and hence breaks the CAPTCHA.
The plugin generates 4-letter CAPTCHAs using a random mix of four different fonts.
A dataset of 10,000 CAPTCHA images generated by the plugin was used. The images are in the generated_captcha_images folder.
Because the dataset is small, the model is not trained on whole CAPTCHA images. Instead, each image is segmented into its individual characters (letters) to form a second dataset, and the model is trained on these letters.
In this way, the model identifies each letter individually, and the predicted letters are combined to break the CAPTCHA.
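The "predict each letter, then combine" step can be sketched as follows. This is a minimal illustration, not the project's code: the function name `read_captcha`, the `(x, y, width, height)` box format, and the lookup-table classifier standing in for the trained model are all assumptions.

```python
from typing import Callable, List, Sequence, Tuple

# (x, y, width, height) of one letter inside the CAPTCHA image (assumed format).
Box = Tuple[int, int, int, int]


def read_captcha(boxes: Sequence[Box], classify: Callable[[Box], str]) -> str:
    """Predict one letter per box and join them in left-to-right order."""
    # Sort by the x coordinate so the letters come out in reading order.
    ordered: List[Box] = sorted(boxes, key=lambda box: box[0])
    return "".join(classify(box) for box in ordered)


# Stand-in classifier: a lookup table instead of a trained model.
fake_predictions = {
    (40, 5, 12, 20): "B",
    (10, 5, 12, 20): "A",
    (70, 5, 12, 20): "C",
    (100, 5, 12, 20): "D",
}

if __name__ == "__main__":
    print(read_captcha(list(fake_predictions), fake_predictions.__getitem__))
```

Sorting by the x coordinate matters because a segmentation step does not necessarily return the letters in reading order.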
OpenCV was used to perform character segmentation and to build the dataset of individual letters. Some of the functions used are:
A shallow convolutional model was created and trained on the individual letters. After segmentation, it predicts each letter in a CAPTCHA, and the combined predictions break the CAPTCHA.
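A shallow convolutional classifier for single letters might look like the sketch below, written with Keras. The layer sizes, the 20x20 grayscale input, the default of 32 output classes, and the function name `build_letter_model` are assumptions chosen for illustration; the actual architecture used here may differ.

```python
from tensorflow.keras import layers, models


def build_letter_model(num_classes=32, input_shape=(20, 20, 1)):
    """Two conv/pool stages followed by a small dense classifier.

    `num_classes` and `input_shape` are assumed values; set them to match
    the real character set and the size the letter crops are resized to.
    """
    model = models.Sequential([
        layers.Input(shape=input_shape),
        layers.Conv2D(20, (5, 5), padding="same", activation="relu"),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Conv2D(50, (5, 5), padding="same", activation="relu"),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Flatten(),
        layers.Dense(500, activation="relu"),
        # One probability per possible character.
        layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(
        loss="categorical_crossentropy",
        optimizer="adam",
        metrics=["accuracy"],
    )
    return model
```

Training would then call `model.fit` on the letter images with one-hot labels, and prediction would take the index of the largest softmax output for each segmented letter.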
Create a folder named extracted_letter_images in the working directory before running the code.
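If creating the folder by hand is inconvenient, it can also be created from Python. This is a small sketch; the helper name `ensure_output_folder` is hypothetical and not part of the project.

```python
import os


def ensure_output_folder(base_dir=".", name="extracted_letter_images"):
    """Create the folder for segmented letters if it does not exist yet."""
    path = os.path.join(base_dir, name)
    os.makedirs(path, exist_ok=True)  # no error if the folder already exists
    return path
```

Calling `ensure_output_folder()` from the working directory before the segmentation step has the same effect as adding the folder manually.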