Engineering Journal of Don

Text information extraction from images of modified text
- Misyukov G.I.
- Abstract
- pdf (rus)
This article describes development of a module which provides opportunity to extract text from images of modified text, which can be used to bypass existing information security software and spread sensitive information out of company. The developed module is based on Python programming language with additional libraries expanding basic functional. After creating a module, additional module allowing user to create modified text by themselves was made. Additional module uses a special dictionary that can change any letter to alternative and generate more modified texts in order to test and find the weak spots of a module. To integrate the module into company’s information infrastructure DLP-systems were chosen, because of their popularity and ease of the integration method. To integrate DLP-system and text extraction module we used a mail-server with BCC copies of a mail traffic to send text and images to our module local mail server, additional mechanisms extracts pictures and process them within the module, after what it sends back the image and the text from it. A few rounds of testing were done resulting in nearly 97% accuracy. Future development consider expanding for multi-row processing and adding new alternative symbols after first mention them in text by using a CNN or standard deviation of images pixel and pixel comparison.

Keywords: information security, data leakage, text analisys, image analisys, modified data analisys, protection against steganography

01.10.2024

The conference "Science and Higher Education facing modern challenges: strategies and development prospects"

The scientific and practical conference with international participation "Science and Higher Education facing modern challenges: strategies and development prospects" will be held on November 29,...

More...

01.10.2024

The International Scientific and Practical Conference on Advanced Research in Engineering and Applied Technologies

The International Scientific and Practical Conference on Advanced Research in Engineering and Applied Technologies will be held on November 19 - November 20, 2024 in Samarkand (Uzbekistan). More...

More...

01.10.2024

Conference with international participation "Education – science – production"

The All-Russian scientific and practical conference with international participation "Education – science – production" will be held on November 15, 2024 in Chita. More details:...

More...

01.10.2024

Scientific and practical conference "Spatial development of regions in the context of socio-economic sovereignty of Russia"

The All-Russian scientific and practical conference "Spatial development of regions in the context of socio-economic sovereignty of Russia" will be held on November 22, 2024 in Nizhnevartovsk. More...

More...

01.10.2024

The X All-Russian conference "Engineering technologies: traditions, innovations, vectors of development"

The X All-Russian scientific and practical conference with international participation "Engineering technologies: traditions, innovations, vectors of development" will be held on November 13 -...

More...

01.10.2024

The All-Russian scientific and practical conference "Education – Science – production"

The All-Russian scientific and practical conference with international participation "Education – Science – production" will be held on November 15, 2024 in Chita. More...

More...

Text information extraction from images of modified text

News

News archive