Early data leak detection - with AI against internal data theft

The Challenge

Data leaks can be costly for companies. Unintentional access to customer data, for example, can quickly result in horrendous fines—keyword: data protection. Not to mention the significant loss of image and trust in the public eye. Numerous examples from the recent past show how important it is for companies to protect themselves against data leaks. Internal data leaks can occur, for example, as a result of industrial espionage, internal data theft, or hacked employee accounts. According to Statista, a data leak costs companies in Germany an average of 4.5 million euros. It takes around 160 days to close the leak. Early detection speeds up the process and reduces costs – ideally, data theft can be avoided altogether. We developed an early detection system for data leaks for an international mobile phone provider that uses AI to reliably flag unusual behavior in SQL queries. The goal was to detect potential unwanted data access in time to take quick countermeasures.

Our solution

The development team - consisting of data engineers and data scientists - opted for a machine learning approach in which the statistical distribution of SQL activities on the data warehouse is analyzed and deviating behaviour is displayed. The data analysis and the AI algorithm form the basis of the early warning system. Pre-processing of all data was necessary for the modeling - in this case over 1.5 billion data records with a data volume of around 3 terabytes. In order to be able to process this large amount of information, a powerful cloud platform was implemented with the Data Platform. The SQL activities were then classified, the normal behavior analyzed and various criteria for the evaluation determined. The processed data was used to train the AI. Deviations from normal behavior are displayed and evaluated according to a predefined points system.

The result

The data leak early detection system was explicitly adapted to the customer's needs and wishes. Conspicuous behavior in SQL data queries is displayed in good time so that our customer can reliably check whether it is an internal data leak. In addition to improved protection against data theft, the solution offers increased transparency in database activities and IT management can be optimized accordingly. The program code has become the property of the customer so that internal IT specialists can use it freely.

TL;DR: The most important points in a nutshell

Customer benefits

  • Detection and prevention of data theft and industrial espionage
  • Avoidance of loss of image and trust
  • Avoidance of fines and loss of sales
  • Optimization of existing processes
  • Transparency in data movements
  • Strengthening employees' skills in handling sensitive data

The details

Who is it suitable for? For all companies that use sensitive data in their databases and have their own IT department.

Discover our digital solutions

You too can benefit from our extensive experience with projects involving companies of all sizes.

Streamline planning processes and minimize sources of error

Manage complex master data efficiently and flexibly

Minimize data protection risks and ensure reliable compliance

Pass audits faster and reduce compliance costs in the long term

Reduce electricity costs and cut CO₂ emissions in a sustainable way

Identify new business opportunities and minimize risks

Detect errors early and prevent data loss

Reduce licensing costs and modernize logistics processes

Reduce manual tasks and effectively lighten the workload for teams

Reduce manual effort and scale processes more quickly

Standardize intranet pages and reduce maintenance efforts

Identify risks early and reliably comply with legal requirements

Reduce security risks and efficiently automate processes

Process inquiries more quickly and improve service quality over the long term

Detect issues faster and ensure stable system operation

Reduce complexity and integrate systems more quickly

Identify vehicles faster and streamline processes

Manage configurations efficiently and support electric mobility

Deliver new services faster and scale systems flexibly

Reduce inventory costs and significantly minimize errors

Accelerate communication and significantly reduce system costs

Avoid the risk of legal warnings and automate price listings

Increase customer satisfaction and scale services flexibly

Treat wastewater efficiently and significantly reduce resource consumption

Reduce processing times and increase customer satisfaction

Boost sales success and better identify profitable target audiences

HyExpert

Would you like to know how our customized IT solutions can help your business succeed? Do you have questions about specific digital challenges?

Get in touch with us—we’re here to help!