Early data leak detection - with AI against internal data theft

The Challenge

Data leaks can be costly for companies. Unintentional access to customer data, for example, can quickly result in horrendous fines—keyword: data protection. Not to mention the significant loss of image and trust in the public eye. Numerous examples from the recent past show how important it is for companies to protect themselves against data leaks. Internal data leaks can occur, for example, as a result of industrial espionage, internal data theft, or hacked employee accounts. According to Statista, a data leak costs companies in Germany an average of 4.5 million euros. It takes around 160 days to close the leak. Early detection speeds up the process and reduces costs – ideally, data theft can be avoided altogether. We developed an early detection system for data leaks for an international mobile phone provider that uses AI to reliably flag unusual behavior in SQL queries. The goal was to detect potential unwanted data access in time to take quick countermeasures.

Our solution

The development team - consisting of data engineers and data scientists - opted for a machine learning approach in which the statistical distribution of SQL activities on the data warehouse is analyzed and deviating behaviour is displayed. The data analysis and the AI algorithm form the basis of the early warning system. Pre-processing of all data was necessary for the modeling - in this case over 1.5 billion data records with a data volume of around 3 terabytes. In order to be able to process this large amount of information, a powerful cloud platform was implemented with the Data Platform. The SQL activities were then classified, the normal behavior analyzed and various criteria for the evaluation determined. The processed data was used to train the AI. Deviations from normal behavior are displayed and evaluated according to a predefined points system.

The result

The data leak early detection system was explicitly adapted to the customer's needs and wishes. Conspicuous behavior in SQL data queries is displayed in good time so that our customer can reliably check whether it is an internal data leak. In addition to improved protection against data theft, the solution offers increased transparency in database activities and IT management can be optimized accordingly. The program code has become the property of the customer so that internal IT specialists can use it freely.

TL;DR: The most important points in a nutshell

Customer benefits

  • Detection and prevention of data theft and industrial espionage
  • Avoidance of loss of image and trust
  • Avoidance of fines and loss of sales
  • Optimization of existing processes
  • Transparency in data movements
  • Strengthening employees' skills in handling sensitive data

The details

Who is it suitable for? For all companies that use sensitive data in their databases and have their own IT department.

Discover our digital solutions at

You too can benefit from our extensive experience with projects involving companies of all sizes.

Develop new services faster and analyze data flexibly

Increase customer satisfaction and scale services flexibly

Manage complex master data efficiently and flexibly

Identify risks early and reliably comply with legal requirements

Reduce packaging waste and ensure compliance with ESG requirements

Reduce processing times and adapt processes more quickly

Identify new business opportunities and minimize risks

Process inquiries more quickly and improve service quality over the long term

Modernize line-of-business applications faster and use resources efficiently

Coordinate waste disposal orders more easily and attract new customers

Provide flexible mobility services around the clock

Manage inquiries transparently and reduce response times

Minimize data protection risks and ensure reliable compliance

Strengthen customer loyalty and proactively prevent churn

Reduce system load and make production data available on mobile devices

Prevent data theft more quickly and minimize damage

Ensuring consistent standards in an efficient and transparent manner

Streamline planning processes and minimize sources of error

Deliver new services faster and scale systems flexibly

Distribute goods efficiently and reduce stockouts

Reduce processing times and streamline processes

Update fund data faster and manage releases more efficiently

Analyze complex health data faster and more easily

Reduce manual effort and scale processes more quickly

Treat wastewater efficiently and significantly reduce resource consumption

Accelerate communication and significantly reduce system costs

Efficiently manage transportation processes and reduce maintenance costs

HyExpert

Would you like to know how our customized IT solutions can help your business succeed? Do you have questions about specific digital challenges?

Get in touch with us—we’re here to help!