Robert Mueller, former Director of the FBI famously stated: "There are only two types of companies: those that have been hacked and those that will be." This quote reflects the asymmetry in cybersecurity where a defender must be correct 100% of the time, while an adversary only has to get lucky once. Consequentially, protecting the ever growing amount of (sensitive) data from cyber threats is challenging and of paramount importance.
By integrating generative AI to produce synthetic datasets, we can effectively obfuscate real, sensitive information rendering it useless to adversaries (even after a data breach) while maintaining the functionality of the database. The primary goals of this research are:
Suggested reading:
Obfuscation Techniques In Cloud Computing: A Systematic Survey.
Generating synthetic data with differentially private LLM inference
A Survey of Synthetic Data Generation for Machine Learning
Achieving Secure, Scalable, and Fine-grained Data Access Control in Cloud Computing
Supervisors: Nasim Nezhadsistani, Andy Aidoo
back to the main page