On 30 May 2024, the Personal Information Protection Commission of the Republic of Korea (PIPC) announced that it had established five synthetic data generation reference models to help private researchers and companies generate and use synthetic data for machine learning and artificial intelligence development. The initiative is intended to support the safe creation of synthetic data that is free of the legal constraints attached to personal information, and it covers a range of data types, including oral images and blood sugar measurement information. The models were built using a four-step process: preparation, generation, verification of usefulness and safety, and utilisation. This process is meant to ensure that the synthetic data retains the statistical characteristics of the real data while protecting personal information.