学术沙龙
学术沙龙 当前位置: 首页 - 学术沙龙 - 正文

数学与统计学院百年校庆学术系列活动暨熊庆来大讲坛(2)

2023-03-10    点击:[]

Title: Differential Private Data Release for Mixed-type Data via Latent Factor Models

报告人: 张艳青

研究方向:应用统计、机器学习——差分隐私保护

Abstract:  The rise of big data processing, sharing and analysis makes the  protection of confidential information urgently necessary. Differential  privacy is a particular data privacy preserving technology which can  publish synthetic data or statistical analysis with a minimum disclo[1]sure  of private information of individual record. The tradeoff between  privacy-preserving and utility guarantee is always a challenge for  differential privacy technology, especially for synthetic data  generation. Moreover, mixed-type data containing continuous, ordinal  cate[1]gorical  and nominal data are becoming increasingly pervasive due to the rapid  development of various data collecting platforms. In this paper, we  propose a differential private data release algorithm for mixed-type  data with correlated dependency under the framework of latent factor  models. The proposed method can add a relatively small amount of noise  to synthetic data under the same level of privacy protection while  capturing correlation information. Moreover, the proposed algorithm can  generate synthetic data preserving the same data type as original data,  including categorical data, which greatly improves the utility of  synthetic data. The key idea of our method is to partially perturb the  projection of original data on perturbed eigenvector space to construct a  synthetic data generation model, and to utilize link functions between  discrete variables and continuous variables to ensure consistency of  synthetic data type with original data. The proposed method can generate  differentially private synthetic data at low computation cost even when  the origi[1]nal  data is high-dimensional. In theory, we establish differentially  private properties of the proposed method and upper bound on the utility  of synthetic data. Our numerical studies also demonstrate superb  performance of the proposed method on the utility guarantee of the  privacy-preserving data released.



报告题目:A Bridge Between Radar Signal Detection Problems and Mathematics and Statistics

报告人:荣尧

研究方向:统计学、数学与信息科学交叉学科

摘要:Radar  technology plays a critical role in numerous industries, such as  military and defense, navigation, aerospace and aviation, automotive,  remote sensing, and meteorology. Signal detection is one of the primary  functions of radar. This report aims to establish a strong link between  radar signal detection and mathematics and statistics by presenting our  research findings.