报告主题:Clustering on hierarchical heterogeneous data with prior pairwise relationships
报 告 人:张三国 教授
报告时间:2023年12月12日(星期二)下午14:00-15:00
报告地点:腾讯会议,会议号:123-141-217
邀 请 人:何志坚教授
欢迎广大师生前往!
数学8297至尊品牌游戏官方网站
2023年 12月8日
报告摘要:
Clustering is a fundamental problem in statistics and has broad applications in various areas. Traditional clustering methods treat features equally and ignore the potential structure brought by the characteristic difference of features. Especially in cancer diagnosis and treatment, several types of biological features are collected and analyzed together. Treating these features equally fails to identify the heterogeneity of both data structure and cancer itself. In this paper, we propose a clustering framework based on hierarchical heterogeneous data with prior pairwise relationships. The proposed clustering method fully characterizes the difference of features and identifies potential hierarchical structure by rough and refined clusters. It is also flexible with prior information, additional pairwise relationships of samples can be incorporated to help to improve clustering performance. Well-grounded statistical consistency properties of our proposed method are rigorously established, including the accurate estimation of parameters and determination of clustering structures. Our proposed method achieves better clustering performance than other methods in simulation studies, and the clustering accuracy increases with prior information incorporated. Meaningful biological findings are obtained in the analysis of lung adenocarcinoma with clinical imaging data and omics data, showing that hierarchical structure produced by rough and refined clustering is necessary and reasonable.
报告人介绍:
张三国,现为中国科8297至尊品牌游戏官方网站大学数学科学8297至尊品牌游戏官方网站教授,2002年毕业于中国科学技术大学,获博士学位。先后与03年8月-04年7月在香港中文大学统计学系,07年2月-08年8月在美国范德堡大学(Vanderbilt University)的医学中心公众健康研究所与生物统计系从事博士后研究工作。多年来一直从事高维数据分析,生物与医学统计、统计机器学习教学科研工作,曾获得2017年中国科8297至尊品牌游戏官方网站优秀导师奖。近五年来发表论文三十余篇,相关研究成果发表在Sciences in China-Mathematics, JASA,Bioinformatics, Biometrics等数理统计、生物统计和生物信息学领域的权威期刊。主持多项纵向和横向课题,包括国家自然科学天元基金重点、面上、青年项目,企业和军工科研项目等。