邮箱登录 | 所务办公 | 收藏本站 | English | 中国科学院
 
首页 计算所概况 新闻动态 科研成果 研究队伍 国际交流 技术转移 研究生教育 学术出版物 党群园地 创新文化 科学传播
新闻动态
计算所新闻
学术活动
科研动态
媒体文摘
IT动态导读
现在位置:首页 > 新闻动态 > 学术活动
数据流挖掘:主动标签,噪声清洗及模糊学习
2010-03-05 | 【 【打印】【关闭】

时间:2010年3月8号(星期一)下午2:30

地点:四层报告厅

报告人:Dr. Xingquan Zhu, University of Technology, Sydney

摘要:In this talk, I will summarize a number of steam data mining problems, Active Labeling, Cleansing, and Vague Learning, we have addressed in recent years. For active labeling, we consider that labeling all stream data is expensive and impractical, and our objective is to label a small portion of stream data from which a model is derived to predict newly arrived instances as accurate as possible. For data streams containing incorrectly labeled training samples, we propose a Maximum Variance Margin principle to accurately identify and remove mislabeled data, such that the prediction models built from the cleansed streams can be more accurate than the ones trained from the raw noisy streams. For vague learning in data streams, we allow users to label instance groups, instead of single instances, as positive samples for learning. Experimental results on synthetic and real-world data demonstrate the performances of the proposed efforts in comparison with other simple approaches.

报告人简介:Xingquan Zhu received his PhD degree in Computer Science from Fudan University, Shanghai China, in 2001. He is currently an Associate Professor of the Faculty of Engineering and Information Technology, University of Technology, Sydney (UTS), Australia. Before joining the UTS, he was a tenure track Assistant Professor in the Department of Computer Science & Engineering, Florida Atlantic University, Boca Raton FL, USA, and a Research Assistant Professor in the Department of Computer Science, University of Vermont, Burlington VT, USA. Since 2000, he has published more than 100 referred journal and conference proceedings papers in these areas. Dr. Zhu is an Associate Editor of the IEEE Transactions on Knowledge and Data Engineering (2009- ), and a Program Committee Co-Chair for the 9th International Conference on Machine Learning and Applications (ICMLA 2010).

 
网站地图 | 联系我们 | 意见反馈 | 所长信箱
 
欢迎访问中国科学院计算技术研究所 京ICP备05002829号 京公网安备1101080060号
地址:北京海淀区中关村科学院南路6号 邮编:100190 电话:010-62601166 邮箱:xuanchuanban@ict.ac.cn