管理学Workshop:Perils of bias and scarcity: Overcoming challenges in Political Ideology Prediction from text data

发布日期：2023-11-17 00:00 来源：

Perils of bias and scarcity: Overcoming challenges in Political Ideology Prediction from text data

时间：2023年11月17日10：00

地点：承泽园333教室

Speaker: Chen Chen

Abstract:

Political Ideology Prediction (PIP) from text data is pivotal in policy evaluation, online marketing, and understanding firm strategy. However, development of Machine Learning (ML) models have been facing crucial challenges such as sparse self-reported labels and selection bias, as well as label bias, characterized by systematic distortion of observed labels from the ground truth. All these issues have severely limit the applicaton of advanced ML algorithms such as LLMs on PIP from texts. To address these issues, we designed two ML artifacts. The first artifact addresses sampling issues by decomposing document embeddings into a linear combination of a latent neutral context vector and a latent position vector. This semi-supervised model, predicting ideology solely on position vectors, significantly outperforms the SoTAs in accuracy, even with as little as 5% biased data. The second artifact, designed to address label biases, is based on a kernel of Mixture of Theories. Preliminary results show that it adapts universally in various context and aligns with most currently identified causes of biases, demonstrating promising potential for improving PIP.

Introduction of Speaker

Dr. Chen Chen is an Assistant Professor from the area of Information Systems at The Chinese University of Hong Kong, Shenzhen. After graduating from Tsinghua University, he proceed to obtain his first Ph.D. in Molecular Cancer Biology from Duke University in 2014 and second Ph.D. in Management from Boston University in 2020. His primary research interests include 1) using advanced deep learning algorithms and language models to decipher how human beings interact, behave and make decisions in both virtual and real communities; 2) understanding the dynamics of patient-doctor's interaction on online healthcare platforms; 3) knowledge engineering and knowledge iterpolation/extrapolation vis knowledge graph and Graph Neural Networks; 4) AI augmentation and its implication in management, AI alignment and governance, corporate AI strategy and its impact on personnel turnover. Dr. Chen has numerous publications in top-tier journals and computer science conferences including ISR, PNAS, Nature Cell Biology, and ACL proceedings.

首页

访问学者申请

EMAIL

北京大学

ENGLISH

管理学Workshop:Perils of bias and scarcity: Overcoming challenges in Political Ideology Prediction from text data

发布日期：2023-11-17 00:00 来源：

国家发展研究院官方微信

相关链接

中国经济研究中心

健康老龄与发展研究中心

新结构经济学研究院

全球健康发展研究院

能源安全与国家发展研究中心

人力资本与国家政策研究中心

中国卫生经济研究中心

学院概况展开 / 收起

组织架构展开 / 收起

历史回顾展开 / 收起

师资队伍展开 / 收起

教学项目展开 / 收起

硕博研究生展开 / 收起

本科生展开 / 收起

学术研究展开 / 收起

出版物展开 / 收起

科研发布展开 / 收起

科研项目展开 / 收起

科研成果展开 / 收起

讲座会议展开 / 收起

研究机构展开 / 收起

博士后流动站展开 / 收起

智库展开 / 收起

公益展开 / 收起

校友展开 / 收起

校友人物展开 / 收起

校友组织展开 / 收起

捐赠展开 / 收起

捐赠项目展开 / 收起

捐赠鸣谢展开 / 收起