Articles
Causal Transformer Networks for Counterfactual Reasoning in Large-Scale Recommendation Systems
— Views
— Downloads
Abstract
Modern recommendation systems suffer from popularity bias, filter bubbles, and spurious correlations that degrade long-term user satisfaction. We introduce CausalRec, a Transformer-based architecture that integrates structural causal models into the attention mechanism, enabling counterfactual reasoning at inference time: "Would the user have clicked this item if it were not promoted on the homepage?" Deployed in a 28-day A/B test on a major e-commerce platform (430 million daily active users), CausalRec increases 30-day user retention by 3.8%, reduces popularity bias Gini coefficient by 22%, and improves content diversity by 31% while maintaining gross merchandise value (GMV) parity.
Author Biographies
-
Dawen Liang Netflix Research, Los Gatos, CA 95032, USADawen Liang is a professor at Netflix Research, Los Gatos, CA 95032, USA. Their research focuses on energy systems, with over 56 publications in peer-reviewed journals.
-
Peng Cui Department of Computer Science, Tsinghua University, Beijing 100084, ChinaPeng Cui is a research fellow at Department of Computer Science, Tsinghua University, Beijing 100084, China. Their research focuses on data analytics, with over 25 publications in peer-reviewed journals.
-
Wenjie Wang School of Computing, National University of Singapore, 117417, SingaporeWenjie Wang is an associate professor at School of Computing, National University of Singapore, 117417, Singapore. Their research focuses on social sciences, with over 50 publications in peer-reviewed journals.
How to Cite
Causal Transformer Networks for Counterfactual Reasoning in Large-Scale Recommendation Systems. (2026). Frontiers in Artificial Intelligence and Data Science, 1(3). https://doi.org/10.55001/faids.v1i3.80
Authors
Share
From This Issue
- Vision-Language Foundation Models for Zero-Shot Autonomous Driving Scene Understanding and Risk Assessment 2026-05-25
- Graph Neural Networks with Heterogeneous Message Passing for Multi-Scale Drug-Drug Interaction Prediction 2026-05-20
- Efficient Sparse Mixture-of-Experts Models for Multilingual Low-Resource Machine Translation 2026-05-14