These labels were automatically added by AI and may be inaccurate. For details, see About Literature Database.
Abstract
Estimation of conditional average treatment effects (CATEs) is an important
topic in sciences. CATEs can be estimated with high accuracy if distributed
data across multiple parties can be centralized. However, it is difficult to
aggregate such data owing to confidential or privacy concerns. To address this
issue, we proposed data collaboration double machine learning, a method that
can estimate CATE models from privacy-preserving fusion data constructed from
distributed data, and evaluated our method through simulations. Our
contributions are summarized in the following three points. First, our method
enables estimation and testing of semi-parametric CATE models without iterative
communication on distributed data. Our semi-parametric CATE method enable
estimation and testing that is more robust to model mis-specification than
parametric methods. Second, our method enables collaborative estimation between
multiple time points and different parties through the accumulation of a
knowledge base. Third, our method performed equally or better than other
methods in simulations using synthetic, semi-synthetic and real-world datasets.