Estimation of conditional average treatment effects on distributed confidential data

TOP Literature Database Estimation of conditional average treatment effects on distributed confidential data

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2402.02672

PDF

https://arxiv.org/pdf/2402.02672

Paper Information

Author: Yuji Kawamata;Ryoki Motai;Yukihiko Okada;Akira Imakura;Tetsuya Sakurai
Published: 2-5-2024
Updated: 9-10-2024
Affiliation: Center for Artificial Intelligence Research, University of Tsukuba
Country: Japan
Conference: Expert Syst. Appl.

Labels Estimated by AI

Data Generation Simulation Result Evaluation Watermarking

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Estimation of conditional average treatment effects (CATEs) is an important topic in sciences. CATEs can be estimated with high accuracy if distributed data across multiple parties can be centralized. However, it is difficult to aggregate such data owing to confidential or privacy concerns. To address this issue, we proposed data collaboration double machine learning, a method that can estimate CATE models from privacy-preserving fusion data constructed from distributed data, and evaluated our method through simulations. Our contributions are summarized in the following three points. First, our method enables estimation and testing of semi-parametric CATE models without iterative communication on distributed data. Our semi-parametric CATE method enable estimation and testing that is more robust to model mis-specification than parametric methods. Second, our method enables collaborative estimation between multiple time points and different parties through the accumulation of a knowledge base. Third, our method performed equally or better than other methods in simulations using synthetic, semi-synthetic and real-world datasets.

External Datasets

synthetic data

semi-synthetic data from the infant dataset

financial assets dataset (SIPP dataset)

jobs dataset (Dehejia and Wahba's dataset)