Unsupervised Machine Learning for Explainable Health Care Fraud Detection /
Shubhranshu Shekhar, Jetson Leder-Luis, Leman Akoglu.
- Cambridge, Mass. National Bureau of Economic Research 2023.
- 1 online resource: illustrations (black and white);
- NBER working paper series no. w30946 .
- Working Paper Series (National Bureau of Economic Research) no. w30946. .
February 2023.
The US spends more than 4 trillion dollars per year on health care, largely conducted by private providers and reimbursed by insurers. A major concern in this system is overbilling, waste and fraud by providers, who face incentives to misreport on their claims in order to receive higher payments. In this work, we develop novel machine learning tools to identify providers that overbill insurers. Using large-scale claims data from Medicare, the US federal health insurance program for elderly adults and the disabled, we identify patterns consistent with fraud or overbilling among inpatient hospitalizations. Our proposed approach for fraud detection is fully unsupervised, not relying on any labeled training data, and is explainable to end users, providing reasoning and interpretable insights into the potentially suspicious behavior of the flagged providers. Data from the Department of Justice on providers facing anti-fraud lawsuits and case studies of suspicious providers validate our approach and findings. We also perform a post-analysis to understand hospital characteristics, those not used for detection but associate with a high suspiciousness score. Our method provides an 8-fold lift over random targeting, and can be used to guide investigations and auditing of suspicious providers for both public and private health insurance systems.
System requirements: Adobe [Acrobat] Reader required for PDF files. Mode of access: World Wide Web.
Other Bureaucracy • Administrative Processes in Public Organizations • Corruption Health Insurance, Public and Private Illegal Behavior and the Enforcement of Law Auditing