This describes all of my primary Medicare Advantage (MA) datasets and code files. Project-specific uses of these data are treated as their own repositories and will be cited here over time. As always, this is a living document and changes regularly.
Master Do File
Each of the relevant data sources are available individually and discussed in more detail below. Just scroll down to the Raw Data Sources. All of these files can also be called as part of a master file, _BuildFinalData.do. I name these files so that they appear (by default) in Windows folders in a way that makes sense to me. So the master do file begins with an underscore, and the individual do files are numbered. This is just personal preference but helps me to keep things in order.
Raw Data Sources
All of the raw data are publicly available from the Centers for Medicare and Medicaid Services (CMS) website, although navigating the different pages can be cumbersome. There is more information available than what I discuss here, but this is sufficient to create a panel of MA plans/contracts/insurers by county over several years, complete with plan characteristics, quality ratings, and plan enrollments.
Below, I introduce files in the order in which I've found things to be easiest to work with, but there's no magic order that needs to be followed.
The monthly enrollment data by contract/plan/state/county (CPSC) are the most comprehensive enrollment files, providingplan-level enrollments each month. Consistent with CMS data restrictions, the data are only provided for plans and months in which at least 11 beneficiaries enrolled in the plan. In all other cases, the enrollment data are indicated as missing.
Stata .do file to organize CPSC enrollment data is available here: 1_Plan_Data.do
The MA Landscape Files provide data on several plan characteristics. Note that there are separate folders for stand-along prescription drug plans (PDPs), special needs plans (SNPs), MA plans, and MA+PD plans. The accompanying code files focus only on MA and MA+PD plans. Also note that another source for similar information is in the Plan Benefits Data files. Those files are more comprehensive but also significantly harder to work with. I've found the landscape files to have all of the most important information, as well as being much more accessible.
Stata .do file to organize plan premiums and other characteristics is available here: 2_Plan_Characteristics.do
The MA Contract Service Area files provide a list of contracts and the counties in which the contracts are approved to operate. Contracts may have enrollees in counties for which the contract is not approved due, for example, to beneficiaries that moved from one county to another and maintained their old plan. The files are available monthly. I usually look across months in a year for any month in which a contract was approved. If such a month exists, then I flag that contract/county as "approved" and ultimately merge those approved contract/counties with other plan-level data.
Stata .do file to organize the service area files is available here: 3_Service_Areas.do
The MA penetration files provide the overall MA share for each county and month. These data are useful since plan-level enrollments are masked for plans with too few enrollees. As such, summing the plan-level enrollments will always underestimate the total number of MA enrollees in a county. The MA penetration files therefore provide a direct measure of the size of the MA market in each county. Note that these data are only available from CMS beginning in 2008.
Stata .do file to organize the penetration files is available here: 4_Penetration_Files.do
The MA star ratings data are available as part of the Part C and D Performance Data. The data are not in a good analytical format and the variable names differ over time. In the associated code files, I've renamed variables so that sufficiently similar data are given the same name across all years of data.
Stata .do file to organize the star ratings data is available here: 5_Star_Ratings.do
CMS pays MA plans based on their bid and the county-level benchmarks. The final risk-adjusted payments (per beneficiary) are available on the CMS website. Unfortunately, the most recent data available are from 2015. Hopefully CMS will publish the 2016 and 2017 data soon.
Stata .do file to organize the CMS payment and rebate data is available here: 6_Risk_Rebates.do
CMS sets benchmarks for each county, to which plan bids are assessed when determining final premiums. These benchmark formulas are adjusted regularly but are legislatively defined. Details of the benchmarks and underlying data are available at the link above.
Stata .do file to organize the benchmark rates is available here: 7_MA_Benchmark.do
CMS also publishes average fee-for-service (FFS) costs for each county, which is sometimes useful as a proxy for insurer's costs in MA (although the patient population remains very different between Traditional Medicare FFS and Medicare Advantage). Data specifically on the 65+ population was available through 2014, but CMS now only provides the combined "aged" and disabled enrollment and reimbursement figures. Due to the change in data from 2014 to 2015, I use the total (aged+disabled) enrollment and reimbursement figures when working with data that spans before and after 2015.
Stata .do file to organize the CMS FFS cost data is available here: 8_FFS_Costs.do