Differences between RIF, LDS, and PUF Data Files:
Guidance for the Differences between RIF, LDS, and PUF Data Files web page
Issued by: Centers for Medicare & Medicaid Services (CMS)
Issue Date: January 01, 2020
Purpose
CMS offers data files that range from aggregate data to individual person-level data. This article describes the differences between the aggregate Public Use Files (PUFs), the Limited Data Sets (LDS), and the Research Identifiable Files (RIFs).
Public Use Files (PUFs), also called Non-Identifiable Data Files, have been edited and stripped of all information that could be used to identify individuals. In general, PUFs contain aggregate-level information.
Limited Data Set (LDS) files contain beneficiary-level protected health information (PHI) similar to that in the RIFs, and many RIFs have an LDS equivalent. LDS files are considered identifiable because of the potential to re-identify a beneficiary. The difference between RIF and LDS is that selected variables within the LDS files are blanked or ranged. LDS requests require a Data Use Agreement (DUA) but do not go through a Privacy Board review. LDS files are available as a 100% file or a 5% random sample file. The DUA-Limited Data Sets (LDS) page on the CMS website describes the ways in which the LDS files may be used.
Research Identifiable Files (RIFs) contain beneficiary-level protected health information (PHI). Requests for RIF data require a Data Use Agreement (DUA) and are reviewed by CMS’s Privacy Board to ensure that the beneficiary’s privacy is protected and that only the minimum data necessary are requested and justified. The Identifiable Data Files page of the CMS website provides information about the release of these data.
 | Public Use File | Limited Data Set | Research Identifiable File |
---|---|---|---|
Requires Privacy Board review? | No | No | Yes |
Requires a Data Use Agreement? | No | Yes | Yes |
Files include beneficiary-level data? | No | Yes | Yes |
Researchers may request customized cohorts (e.g., diabetics residing in MN)? | No | No | Yes |
Data can be linked at beneficiary level to non-CMS data using a beneficiary identifier? | No | No | Yes[1] |
Claim run-off period[2] | NA | Annual file: 6-month run-off; quarterly file: 3-month run-off | Annual file: 12-month run-off; quarterly file: 3-month run-off |
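As an illustration only, the comparison above can be encoded as a small lookup that shortlists file types for a project. The class, field, and function names below are hypothetical; the values are transcribed from the table, and the sketch is not a substitute for the CMS request process.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FileType:
    """One column of the comparison table (field names are hypothetical)."""
    name: str
    privacy_board_review: bool
    requires_dua: bool
    beneficiary_level: bool
    custom_cohorts: bool
    linkable_to_non_cms: bool


# Values transcribed from the comparison table above.
FILE_TYPES = [
    FileType("PUF", False, False, False, False, False),
    FileType("LDS", False, True, True, False, False),
    # RIF linkage to outside data additionally requires CMS approval (footnote 1).
    FileType("RIF", True, True, True, True, True),
]


def candidate_files(need_beneficiary_level=False,
                    need_custom_cohort=False,
                    need_linkage=False):
    """Return the names of file types that satisfy every stated need."""
    return [
        ft.name
        for ft in FILE_TYPES
        if (ft.beneficiary_level or not need_beneficiary_level)
        and (ft.custom_cohorts or not need_custom_cohort)
        and (ft.linkable_to_non_cms or not need_linkage)
    ]


print(candidate_files(need_beneficiary_level=True))
print(candidate_files(need_custom_cohort=True))
```

A project that needs beneficiary-level data but no custom cohort or outside linkage is left with both LDS and RIF as candidates; in that case the lighter review path for LDS (a DUA without Privacy Board review) may be the deciding factor.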
The RIF and LDS files both contain beneficiary-level data; however, some variables included in the RIF data may be presented differently (ranged or absent) in the LDS counterpart. See Table 2 for the key differences.
Variable | File | Limited Data Set | Research Identifiable File |
---|---|---|---|
Unique Beneficiary Identifier | Claims & Enrollment files | Encrypted identifier | Encrypted identifier |
Unique Beneficiary Identifier | MedPAR | No identifier | Encrypted identifier |
Health Insurance Claim (HIC) or Social Security Number (SSN) | Claims & Enrollment files | Not included in file | Included as an add-on with special permission only |
Dates (MM/DD/YYYY) | Claims files | Included as of CY2010[3] | Included |
Dates (MM/DD/YYYY) | MedPAR | Quarter and year only | Included |
Claim from date | Claims files | Not included | Included |
Claim through date | Claims files | Included | Included |
Beneficiary Zip Code[4] | Claims & Enrollment files | County and state | Included |
Beneficiary Zip Code | MedPAR | State only | Included |
Beneficiary Date of Birth | Claims, MedPAR & Enrollment files | Not included. Age year or age range[5] | Included |
Date of Death | Enrollment files | Included, for validated dates of death only[6][7] | Included |
NPI/UPIN for person level provider | Claims files | As of 2013, the real NPI is included[8] | Included |
NPI/UPIN for person level provider | MedPAR | Not included | Not included |
Facility provider number[9] | Claims files & MedPAR | Included | Included |
NPI of the facility | Claims files & MedPAR | Included | Included |
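To make "ranged" concrete, the sketch below shows how a full date can be reduced to a quarter and year, and how a date of birth can be reduced to an age in years. It only illustrates the kind of coarsening the table describes. The source does not state how CMS derives these values, so the function names, the `YYYYQn` label format, and the choice of reference date for age are all assumptions.

```python
from datetime import date


def quarter_and_year(d: date) -> str:
    """Reduce a full date to a quarter-and-year label (format is an assumption)."""
    quarter = (d.month - 1) // 3 + 1
    return f"{d.year}Q{quarter}"


def age_in_years(dob: date, as_of: date) -> int:
    """Completed years of age on the reference date (reference date is an assumption)."""
    had_birthday = (as_of.month, as_of.day) >= (dob.month, dob.day)
    return as_of.year - dob.year - (0 if had_birthday else 1)


print(quarter_and_year(date(2012, 8, 15)))
print(age_in_years(date(1940, 7, 1), date(2012, 6, 30)))
```

Analyses that need exact intervals, such as days between admission and discharge, cannot be reconstructed from values coarsened this way, which is one practical reason to check the table before choosing between LDS and RIF.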
NPI/UPINs are encrypted in the LDS files from 1999-2012. A free crosswalk is available on request. The crosswalk maps encrypted UPIN and NPI numbers to their unencrypted values back to 1999.
Please see the "Add LDS Files to an Existing LDS DUA" section of the Limited Data Set (LDS) Files page to order this file.
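Once the crosswalk has been obtained, applying it is a simple lookup. The following is a minimal sketch under stated assumptions: the column names (`encrypted_npi`, `npi`), the claim fields, and all identifier values are hypothetical placeholders, and the real record layout should be taken from the CMS documentation that accompanies the crosswalk.

```python
import csv
import io

# Hypothetical crosswalk extract; the real layout comes from CMS documentation.
CROSSWALK_CSV = """encrypted_npi,npi
ENC0001,0000000001
ENC0002,0000000002
"""

# Hypothetical LDS claim rows.
CLAIMS = [
    {"claim_id": "A1", "year": 2011, "provider_id": "ENC0001"},
    {"claim_id": "A2", "year": 2012, "provider_id": "ENC9999"},     # not in crosswalk
    {"claim_id": "A3", "year": 2014, "provider_id": "0000000002"},  # already a real NPI
]


def load_crosswalk(text):
    """Build an encrypted-to-real identifier mapping from CSV text."""
    reader = csv.DictReader(io.StringIO(text))
    return {row["encrypted_npi"]: row["npi"] for row in reader}


def resolve_npi(claim, crosswalk):
    """Return the real NPI for a claim, or None when it cannot be resolved."""
    if claim["year"] >= 2013:
        # As of 2013 the LDS claims files carry the real NPI.
        return claim["provider_id"]
    return crosswalk.get(claim["provider_id"])


crosswalk = load_crosswalk(CROSSWALK_CSV)
for claim in CLAIMS:
    print(claim["claim_id"], resolve_npi(claim, crosswalk))
```

Returning `None` for unmatched identifiers, rather than passing the encrypted value through, keeps unresolved providers from being mistaken for real NPIs downstream.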
[1] The inclusion of patient identifiers linkable to outside data requires CMS approval. Without this approval, the RIF patient identifiers are not linkable to outside data.
[2] More detailed information about the run-off periods and availability can be found in the RIF Medicare Quarterly Data article and the Medicare Limited Data Set (LDS) Quarterly Claims and Enrollment Data article.
[3] LDS files include dates as of 2010. For 2009 files, CMS provides the dates as a separate file. Prior to 2009, the files present dates as a quarter and year.
[4] The Medicare Current Beneficiary Survey (MCBS) LDS and Health Outcomes Survey (HOS) LDS files contain zip code, date of birth, and date of death.
[5] See footnote 4 above.
[6] See footnote 4 above.
[7] Based on a ResDAC analysis of the 2012 RIF Master Beneficiary Summary file, 4% of the death dates are not validated.
[8] NPI/UPINs are encrypted in LDS data files from 1999-2012. A free crosswalk file that identifies individual providers back to 1999 is available on request. See the Appendix for the record layout and ordering information.
[9] The facility provider number is also called the CMS Certification Number (CCN) or the Medicare provider number and identifies the institutional facility.