
Statistical processing

Contact info

Welfare and Health, Social Statistics
Jonas Kirchheiner-Rasmussen +45 39 17 34 93


Health Insurance Statistics

Data is received once a year from the Regions. Each service is assessed to determine whether it can be classified as a contact. Basic fees and practice cost fees are summed. For individuals registered with child marking, age and gender are imputed, and the person_id is set to unknown. Individuals with invalid CPR numbers are not included in the statistical tables. Corrections that cannot be associated with a registration in the respective year are deleted, as are registrations where the variable SIKGRUP has the value 9 (Deceased). Finally, data is linked with background data from Statistics Denmark.

Source data

The primary source is LUNA. Additionally, there are supplementary sources regarding services from the tariff folders.

Internal sources:

  • The register of population statistics (family type, ancestry)
  • The register of income statistics (level of income) for the previous year
  • Register-based Labour Force Statistics (socio-economic status) as of November of the previous year

Frequency of data collection


Data collection


Data validation

The data received are compared with data from the previous year, and any major fluctuations are examined to assure quality. The data are also analyzed thoroughly as part of the statistical production.
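The year-on-year comparison described above could be sketched as follows. This is only an illustration: the function name `flag_fluctuations`, the 10 pct. threshold, and the example totals are assumptions and do not come from the actual validation procedure.

```python
def flag_fluctuations(current, previous, threshold=0.10):
    """Return {category: relative change} for categories whose total moved
    by more than `threshold` (an assumed cut-off) since the previous year."""
    flagged = {}
    for key in sorted(set(current) | set(previous)):
        now = current.get(key, 0)
        before = previous.get(key, 0)
        if before == 0:
            # No baseline to compare with: flag any non-zero new total.
            if now != 0:
                flagged[key] = float("inf")
            continue
        change = (now - before) / before
        if abs(change) > threshold:
            flagged[key] = change
    return flagged


# Made-up totals per type of service, for illustration only.
previous_year = {"general practice": 1000, "dentist": 500, "physiotherapy": 200}
current_year = {"general practice": 1040, "dentist": 350, "physiotherapy": 200}

flagged = flag_fluctuations(current_year, previous_year)
print(flagged)
```

In this made-up example only the dentist total is flagged, since it falls by 30 pct. while the other categories change by 4 pct. or less.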

Data compilation

Services that are not specified as supplementary services in the tariff folders are categorized as contacts. The number of contacts is calculated as the sum of the variable 'antal ydelser' (number of services) for the services categorized as contacts. Since 2006, gender and age have been imputed for the smaller group of children registered with child marking, and their person_id is set to unknown.

Basic data (SSSY) is generated from the above data, along with basic data in which the number of contacts (SSKO) and gross fees (SSHO) are aggregated at the person and service level. In these data, spec2 for contacts with general practitioners is further divided into the following categories:

  • daytime consultation
  • evening consultation
  • daytime telephone consultation
  • evening telephone consultation
  • daytime visit
  • evening visit
  • e-communication (including with municipal care staff)
  • other services and prevention, etc.

Additionally, an indicator for the basic fee is calculated as the sum of the basic and practice cost fees, distributed among group 1 insured persons who have received services from general practitioners (excluding persons registered with child marking).

From 2023, SSSY includes a variable, spec80, that indicates the above division. The category "Other services and prevention, etc." does not appear in SSSY, but the algorithm can be obtained by contacting the person responsible for the statistics. Likewise, the basic fee is not specified in SSSY.
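As a rough illustration of the contact count and the aggregation at the person and service level, the following sketch sums 'antal ydelser' for services that are not listed as supplementary. The field names, the placeholder service codes, and the choice to sum gross fees for all services are assumptions made for the example, not the production logic.

```python
from collections import defaultdict

# Hypothetical placeholder code; the real list comes from the tariff folders.
SUPPLEMENTARY_CODES = {"SUPP_X"}


def aggregate(records, supplementary_codes=SUPPLEMENTARY_CODES):
    """Sum contacts and gross fees per (person_id, service_code)."""
    totals = defaultdict(lambda: {"contacts": 0, "gross_fee": 0.0})
    for rec in records:
        row = totals[(rec["person_id"], rec["service_code"])]
        # Assumption: gross fees are summed for every service.
        row["gross_fee"] += rec["gross_fee"]
        # Only services not listed as supplementary count as contacts.
        if rec["service_code"] not in supplementary_codes:
            row["contacts"] += rec["antal_ydelser"]
    return dict(totals)


# Made-up registrations, for illustration only.
records = [
    {"person_id": "P1", "service_code": "SVC_A", "antal_ydelser": 2, "gross_fee": 300.0},
    {"person_id": "P1", "service_code": "SVC_A", "antal_ydelser": 1, "gross_fee": 150.0},
    {"person_id": "P1", "service_code": "SUPP_X", "antal_ydelser": 1, "gross_fee": 50.0},
    {"person_id": "P2", "service_code": "SVC_A", "antal_ydelser": 1, "gross_fee": 150.0},
]

totals = aggregate(records)
for key, row in sorted(totals.items()):
    print(key, row)
```

Here person P1 ends up with 3 contacts for the ordinary service and 0 contacts for the supplementary one, while the gross fee is retained for both.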

Before the data is uploaded to StatBank Denmark, further data processing takes place. Individuals with invalid CPR numbers, individuals with child marking, and corrections (negative values for the service) that cannot be associated with a registration in the respective fiscal year are deleted. This means that corrections (negative entries) that do not match any registration on the variables person_id, 'date of treatment' (date of service), 'specialty' (type of service), and 'ydeltid' (time of service) are excluded. All deleted observations are assigned a value of 0 for the variable statpop in SSSY. For individuals who appear in the data with multiple CPR numbers, the first registered gender and the first registered age are used. Registrations where the variable 'SIKGRUP' (Health Insurance Group) has the value 9 (Deceased) are deleted. Health insurance data is linked with other data on family relationships, origin, socio-economic status, and income.
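The deletion rules above can be sketched as a flagging step. Several details are assumptions made for the example: the field names, that a correction is a record with a negative 'antal ydelser', that kept records get statpop = 1, and that SIKGRUP = 9 registrations are flagged in the same way as the other deletions.

```python
# Matching variables named in the text; the field names are assumed.
KEYS = ("person_id", "treatment_date", "specialty", "ydeltid")


def mark_statpop(records):
    """Return copies of the records with statpop = 1 (kept) or 0 (deleted)."""
    # Keys of ordinary (positive) registrations that a correction can match.
    registered = {
        tuple(r[k] for k in KEYS) for r in records if r["antal_ydelser"] > 0
    }
    out = []
    for r in records:
        keep = True
        if r["sikgrup"] == 9:  # Deceased
            keep = False
        elif not r["valid_cpr"] or r["child_marking"]:
            keep = False
        elif r["antal_ydelser"] < 0 and tuple(r[k] for k in KEYS) not in registered:
            keep = False  # correction with no matching registration
        out.append({**r, "statpop": 1 if keep else 0})
    return out


def rec(person_id, antal, **overrides):
    """Build a made-up record; all values are placeholders."""
    base = {
        "person_id": person_id, "treatment_date": "2023-03-01",
        "specialty": "GP", "ydeltid": "day", "antal_ydelser": antal,
        "sikgrup": 1, "valid_cpr": True, "child_marking": False,
    }
    base.update(overrides)
    return base


records = [
    rec("P1", 1),                      # ordinary registration
    rec("P1", -1),                     # correction matching the line above
    rec("P2", -1),                     # correction without a registration
    rec("P3", 1, sikgrup=9),           # deceased
    rec("P4", 1, child_marking=True),  # child marking
    rec("P5", 1, valid_cpr=False),     # invalid CPR number
]

flags = [r["statpop"] for r in mark_statpop(records)]
print(flags)
```

Only the first two records keep statpop = 1 in this example: the registration for P1 and the correction that matches it on all four variables.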


Since 2005, the register has been cleansed of observations for which there is no reimbursement via the public health insurance (the gross fee equals 0). This applies primarily to physiotherapy and dental treatment. Accordingly, data for 2005 is assessed both by the old method, in which data is not cleansed, and by the new method, in which data is cleansed.
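A minimal sketch of the cleansing step, and of assessing the same year by both methods, might look like this. The field names and the three example records are made up for illustration.

```python
def cleanse(records):
    """Drop observations with no reimbursement (gross fee equal to 0)."""
    return [r for r in records if r["gross_fee"] != 0]


# Made-up observations for a single year, for illustration only.
records_2005 = [
    {"service": "physiotherapy", "gross_fee": 0.0},
    {"service": "dental treatment", "gross_fee": 0.0},
    {"service": "general practice", "gross_fee": 140.0},
]

old_method = len(records_2005)           # not cleansed
new_method = len(cleanse(records_2005))  # cleansed
print(old_method, new_method)
```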

A very small share of records has a negative number of contacts. In 2023, there are 172,310 negative records (equivalent to 0.17 pct. of all records). These stem from billing-related corrections in the register, that is, corrections that are not made by Statistics Denmark. Since 2021, corrections that cannot be linked to a registration in the respective year have been deleted before the statistics are compiled for StatBank Denmark.