Skip to content

Statistical processing

Contact info

Welfare and Health, Social Statistics
Jonas Kirchheiner-Rasmussen +45 39 17 34 93
+45 39 17 34 93

Get as PDF

Health Insurance Statistics

Data is received once a year from the Regions. It is assessed which services can be classified as contracts. Basic and practice cost fees are summed. For individuals with child identification, age and gender are imputed, and the person_id is set to unknown. Individuals with invalid CPR numbers are not included in the statistical tables. Corrections that cannot be associated with a registration in the respective year are deleted. Registrations where the variable SIKGRUP has the value 9 (Deceased) are deleted. Data is linked with background data from Statistics Denmark.

Source data

The primary source is LUNA. Additionally, there are supplementary sources regarding services from the tariff folders.

Internal sources:

  • The register of population statistics (family type, ancestry)
  • The register of income statistics (level of income) for the previous year
  • Register-based Labour Force Statistics (socio-economic status) as of November the previous year.

Frequency of data collection


Data collection


Data validation

The data received are compared with data from the previous year, and any major fluctuations examined to reassure quality. For the purpose of statistical production data are analyzed thoroughly.

Data compilation

Services that are not specified as supplementary services in the tariff folders are categorized as contracts. The number of contracts is calculated as the sum of the variable 'antal ydelser'' for the services categorized as contracts. Starting from 2006, gender and age imputation has been performed for the smaller group of children registered with child marking, and the person_id is set to unknown. Basic data (SSSY) is generated from the above data, and basic data is created, where the number of contacts (SSKO) and gross fees (SSHO) are aggregated at the person and service level. In this data, contacts with general practitioners are further divided into: daytime consultations, evening consultations, daytime telephone consultations, evening telephone consultations, daytime visits, evening visits, e-communication (including with municipal nursing staff), other services, and prevention, etc.

Before the data is uploaded to the Statbank Denmark, further data processing takes place. Individuals with invalid CPR numbers, individuals with child marking, and corrections (negative values for the service) that cannot be associated with a registration in the respective fiscal year are deleted. This means that corrections (negative entries) that do not match any registrations on the variables person_id, 'date of treatment' (date of service), 'specialty' (type of service), and "ydeltid" (time of service) are excluded. The first registered gender and the first registered age are used for individuals who appear in the data with multiple CPR numbers. Registrations where the variable 'SIKGRUP' (Health Insurance Group) has the value 9 (Deceased) are deleted. Health insurance data is linked with other data on family relationships, origin, socio-economic status, and income.


From 2005, the register was cleansed of observations for which there are no reimbursements via the public health insurance (the gross fee equals 0). This applies primarily to physiotherapy and dental treatment. Accordingly, data is assessed for 2005 both by the old method of assessment by which data is not cleansed, and the new assessment method by which data is cleansed.

There is a very small number of records where the contacts are negative. In 2022, there are 146,020 negative records (equivalent to 0.15pct. of all records). This is due to billing-related corrections in the registry, meaning corrections that are not made by Danmarks Statistik. Starting from 2021, corrections that cannot be linked to a registration in the respective year will be deleted.