European Commission

SILC_ESQRS_A_BE_2011_0000 - Version 1

National Reference Metadata in ESS Standard for Quality Reports Structure (ESQRS)

Compiling agency: Statistics Belgium

Time Dimension: 2011-A0

Data Provider: BE1

Data Flow: SILC_ESQRS_A

For any question on data and metadata, please contact: EUROPEAN STATISTICAL DATA SUPPORT

1. Contact

Top

1.1. Contact organisation

Statistics Belgium

1.2. Contact organisation unit

DTS

1.5. Contact mail address

2. Introduction

Top

The production of quality reports is part of the implementation of the EU-SILC instrument. In order to assess the quality of data at national level and to make a comparison among countries, the National Statistics Institutes are asked to report detailed information mainly on: the entire statistical process, sampling and non-sampling errors, and potential deviations from standard definition and concepts.

This document follows the ESS standard for quality reports structure (ESQRS), which is the main report structure for reference metadata related to data quality in the European Statistical System. It is a metadata template, based on 13 main concepts, which can be used across several statistical domains with the purpose of a better harmonisation of the quality reporting requirements in the ESS.

For that reason the template of this document differs from that one stated in the Commission Reg. 28/2004.

Finally it is the combination of the previous intermediate and final quality reports therefore it is worth mentioning that it refers to both the cross sectional and the longitudinal data.

3. Quality management - assessment

Top

Not requested by Reg. 28/2004

4. Relevance

Top

4.1. Relevance - User Needs

Not requested by Reg. 28/2004

4.2. Relevance - User Satisfaction

Not requested by Reg. 28/2004

4.3. Completeness

Not requested by Reg. 28/2004

4.3.1. Data completeness - rate

Not requested by Reg. 28/2004

5. Accuracy and reliability

Top

5.1. Accuracy - overall

In terms of precision requirements, the EU-SILC framework regulation as well the Commission Regulation on sampling and tracing rules refers respectively, to the effective sample size to be achieved and to representativeness of the sample. The effective sample size combines sample size and sampling design effect which depends on sampling design, population structure and non-response rate.

5.2. Sampling error

EU-SILC is a complex survey involving different sampling design in different countries. In order to harmonize and make sampling errors comparable among countries, Eurostat (with the substantial methodological support of Net-SILC2) has chosen to apply the "linearization" technique coupled with the “ultimate cluster” approach for variance estimation. Linearization is a technique based on the use of linear approximation to reduce non-linear statistics to a linear form, justified by asymptotic properties of the estimator. This technique can encompass a wide variety of indicators, including EU-SILC indicators. The "ultimate cluster" approach is a simplification consisting in calculating the variance taking into account only variation among Primary Sampling Unit (PSU) totals. This method requires first stage sampling fractions to be small which is nearly always the case. This method allows a great flexibility and simplifies the calculations of variances. It can also be generalized to calculate variance of the differences of one year to another .

The main hypothesis on which the calculations are based is that the "at risk of poverty" threshold is fixed. According to the characteristics and availability of data for different countries we have used different variables to specify strata and cluster information. In particular, countries have been split into four groups:

1)BE, BG, CZ, IE, EL, ES, FR, IT, LV, HU, NL, PL, PT, RO, SI, UK and HR whose sampling design could be assimilated to a two stage stratified type we used DB050 (primary strata) for strata specification and DB060 (Primary Sampling Unit) for cluster specification;

2) DE, EE, CY, LT, LU, AT, SK, FI, CH whose sampling design could be assimilated to a one stage stratified type we used DB050 for strata specification and DB030 (household ID) for cluster specification;

3) DK, MT, SE, IS, NO, whose sampling design could be assimilated to a simple random sampling, we used DB030 for cluster specification and no strata;

In case Eurostat methodology is not accepted by your country, please describe the methodology used at national level for computing the estimates.

5.2.1. Sampling error - indicators

	AROPE			At risk of poverty (60%)			Severe Material Deprivation			Very low work intensity
	Ind. value	Stand. errors	Half CI (95%)	Ind. value	Stand. errors	Half CI (95%)	Ind. value	Stand. errors	Half CI (95%)	Ind. value	Stand. errors	Half CI (95%)
Total	21	0.96	1.9	15.3	0.85	1.7	5.7	0.53	1	13.7	0.85	1.7
Male	20,4	1.06	2.1	14.6	0.92	1.8	5.9	0.63	1.2	13.2	0.92	1.8
Female	21,5	0.98	1.9	16	0.88	1.7	5.4	0.5	1	14.3	0.93	1.8
Age0-17	23.3	1.69	3.3	18.7	1.58	3.1	8.2	1.01	2	13.9	1.42	2.8
Age18-64	20	0.95	1.9	12.9	0.8	1.6	5.6	0.52	1	13.6	0.74	1.4
Age 65+	21.6	1.32	2.6	20.2	1.29	2.5	2.6	0.44	0.9

5.3. Non-sampling error

Non-sampling errors are basically of 4 types:

Coverage errors: errors due to divergences existing between the target population and the sampling frame.
Measurement errors: errors that occur at the time of data collection. There are a number of sources for these errors such as the survey instrument, the information system, the interviewer and the mode of collection
Processing errors: errors in post-data-collection processes such as data entry, keying, editing and weighting
Non-response errors: errors due to an unsuccessful attempt to obtain the desired information from an eligible unit. Two main types of non-response errors are considered:

– Unit non-response: refers to absence of information of the whole units (households and/or persons) selected into the sample

– Item non-response: refers to the situation where a sample unit has been successfully enumerated, but not all required information has been obtained

5.3.1. Coverage error

Coverage errors include over-coverage, under-coverage and misclassification:

Over-coverage: relates either to wrongly classified units that are in fact out of scope, or to units that do not exist in practice
Under-coverage: refers to units not included in the sampling frame
Misclassification: refers to incorrect classification of units that belong to the target population

In Belgium, the sampling frame is the Central Population Register.

As there was a period of one month between the drawing of households and the survey itself, over-coverage, under-coverage and misclassification could be happen.

Over-coverage: Persons who died before the survey. Households who moved outside Belgium before the survey. Address is not the principal residence.

Under-coverage: Immigrants who came in Belgium before the survey. Persons who moved from a household to create a new household. Diplomats exempt from an inscription in the national register. Refugees on a waiting list.

Misclassification: Household who moved from a region in Belgium to another region of Belgium.

The size of coverage errors is not available but it was obviously small.

5.3.1.1. Over-coverage - rate

Main problems

Size of error

Cross sectional

data

·Over-coverage

·Under-coverage

·Misclassification

5.3.2. Measurement error

Cross sectional data

Source of measurement errors

Building process of questionnaire

Interview training

Quality control

-survey instrument

-information system

-interviewer

-mode of collection (CAPI interview)

The questionnaire of the SILC2011 survey is the result of several steps:

-For building up the questionnaire we took the blue print questionnaire of Eurostat as the basis (documents SILC055, SILC065 and EU-SILC65/02 Addendum II). The order of the questions and the groups (themes of) questions is taken from this blue print. The majority of the questions are almost literally copied (and translated), other questions are changed, however, because experiences in Belgium gave better results posing the questions in another way (The questionnaires were developed in collaboration with the universities that have the experience of the ECHP/PSBH project in Belgium).

-After each survey an evaluation of the questionnaire was made (detection of the problematic or difficult to answer questions based on the comments of the interviewers and on a study of the item non-response). When building up the SILC2011 questionnaire we took account of this evaluation.

Interview training (Number of training days and information on the intensity and efficiency of interview training)

Overall we had the impression that the working-experience of the interviewers with EU-SILC starts to pay of. In our opinion the basis data has improved since 2010. All new interviewers have to follow a two day formation. All trained interviewers followed a formation for an hour and half.

They both had to complete a test-interview before they could download their data. So we can be sure they can completely manage the use of the PC and that they know the questionnaire before they go on the field.

A training group for new interviewers consisted of minimum 5 to maximum 20 interviewers, and according to the size of the training group there were 1 or 2 trainers.

Even though the accent was given to the practical side of the training (getting to know the questions and mastering the CAPI-program by imitating interview situations), three manuals were distributed and explained during the training:

-A general manual (‘Manuel general aux enquêteurs’) containing information about the objectives of the survey, the organisation of the survey, legal and administrative aspects around the survey, fieldwork aspect (how to contact the household, how to introduce oneself, who answers which questions, time delays, …) and the content of the questionnaires.

-A second manual (‘Manuel contenu’) with all kinds of additional explanations and examples for certain questions/answers.

-A third manual (‘Manuel CAPI’) about the use of the portable PC for the SILC Computer Assisted Personal Interviews and about the data entry program itself.

The first day of the training there was half a day for learning about and discussing the first two manuals. In the afternoon the trainees received their laptop and got to know the survey and the tool to carry out the interview in practice. One test-interview was simulated collectively. The second day of the training a small part of the time was dedicated to testing to send the data electronically after carrying out the interview. All the rest of the day interviewers practiced several interviews and interview situations with each other on the basis of household profiles that were given. There was also a lot of time for questions and discussions in between these test-interviews.

At the end of the training sessions the instructors had a good image on the degree in which each interviewer ameliorated during the training and on the degree in which they mastered the work. For certain interviewers two days of training was more than enough to master the work, for others it was necessary that they practiced some more at home on specific aspects of carrying out this survey (for example using of the CAPI-program itself, working on the content of the survey, …). They were recommended to do so before carrying out their first real interview. They were often also recommended to start interviewing one-person households.

A training group for trained interviewers consisted maximum 30 interviewers with two trainers. The accent was also given on the content: questions that changed, the module 2011 and questions, which are misunderstood by the interviewers. We made an extra manual for trained interviewers. The trained interviewers obtained four manuals, the same three as the new interviewers, and a fourth (‘Modifications du questionnaire : module 2011’) about the module, changed questions and questions misunderstood by the interviewers.

• Skills testing before starting the fieldwork

Interviewers were selected from the interviewer database that Statistics Belgium has centralised for all the survey’s that are carried out by the institute. For each interviewer a basic curriculum vitae is present in the database (mentioning for example for which surveys they have experience, their language knowledge, their knowledge of pc, …). A specific unit at Statistics Belgium (‘Unité Corps Enquêteurs’) is occupied with the selection of the interviewers for each survey; they have good contact with and knowledge of the interviewers. They try to find the best interviewer for each of the geographical areas to cover for SILC. This is not always an easy task because for certain geographical areas several interviewers are candidate, but for other geographical unit there are few or no candidates. Note that interviewers in Belgium most often carry out this work as a second or casual occupation.

• Skills control during the fieldwork

During the fieldwork we controlled the work of the interviewers by looking at some of their completed questionnaires. We gave extra attention to all new interviewers and to some trained interviewers that we suspected to be less accurate. Remarks (positive as negative) resulting from these controls were immediately communicated to the interviewer so they could improve their way of working and interviewing.

• Number of households by interviewer

Groups of secondary units consisted of about 35 households, depending on the strata. Most of the interviewers had one group of households. Nevertheless several interviewers also had more groups.

5.3.3. Non response error

Non-response errors are errors due to an unsuccessful attempt to obtain the desired information from an eligible unit. Two main types of non-response errors are considered:

1) Unit non-response which refers to the absence of information of the whole units (households and/or persons) selected into the sample. According the Commission Regulation 28/2004:

Household non-response rates (NRh) is computed as follows:

NRh=(1-(Ra * Rh)) * 100

Where Ra is the address contact rate defined as:

Ra= Number of address successfully contacted/Number of valid addresses selected

and Rh is the proportion of complete household interviews accepted for the database

Rh=Number of household interviews completed and accepted for database/Number of eligible households at contacted addresses

Individual non-response rates (NRp) will be computed as follows:

NRp=(1-(Rp)) * 100

Where Rp is the proportion of complete personal interviews within the households accepted for the database

Rp= Number of personal interview completed/Number of eligible individuals in the households whose interviews were completed and accepted for the database

Overall individual non-response rates (*NRp) will be computed as follows:

*NRp=(1-(Ra * Rh * Rp)) * 100

For those Members States where a sample of persons rather than a sample of households (addresses) was selected, the individual non-response rates will be calculated for ‘the selected respondent’, for all individuals aged 16 years or older and for the non-selected respondent.

2) Item non-response which refers to the situation where a sample unit has been successfully enumerated, but not all the required information has been obtained.

5.3.3.1. Unit non-response - rate

Cross sectional data
Address contact rate (Ra)*		Complete household interviews (Rh)*		Complete personal interviews (Rp)*		Household Non-response rate (NRh)*		Individual non-response rate (NRp)*		Overall individual non-response rate (NRp)*
A*	B*	A*	B*	A*	B*	A*	B*	A*	B*	A*	B*
0.9843	0.9167	0.6427	0.4327	0.9777	0.9626	36.74	57.74	2.23	3.73	38.15	59.23

* All the formulas are defined in the Commission Regulation 28/2004, Annex II

A* = Total sample; B = * New sub-sample

5.3.3.2. Item non-response - rate

The computation of item non-response is essential to fulfil the precision requirements concerning publication as stated in the Commission Regulation No 1982/2003. Item non-response rate is provided for the main income variables both at household and personal level.

5.3.3.2.1. Item non-response rate by indicator

	Total hh gross income (HY010)	Total disposable hh income (HY020)	Total disposable hh income before social transfers other than old-age and survivors benefits (HY022)	Total disposable hh income before all social transfers (HY023)
% of household having received an amount	99.92	99.93	97.87	96.13
% of household with missing values (before imputation)	13.73	10.99	9.06	4.28
% of household with partial information (before imputation)	56.49	74.3	77.58	83.01

	Imputed rent (HY030)	Income from rental of property or land (HY040)	Family/ Children related allowances (HY050)	Social exclusion payments not elsewhere classified (HY060)	Housing allowances (HY070)	Regular inter-hh cash transfers received (HY080)	Interest, dividends, profit from capital investments in incorporated businesses (HY090)
% of household having received an amount	78.95	8.78	35.31	1.93	0.63	7.51	66.85
% of household with missing values (before imputation)		8.67	1.25	0.88	59.46	2.03	71.37
% of household with partial information (before imputation)	100	0.19	8			0.9

	Cash or near-cash employee income (PY010)	Other non-cash employee income (PY020)	Income from private use of company car (PY021)	Employers social insurance contributions (PY030)	Cash profits or losses from self-employment (PY050)	Value of goods produced for own consumption (PY070)	Unemployment benefits (PY090)	Old-age benefits (PY100)	Survivors benefits (PY110)	Sickness benefits (PY120)	Disability benefits (PY130)	Education-related allowances (PY140
% of household having received an amount	47.03	22.6	4.71	46.18			12.48	18.84	3.43	2.56	3.98	2.33
% of household with missing values (before imputation)	2.24	67.04	22.78		6.17		6.01	30.28	18.07	1.37	3.73	6.37
% of household with partial information (before imputation)	1.67	19.72		100	16.12			1.67				0.37

5.3.4. Processing error

Data entry and coding

Editing controls

Contact form: control
Column 21, 22, 23 and 24
Column 8,21 and 22
Column 8,21 and 22
Column 21 and 22
Column 21, 22, 23, 24
Column 23, 24
Household questionnaire: control
H5 and H7:
H13
H19
H27, category g, H45 category g:
H44
H95
H97
Individual questionnaire : control
Question I6, I7 and I8
Question I6 , I7 and I8
Question I13 and I14:
Question I13 et I16
Question I14 and I16
Question I21 and I22
Question I21 and I29.
Question I29 and I22
Question I37
Question I38
Question I 52, I 92.
Question I 116
Question I25 (I26) (gross income) and question I27 (I28) (net income)
Question I25 and I 26
Household questionnaire : warnings
H16
H22 (monthly)
H22 (half-yearly)
H22 (yearly)
H23 (monthly)
H23 (half-yearly)
H23 (yearly)
H26
H33
H34, H37, H41
H43, H77, H84
H66
H71B
H79, H86
H93
Individual questionnaire : warnings
I25, I27, I47, I50, I90, I91
I53, I54, I55, I56, I86, I93, I94
I58
I98B, I98C, I115B, I115C
I99, I102B, I102C

You can’t combine father, mother or being spouse with ‘being younger than 12 years”. If higher than 1200

Editing controls

It’s not possible to combine being ‘female’ and being ‘father’.
It’s not possible to combine being ‘male’ and being ‘mother’.
Mother and father have to be older than their children (and at least being older than 12 years).
Parents of the spouses or of the partners must be different.
You can’t mix ‘spouse ‘and ‘partner’. Must choose one of both for the couple.

It is not possible to combine H5, code 6 with H7 code 2, 3, 4, 5, 6, 7, 8, 9, 10
Enter a numeric value between 1900 and 2008
The first of the reimbursement must be between 1954 and 2008 (included). The year of the first purchase must be at the same time or later than the date of buying.
Code 1 is only possible if at question H5, code 3,4,5,6 or 7
Not possible to answer more than 12 months
Persons have to be between the age of 11 and 23 (included) to obtain a scholarship for secondary school
Persons have to be between the age of 16 and 99 (included) to obtain a scholarship for higher education

You can’t combine code 2 of questions I6 and I7 with code 1, 2, 3, 4 and 10 of the question I8.
You can’t combine code 1 of question I6 or question I7 with code 5, 6, 7, 8, 9 and 11 of the question I8.
You can’t combine code 1,2,3,4 and 10 question in I13 with code 2 and 3 in question I14
You can’t combine code 1, 2, 3, 4 and 10 of the question I13 with code 1, 2 of the question I16.
You can’t combine code 2 or 3 of the question I14 and code 3 or 4 of the question I16.
You can’t combine code 1,2,3,4 or 10 in question I21 with code 2 or 3 in question I22.
You can’t combine code 1, 2, 3, 5, 6 of the question I29 with the code 1, 2, 3, 4 or 10 of the question I21.
You can’t combine code 7 of the question I 29 with code 2 or 3 of the question I22.
Age has to be less than current age and not less than 8 year.
Number of years can’t be higher than current age minus the age mentioned in question I37.
Can’t be higher than 12 months.
Can’t enter a year which is before date of birth.
Amounts given in question I25 can’t be higher than the amounts given in the question I27.
If the person didn’t give an exact amount at the question I25, please go to the question I26.

If lower than 500 or higher than 1000000
If lower than 20 or higher than 2000
If lower than 100 or higher than 10000
If lower than 200 or higher than 20000
If lower than 20 or higher than 2000
If lower than 100 or higher than 10000
If lower than 200 or higher than 20000
If lower than 25 or higher than 5000
If lower than 50 or higher than 10000
If lower than 100 or higher than 5000
If lower than 25 or higher than 1000
If lower than 100 or higher than 25000
If lower than 25 or higher than 750
If lower than 25 or higher than 1000
If lower than 100 or higher than 1500

If lower than 500 or higher than 5500
If lower than 6000 or higher than 66000
If higher than 1350
If higher than 5400

5.3.4.1. Imputation - rate

Not requested by Reg. 28/2004

5.3.4.2. Common units - proportion

5.3.5. Model assumption error

5.3.6. Data revision

5.3.6.1. Data revision - policy

5.3.6.2. Data revision - practice

5.3.6.3. Data revision - average size

5.3.7. Seasonal adjustment

6. Timeliness and punctuality

Top

6.1. Timeliness

Not requested by Reg. 28/2004

6.1.1. Time lag - first result

6.1.2. Time lag - final result

6.2. Punctuality

Not requested by Reg. 28/2004

6.2.1. Punctuality - delivery and publication

Not requested by Reg. 28/2004

7. Accessibility and clarity

Top

7.1. Dissemination format - News release

7.2. Dissemination format - Publications

7.3. Dissemination format - online database

Not requested by Reg. 28/2004

7.3.1. Data tables - consultations

Not requested by Reg. 28/2004

7.4. Dissemination format - microdata access

7.5. Documentation on methodology

Not requested by Reg. 28/2004

7.5.1. Metadata completeness - rate

7.5.2. Metadata - consultations

Not requested by Reg. 28/2004

7.6. Quality management - documentation

Not requested by Reg. 28/2004

7.7. Dissemination format - other

8. Comparability

Top

8.1. Comparability - geographical

8.1.1. Asymmetry for mirror flow statistics - coefficient

8.1.2. Reference population

Reference population	Private household definition	Household membership
The reference population is all citizens living officially living at Belgian territory (population de jure). This means that the source of our sample is the central population register. This Register includes all private households and their current members residing in the territory. Persons living in collective households and in institutions are excluded from the target population.	The definition of household that Eurostat recommends is used. Household is defined as a person living alone or a group of people who live together in the same dwelling and share expenditures including the joint provision of the essentials of living.	The definition of household membership is the same as mentioned in the Eurostat document EU-SILC065/03 about the description of target variables (Chapter ‘Units’). All household members of 16 year and older at the end of the income reference period, are selected for a personal interview.

8.1.3. Reference Period

Period for taxes on income and social insurance contributions	Income reference periods used	Reference period for taxes on wealth	Lag between the income ref period and current variables
A fixed twelve-month period, namely the previous calendar year. For SILC 2011, the period is the year 2010.	A fixed twelve-month period, namely the previous calendar year. For SILC 2011, the income reference period is the year 2010.	n.a.	The income reference period is the previous calendar year (year 2010) and the current variables refer to the fieldwork period (April-December2011). Therefore the lag is at minimum 4 months and at maximum 12 months.

8.1.4. Statistical concepts and definitions

Total hh gross income (HY010)	Total disposable hh income (HY020)	Total disposable hh income before social transfers other than old-age and survivors' benefits (HY022)	Total disposable hh income before all social transfers (HY023)
F	F (remark:We didn’t take count of HY120G, because regular taxes on wealth do not exist in Belgium.)	F	F

Imputed rent (HY030)	Income from rental of property or land (HY040)	Family/ Children related allowances (HY050)	Social exclusion payments not elsewhere classified (HY060)	Housing allowances (HY070)	Regular inter-hh cash transfers received (HY080)	Interest, dividends, profit from capital investments in incorporated businesses (HY090)	Interest paid on mortgage (HY100)	Income received by people aged under 16 (HY110)	Regular taxes on wealth (HY120)	Regular inter-hh transfers paid (HY130)
F	F	F	F	F	F	F	F	F	NC	F

Cash or near-cash employee income (PY010)	Other non-cash employee income (PY020)	Income from private use of company car (PY021)	Employers social insurance contributions (PY030)	Cash profits or losses from self-employment (PY050)	Value of goods produced for own consumption (PY070)	Unemployment benefits (PY090)	Old-age benefits (PY100)	Survivors benefits (PY110)	Sickness benefits (PY120)	Disability benefits (PY130)	Education-related allowances (PY140)	Gross monthly earnings for employees (PY200)
F	F	F	F	F	F	F	F	F	F/L/P/NC	F/L/P/NC	F/L/P/NC	F/L/P/NC

The source or procedure used for the collection of income variables

The form in which income variables at component level have been obtained

The method used for obtaining target variables in the required form

The collection of the income variables is by interview.Belgiumhas no income variables collected from registers for the survey of 2011.

Areas	Qr. Block	Target Variable	Unit of measurement	Tax or tax-exempt	If taxable, how the amount is recorded
Employee Income	PY010	Gross Employee Cash or near cash Income in reference period	Individual level	Taxable	Net + gross
Employee Income	PY020	Gross Non-Cash Employee income (company car, mail tickets)	Individual level	Not taxable (mail tickets are not taxable for the employee and can not be deducted from taxes by the employer) (the company car itself is not taxable but the kilometres that are done for job/work distances and for private distances are taxed: there is always a minimum of 5000 km taxed)
Self-employment Income	PY050	Gross Cash Income benefits/Losses from self-employment (including profit/loss from unincorporated enterprise, royalties)	Individual level	Taxable For losses, this means a deduction from taxes of this amount can be done on other income posts of that year or on income of the next year)	Net OR gross
Imputed rent[1]	HY030	imputed rent	Household level	-
Property income	HY090	Interest, dividends, profit from capital investments in unincorporated business	Individual level	Taxable	Net
	HY040	Income from rental of property or land	Household level	Taxable	Gross
	PY080	Regular pension from Private (non-ESSPROS) schemes	Individual level	Taxable	Gross (for the major part of the pensions)
Current transfer received Social benefits: ESSPROS Regular inter household transfer received	HY050	Family-related allowances: parental leave benefits	Individual level	Taxable	Net + gross
		Family-related allowances:	Household level	Not taxable
	HY060	Social assistance	Individual level	Not taxable
	HY070	Housing allowances	Household level	Not taxable
	PY090	Unemployment Benefits	Individual level	Taxable	Net + gross
	PY100	Old-age benefits	Individual level	Taxable	Net + gross
	PY110	Survivor’s Benefits	Individual level	Taxable	Net + gross
	PY120	Sickness Benefits	Individual level	Taxable	Net + gross
	PY130	Invalidity Benefits	Individual level	Taxable	Net + gross
	PY140	Education-related Allowances	Household level	Not taxable
	HY080	Regular inter-household cash transfers received	Household level	Not taxable, but taxed if alimentation	Gross
Other income received	HY110	Income received by people aged under 16	Household level	Not taxable
Interest payments	HY100	Interest repayments on mortgage	Household level	Taxable, this means a deduction from taxes can be done	Gross
Current transfers paid	HY130	Regular inter-household cash transfers paid	Household level	Not taxable or deductible, but taxed if alimentation	Gross

^{^[1]}Information on that component is asked because it is important to know if :

- an owner is taxed regarding his tenure status (specific tax on property income)

- a 'rent-free' tenant could be taxed on behalf of the accommodation's owner

See information on control, correction, imputation and creation of the gross target variables.

8.2. Comparability - over time

The results of the Belgian EU-SILC 2011 operation are very similar to those of the 2010 operation.

We have to note one change in the number of rooms in the dwelling : the question has changed since last year, and it results in a slight change in the figures.

8.2.1. Length of comparable time series

8.3. Comparability - domain

9. Coherence

Top

9.1. Coherence - cross domain

9.1.1. Coherence - sub annual and annual statistics

9.1.2. Coherence - National Accounts

9.2. Coherence - internal

10. Cost and Burden

Top

11. Confidentiality

Top

11.1. Confidentiality - policy

Not requested by Reg. 28/2004

11.2. Confidentiality - data treatment

Not requested by Reg. 28/2004

12. Statistical processing

Top

12.1. Source data

In Belgium, the sampling frame is the Central Population Register. This Register includes all private households and their current members residing in the territory. Persons living in collective households and in institutions are excluded from the target population.

The Central Population Register of 1 February was used.

Updating actions: Central Population Register is updated two times during a month. The changes were communicated to the interviewers.

12.1.1. Sampling design and procedure

Type of sampling design

The Belgian EU-SILC 2011 survey is based on a stratified 2-stage sampling scheme in 2004, followed by rotation since 2005. Rotation allows to replace roughly one fourth of the sample each year. Hence, households (ignoring split-offs) participating in 2012 have been drawn for participation since 2008, 2009, 2010 and 2011.

Stratification and sub stratification criteria

The main stratification criterion is the NUTS2 level. The 11 sampling strata are the 10 Belgian provinces (5 in Flanders – coded BE21-BE25 – and 5 in Wallonia – coded BE31 to BE35) and the Brussels Capital Region (BE10).Further implicit stratification is obtained by sorting PSUs (sub-municipalities) on mean income and sorting SSUs (households) in selected PSUs on age of reference person.

Sample selection schemes

Sampling units and 2-stage sampling in 2004

In 2004, when organizing EU-SILC for the first time (ignoring the pilot survey in 2003), 2-stage sampling has been applied in each sampling stratum.

Stage 1 – Primary Sampling Units

The primary sampling units (PSUs) in stage 1 are the municipalities, or parts thereof in the larger ones. In each stratum, the PSUs in the frame are first descendingly sorted by average income; next, a fixed number of times a PSU is drawn according to a systematic PPS (probability proportional to size) selection scheme, where size is measured as the number of private households. This systematic sampling method generally causes some PSUs being selected repeatedly (e.g. Schaerbeek, a rather large municipality in stratum BE10, turns out to be drawn 6 times). In total, i.e. in all 11 sampling strata together, 275 PSU draws were made in 2004, once and for all (i.e. for the whole duration of EU-SILC).

Stage 2 – Secondary Sampling Units

The secondary sampling units (SSUs) in stage 2 are private households. According to each single PSU draw, a group (generally of fixed size) of households is selected in this stage; notice that a group of households corresponds to each PSU draw.

In 2004, 40 households have been selected for each PSU draw (i.e. in each group); e.g. in Schaerbeek, 6 times 40 households were drawn. Systematic selection of households has been applied, after sorting the households in selected PSUs by age of reference person. Within each group, the selected households were numbered 1 to 40; households 1-10 constitute the first rotational group or replication, households 11-20 constitute the second rotational group or replication, and so on. The first replication was meant to participate in 2004 only, the second until 2005, and so on.

The initial household sample in 2004 was self-weighting, by the combination of (systematic) PPS sampling of sub-municipalities (PSUs) – size of PSUs being the number of private households – and (systematic) sampling of private households (SSUs), as explained.

Renewal of the sample by rotation, since 2005

Since 2005, a rotation scheme has been applied. Details for each year, from 2005 to 2010, can be found in the corresponding Quality Reports .

The rotation pattern is such that the overlap between samples in any two successive years is roughly 75%, and that the sample is completely renewed after 4 years. Hence four replications or rotational groups in each year, one of which is replaced the year after. Since 2005, each new replication remains in the survey during the next 4 years, and since 2007, each of the four replications is in the survey during four consecutive years.

At the start of 2011, the replication that is in the survey since 2007, is entirely (i.e. irrespective of whether the households are responding or not) dropped. The three replications which entered into the survey in 2008, 2009 and 2010, respectively, are retained (including their split-offs); the households belonging to these three replications will be designated ‘old’ hereafter.

The supplementary sample, i.e. the new replication that replaces the just dropped replication, is obtained by selecting, for each PSU draw, a fixed number of new households from the corresponding PSU. This selection is done again by systematic sampling, after sorting the households in each PSU on age of reference person. The number of new households for each PSU draw, is determined by considering some (expected) attrition of old households, some (expected) nonresponse for new households, and the required/desired minimum and maximum numbers of responding households, given some precision and budget constraints.

Hence, the (cross-sectional) sample of SILC 2011 consists of

• “old” households: drawn between 2008 and 2010; and

• “new” households: drawn in 2011, staying until 2014

12.1.2. Sampling unit

2-stage sampling .

Stage 1 – Primary Sampling Units

Stage 2 – Secondary Sampling Units

12.1.3. Sampling rate and sampling size

Concerning the SILC instrument, three different sample size definitions can be applied:

- the actual sample size which is the number of sampling units selected in the sample

- the achieved sample size which is the number of observed sampling units (household or individual) with an accepted interview

- the effective sample size which is defined as the achieved sample size divided by the design effect with regards to the at-risk-of poverty rate indicator

Given that the effective sample size has been already treated in the section dealing with sampling errors, in this section the attention focuses mainly on the achieved sample size.

Achieved sample size per nuts2

NUTS2	Name	Old hh	New hh	Total hh	Accepted hh (DB135 = 1)
BE10	Brussels	702	788	1490	837
BE21	Antwerpen	623	687	1310	752
BE22	Limburg	346	256	602	433
BE23	Oost-Vlaanderen	619	512	1131	712
BE24	Vlaams-Brabant	465	427	892	534
BE25	West-Vlaanderen	538	317	855	657
BE31	Brabant Wallon	173	128	301	189
BE32	Hainaut	753	625	1378	842
BE33	Liège	461	398	859	529
BE34	Luxembourg	161	96	257	193
BE35	Namur	216	162	378	232
Total	Belgium	5057	4396	9453	5910

12.2. Frequency of data collection

The survey is lead each year.

12.3. Data collection

Mode of data collection

A description of the mode of data collection used in your country. Please mention if you use mixed mode of data collection.

1-PAPI (% of total)	2-CAPI (% of total)	3-CATI (% of total)	4-Self administrated (% of total)
	100%

The mean interview duration

The mean interview duration per household is calculated as the sum of the duration of all household interviews plus the sum of the duration of all personal interviews, divided by the number of household questionnaires completed. Only households accepted for the database have to be considered.

Average interview duration =

44 minutes

12.4. Data validation

12.5. Data compilation

12.5.1. Weighting procedure

Design factor

Non-response adjustments

Adjustment to external data

Final cross sectional weights

2.1.8.2. Nonresponse correction for the new households

Following Eurostat’s suggestion (see Document 065, WEIGHTING II. WEIGHTING FOR THE FIRST YEAR OF EACH SUB-SAMPLE), we replaced the homogeneous response groups (based on household size crossed with urbanity) ratio by a multiple regression model (based on the same dummy variables). By “responding”, we mean only those households whose results were accepted (DB135=1). Since 2009 we used logistic regression.

The file was split by NUTS1 and the following variables were used

- Everywhere: Household size, recoded into the four values “one”, “two”, “three” and “four or more” (so three dummies)

- Out of Brussels: DB100 = urbanity

- In Brussels = BE10: median fiscal income of municipality

The regression produced a new variable “expresp”, allowing us to define

NRwei = INIwei/expresp

2.1.8.3 Attrition for the old households

Before “sharing” the 2010 weights, a correction for attrition should be introduced. This year, we elected to perform this correction at the level of individuals, since a 2010 sample person either stays in the panel or leaves it (rotated out, left population, noncontact, refusal or inability to respond, while the structure of a household can change. Note that all household characteristics (e.g. HH021) can be distributed to the members.

We separated the “Children” (for which only basic personal information from the R-file and the distributed H-file is available) from the “Adults” (present in the 2010 P-file as well), i.e. those persons born in 1992 or before.

In the children’s model, the following predictors (all, except the last, from the 2010 file – although this does not matter much for group A) were used, grouped by type :

A. individual demographic information: age from RB080, sex = RB090,

B. housing information: dwelling type = HH010 and tenure = HH020

C. household type: a limited number of dummies, as there is at least one dependent child;

D. monetary indicators: we refrained from taking the equivalised income (outliers), but took a transform of it, as well as the dummy “poor or not” and the subjective ability to make ends meet = HS120

E. sampling and rotation: number of years in panel (from DB075) and urbanisation (=DB100)

F. one variable (paradata) related to fieldwork in 2010 (computed from HB040 and HB050)

For the adults, the same predictors were used, and moreover :

G. variables from the P-file (related to education level and health);

H. country of birth (dummy Belgium Yes/No)

were integrated.

We used logistic regression.

Recall that 11 sampling strata were used (provinces= NUTS2); we use 3 extrapolation strata (the 3 NUTS1 regions BRUssels=BE1, VLAanderen=BE2 and WALlonia=BE3)

Calibration model:

VLA, WAL:

SIZE4+(AGE8XSEX2)+PROV5 20 individual + 4 household constraints

BRU:

SIZE4+(AGE8XSEX2) 16 individual + 4 household constraints

Prov = province where interviewed (differs from DB040 in two cases)

Individual constraints 27=16+11 (age*sex + prov; note that each province belongs to one single region (extrapolation stratum), for the other two regions, the total is set to 0 and the condition is vacuous)

Household constraints 4 (size: "1", "2", "3 or "4 & more",)

Calibration type (after some trials and errors…): truncated

N : 5910

Minimum: 89.24

Maximum : 5958.57

Mean : 800.42

Std Dev : 402.05

12.5.2. Estimation and imputation

Imputation procedure used

Imputed rent

Company car

We (i.) corrected (not imputed – in fact) a greater number of cases and if correction was not desirable or possible, but information on a directly comparable variable was present anyway (see section on internal information above), we (ii.) resorted to direct imputation, via a regression model.

i. Corrections.

Corrections were also mainly done on basis of information in other comparable variables. Gross-net ratio of 12 - yearly income entered as monthly or vice versa - lead to simple corrections of the gross or the net, for example.

ii. Regressions.

If correction was not desirable or possible but information on a directly comparable variable was present anyway, we resorted to direct imputation, via a regression model, of the variable for which input was missing. Below we describe how this was done for net –gross imputation, which were the most prevalent instances of that sort. The method was extended, however, to other imputations (imputations of the 2011 income based on the current income, for example).

Missing values on gross income variables (PY010G, PY020G, … and components) were, if collected, imputed on the basis of the corresponding net variables (PY010N, PY020N, … and components). The implementation of this imputation procedure was quasi-similar for almost all (income) variables on which it was applied. The procedure implied 6-steps:

1. Identification of the ‘reference cases’ (both gross and net collected) and identification of the cases to be imputed (net collected – gross missing).

2. Calculation of the gross/net ratio for the reference cases. Cases with an extreme value on this ratio were excluded from further use in the procedure.

3. Curve estimation of the relation (regression model) between gross and net income. The best fitting model (linear, logarithmic, quadratic, exponential) was being implemented.

4. Implementation of the regression model for the reference cases to identify outliers.

5. Re-implementation of the regression model for the reference cases after removal of the outliers.

6. Actual imputation step: missing (gross) values are imputed on the basis of

a) net values and

b) the estimates for the relation between gross and net income assessed in the steps above.

In step 1 the cases of which both gross and net income were collected are identified. We refer to these cases as ‘reference cases’ (step 1). The relationship between their net and gross income serves as reference for the imputation of the gross incomes for the cases where only the net was collected (cases to be imputed).

To avoid bias in this imputation model atypical reference cases (both outliers and errors) were identified and removed at several steps in the procedure (step 2 and 4).

In step 2 (reference)cases for whom the ratio between gross and net income exceeded what can be considered typical for the taxation regime applicable to the income concerned, were excluded.

In the case of almost all variables the boundary value of this ratio was set at 2,5. This boundary was arbitrary chosen.

Scrutiny of the excluded cases, however, validates this value’s potential to discriminate between incomes which were subjected to real(istic) taxation and outliers or errors.

The latter category seldom counted more than a few percent of the total population in the survey and their gross/net ratio often exceeded the 2,5 considerably.

Further exploration also revealed that the exclusion of these cases from the procedure results in a dramatic increase of the fit of the regression model on which the imputation is based.

In step 4 outliers in the regression model were identified and removed using default regression diagnostics.

The underlying probability model of the net-gross relation was assessed with ‘curve-estimation’ procedure (step 3). It can be hypothesised that in most taxation schemes this relation will not be linear as higher revenues will be subjected to disproportionate higher taxes. The concern therefore is that application of a linear regression model may lead to biased result. Step 3 is an answer to that concern, which turned out to be unfounded, however. In fact, for most variables the linear model fitted the data well. For a few variables the fit of the quadratic model was slightly better, however. Overall, and we underline this, the fit was very good and R-squares very high (always > 0.85)

The estimates of this regression model (step 5) served as direct input for the implementation of the actual imputation (step 6).

iii. Other techniques.

Although we preferred the techniques above we were in some instances forced to resort to other techniques (due to lack of information – for example).

For some cases we imputed median values calculated after categorising using relevant variables. Most of the median values imputed, were for example, calculated after categorisation for status.

From 2007 onwards a measure for ‘imputed rent’ needs to add to the data.

IN the QR-rapport for the 2007 we extensively reported on the method to calculated imputed rent. In the 2011 operation exactly the same method has been used. Results were very similar.

Since 2005, we decided to work with the national rules of the tax authorities. The benefit for individuals of using a company car for private goals was not directly assessed at the interview but afterwards calculated by applying the applicable taxation rules.

The fiscal benefit of all nature that a person has - due to disposition of a company car for private goals - is calculated by multiplying a fixed amount of kilometres driven for private use by a coefficient. To calculate the latest we need the fiscal cylinder capacity of the car. This fixed amount of kilometres driven for private use is for the tax authorities 5000 km if the distance home-work is less than 25 km, and 7500 if it’s more than 25 km.

Since 2005, we asked directly the fiscal cylinder capacity and the distance between work and home. In case of non response of the cylinder capacity, we asked the mark, type and registration year of the car. Than we had to use an imputation method.

Imputation: To calculate the cylinder capacity, we did the following. We assumed that a company car is mostly diesel driven. We looked up for each mark, type and diesel engine what the corresponding cylinder capacity is. If we had several cylinder capacities for the type of the mark, we calculated the weighted mean of the cylinder capacity. If there is not diesel version for a type of car, we did the same logic but than for petrol.

Once we had that we could easily find the corresponding fiscal coefficient. Than we only had to multiply it by the fixed amount of kilometres driven for private use to obtain the fiscal benefit of all nature

Example:

Type of car Fiscal cylinder capacity Forfait Distance home work Fixed amount Fiscal benefit of all nature

Smart fortwo 5 0,1898 < 25 km 5000 949 €

Smart fortwo 5 0,1898 > 25 km 7500 1423.5 €

After we calculated the fiscal benefit of all nature for a whole year, we weighted it for respondents who didn’t dispose for a whole year of the company car. The fiscal benefit of all nature is a gross non-cash employee income.

12.6. Adjustment

13. Comment

Top

National questionnaire is available in Circa BC at: https://circabc.europa.eu/

Please select EU SILC section and then select the folder called “06 National Questionnaire” in the library list.

Annexes

Top