Chapter 8 Quality Assurance Frameworks and Metadata
Ensuring data quality is a core challenge of all statistical offices. Energy data made available to users are the end product of a complex process comprising many stages, including the definition of concepts and variables, the collection of data from various sources, data processing, analysis, formatting to meet user needs and finally, data dissemination. Achieving overall data quality is dependent upon ensuring quality in all stages of the process.
Quality assurance comprises all institutional and organizational conditions and activities that provide confidence that the product or service is adequate for its intended use by clients and stakeholders. In other words, the quality is judged by its “fitness for use.” The pursuit of good quality means having a legal basis for the compilation of data, ensuring that the institutional environment is objective and free of political interference, ensuring the adequacy of data-sharing and coordination among data-producing agencies, assuring the confidentiality and security of information, addressing the concerns of respondents regarding reporting burden, providing adequate human, financial and technical resources for the professional operation of energy statistics, and implementing measures to ensure their efficient, cost-effective use. All the actions that responsible agencies take to assure data quality constitute quality assurance. In the IRES, all countries were encouraged to develop their own national energy data quality assurance, to document these, to develop measures of data quality, and to make these available to users.
Managers of statistical agencies must also promote and demonstrate their support for ensuring quality throughout the organization. This can be done in a number of ways:
Most international organizations and countries have developed general definitions of data quality, outlining the various dimensions (aspects) of quality and quality measurement, and integrating them into quality assurance frameworks. Although these quality assurance frameworks may differ to some extent in their approaches to quality and in the number, name and scope of quality dimensions, they complement each other and provide comprehensive and flexible structures for the qualitative assessment of a broad range of statistics, including energy statistics.
The overall objective of these frameworks is to standardize and systematize quality practices and measurement across countries. They allow the assessment of national practices in energy statistics in terms of internationally (or regionally) accepted approaches for data quality measurement. The quality assurance frameworks can be used in a number of contexts, including for (a) guiding countries’ efforts towards strengthening and maintaining their statistical systems by providing a self-assessment tool and a means of identifying areas for improvement; (b) supporting technical development and enhancement purposes; (c) reviews of a country’s energy statistics program as performed by international organizations; and (d) assessments by other groups of data users.
National agencies responsible for energy statistics can decide to implement one of the existing frameworks for quality assurance for any type of statistics, including energy statistics, either directly or by developing, on the basis of those frameworks, a national quality assessment framework that best fits their country’s practices and circumstances. See Box 8.1 for references to data quality frameworks from various countries and organizations.
Box 8.1 Examples of Data Quality Frameworks
United Nations (2012). National Quality Assurance Frameworks. Prepared by the United Nations Statistics Division, New York. (obtain additional quality frameworks ?) |
The following dimensions of quality reflect a broad perspective and therefore, have been incorporated in many of the existing data quality frameworks. The dimensions of quality below should be taken into account when measuring and reporting the quality of statistics. These dimensions can be divided into static and dynamic elements of quality.
Quality measures/indicators: Identification of gaps between key user needs and compiled energy statistics in terms of concepts, coverage and detail. Compile through structured consultations and regular feedback. Perception from user feedback surveys. Monitor requests for information and the capacity to respond.
Quality measure/indicator: Perceptions from user feedback survey.
Quality measures/indicators: Sampling errors (standard errors). Non-sampling errors (overall and item response rate). Quantity response rate (e.g., percentage of total energy production reported, weighted response rate). Number, frequency and size of revisions to energy data.
Quality measures/indicators: Time lag between the end of the reference period and the date of the first release (or the release of final results) of energy data.
Quality measures/indicators: Comparison and joint use of related energy data from different sources. Number and rates of divergences from the relevant international statistical standards in concepts and measurement procedures used in the collection/compilation of energy statistics.
Quality measures/indicators: Number of announcements of release of energy data. Number and types of methods used for dissemination of energy statistics. Number of energy statistics data sets made available by mode of dissemination, as a percentage of total energy statistics data sets produced. The number of requests for information.
Quality measures/indicators: Non-response rate and imputation rate.
Quality measure/indicator: Proportion of population covered by data collected.
Quality measure/indicator: Deterioration of sample.
In addition to the above quality dimensions, interpretability is another important criterion of quality in regards to metadata.
The above dimensions of quality were also incorporated into the country practice template that was developed by the Oslo Group and the UN Statistics Division. This template enables countries to report and share their practices. Some of these practices are presented below in Box 8.2 to demonstrate how the dimensions of quality are applied.
Box 8.2 Examples of Country Practices on the Quality Dimensions in Energy Statistics Electricity Energy Balances Central Statistical Bureau of Latvia (April 2012). Energy Balance. Prepared by the Central Statistical Bureau of Latvia. Statistics Mauritius (March 2012). Energy Balance Compilation. Prepared by Statistics Mauritius. Consumption
Statistics Canada (August 2012). Industrial Consumption of Energy Survey. Prepared by Statistics Canada. Czech Statistical Office (March 2012). Energy Consumption and Fuel by Year. Prepared by the Czech Statistical Office. Other energy topics Sustainable Energy Authority of Ireland (October 2012). Combined Heat and Power. Prepared by the Sustainable Energy Authority of Ireland. ISTAT (April 2012). Urban Environment Indicators on Energy. Prepared by ISTAT, Italy. |
Ensuring data quality is an important function of any statistical organization, whether it be centralized or decentralized. Below is the example of Sweden’s decentralized statistical system and quality of official statistics.
Box 8.3 Example of Data Quality in a Decentralized System |
Ensuring Data Quality in a Statistical Survey Process
To ensure data quality, strategies must be implemented at every stage of a statistical survey process, from start to finish. Chapter 4 looks at quality measures related to each stage of the survey process. The main stages of a statistical survey process are: specify needs, design, build, collect, process, analyze, disseminate, archive, and evaluate. These represent the nine stages of the Generic Statistical Business Process Model (GSBPM) which are described in detail in Chapter 4.
Metadata on statistics
The term metadata defines all information used to describe other data. A very short definition of metadata is “data about data.” Metadata descriptions go beyond the pure form and content of data to encompass administrative facts about the data (e.g., who has created them and when), and how data were collected and processed before they were disseminated or stored in a database. In addition, metadata facilitate the efficient search for and location of data. Documentation on data quality and methodology is an integral component of statistical data and analytical results based on these data. Such documentation provides the means of assessing fitness for use and contributes directly to their interpretability.
Statistical metadata describe or document microdata, macrodata or other metadata and facilitate the sharing, querying and understanding of data. Statistical metadata also refer to any methodological descriptions on how data are collected and manipulated. For energy statistics, for example, metadata include the name of the data variable, the statistical unit from which the information has been collected, data sources, information about classifications of energy products used, and series breaks, and definitions of energy products, and methodologies used in their compilation. Metadata are essential for the interpretation of statistical data. Without appropriate metadata, it would not be possible to fully understand energy statistics or to conduct international comparisons.
There is a bidirectional relationship between metadata and quality. On the one hand, metadata describe the quality of statistics. On the other hand, metadata are a quality component which improves the availability and accessibility of statistical data. There are many types of users and uses for any given set of data. The wide range of possible users and uses means that a broad spectrum of metadata requirements has to be addressed. In particular, the responsible agencies as data suppliers must make sufficient metadata available to enable both the least and the most sophisticated users to readily assess the data and their quality. The following Box 8.4 presents the type of information that should be available to data users when disseminating data. Next, in Box 8.5, are examples of metadata or survey documentation published by some countries to assist users with the interpretation of the statistics.
Box 8.4 Information that Should Accompany Statistical Releases (Metadata)
· Survey/Product name |
· Objectives of survey |
· Timeframe |
· Concepts and definitions |
· Target population |
· Collection method |
· For sample surveys: |
· Error detection |
· Imputation of missing data |
· Disclosure control |
· Revisions |
· Description of analytical methods used |
· Other explanatory notes |
· Links to other information or documents |
Source: United Nations, 2011, International Recommendations for Energy Statistics, Working Group “Oslo Working Group on Energy Statistics”, 42nd meeting, New York, February 201
Box 8.5 Country Examples of Metadata on Statistics Statistics Canada Finland The Netherlands (ask countries for additional examples of metadata/documentation) |
The Future of Metadata
As statistical processes evolve, there is a push to make metadata a driver of statistical business process design and to standardize the collection of metadata within and across different international, statistical organisations. While these efforts to streamline rules and procedures for metadata are still in development, the goal is to create an integrated approach to producing and recording metadata.
Examples of international metadata and exchange protocols are the Data Documentation Initiative (DDI) and the Statistical Data and Metadata Exchange (SDMX) being developed by Eurostat, on behalf of seven international organizations (Eurostat, OECD, UNSD, IMF, WB, ECB, and IBS). With this increased international cooperation, it is hoped that a standardized framework can be established that will increase the comparability of statistical data across countries and international organisations.
In the future, it is expected that these metadata can be used on a proactive basis to prescribe definitions, concepts, variables and standards to surveys that are being designed or redeveloped. This will help to ensure consistency and comparability of the data to be collected. Refer to Box 8.6 provides for a list of metadata resources and guidelines.
Box 8.6 Metadata Resources and Guidelines |
United Nations (2011). International Recommendations for Energy Statistics. Prepared by the United Nations Statistics Division, New York, p. 138.
United Nations (2011). International Recommendations for Energy Statistics. Prepared by the United Nations Statistics Division, New York, p. 139-147.
Statistics Canada (2002). Statistics Canada’s Quality Assurance Framework. Catalogue no. 12-586-XIE. Statistics Canada, Ottawa. Available: http://www.statcan.gc.ca/pub/12-586-x/12-586-x2002001-eng.pdf.
The Swedish Official Statistical System, document dated January 28, 2013, received from Niklas Notstrand, Swedish Energy Agency.
Source: https://unstats.un.org/oslogroup/methodology/docs/escm-edited/ESCM%20Chapter%208%20140422.doc
Web site to visit: https://unstats.un.org
Author of the text: indicated on the source document of the above text
If you are the author of the text above and you not agree to share your knowledge for teaching, research, scholarship (for fair use as indicated in the United States copyrigh low) please send us an e-mail and we will remove your text quickly. Fair use is a limitation and exception to the exclusive right granted by copyright law to the author of a creative work. In United States copyright law, fair use is a doctrine that permits limited use of copyrighted material without acquiring permission from the rights holders. Examples of fair use include commentary, search engines, criticism, news reporting, research, teaching, library archiving and scholarship. It provides for the legal, unlicensed citation or incorporation of copyrighted material in another author's work under a four-factor balancing test. (source: http://en.wikipedia.org/wiki/Fair_use)
The information of medicine and health contained in the site are of a general nature and purpose which is purely informative and for this reason may not replace in any case, the council of a doctor or a qualified entity legally to the profession.
The texts are the property of their respective authors and we thank them for giving us the opportunity to share for free to students, teachers and users of the Web their texts will used only for illustrative educational and scientific purposes only.
All the information in our site are given for nonprofit educational purposes