This glossary contains the key terms used in the operating model for data sharing and their definitions. Please note that these definitions may not be universal.
Aggregation is a statistical procedure for combining and summarising data. Aggregated data describes a group of persons instead of an individual person. The data for these groups is formed in such a way that the persons in them cannot be identified.
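As a rough illustration of the idea, the following minimal sketch summarises invented person-level records per group. The record fields, the `aggregate_by` helper and the minimum group size of 3 are assumptions made for this example only; the glossary does not prescribe any particular threshold or method.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical person-level records; all names and values are invented.
records = [
    {"municipality": "A", "age": 34},
    {"municipality": "A", "age": 41},
    {"municipality": "A", "age": 29},
    {"municipality": "B", "age": 52},
]

MIN_GROUP_SIZE = 3  # assumed threshold, not taken from the glossary


def aggregate_by(rows, key, value, min_group_size=MIN_GROUP_SIZE):
    """Return count and mean per group, leaving out groups below the threshold."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row[value])
    return {
        group: {"count": len(values), "mean": mean(values)}
        for group, values in groups.items()
        if len(values) >= min_group_size
    }


summary = aggregate_by(records, "municipality", "age")
```

Here group "B" contains only one person, so it is left out of the summary. Suppressing small groups is one common precaution, but it does not on its own guarantee that nobody can be identified.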
Anonymisation means processing data to eliminate the possibility of identifying individuals directly or indirectly. For example, identifiers can be deleted or generalised to a level where individuals can no longer be identified. Identifiers include names, addresses, phone numbers and personal identity codes.
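The two techniques mentioned above, deleting identifiers and generalising values, could be sketched as follows. The field names, the `anonymise` helper and the ten-year age bands are hypothetical choices for this example. Whether the result is truly anonymous depends on the whole dataset and must be assessed case by case.

```python
# Assumed set of direct identifiers; a real dataset may contain others.
DIRECT_IDENTIFIERS = {"name", "address", "phone", "personal_identity_code"}


def anonymise(record):
    """Drop direct identifiers and generalise age into a ten-year band."""
    result = {k: v for k, v in record.items() if k not in DIRECT_IDENTIFIERS}
    if "age" in result:
        lower = (result.pop("age") // 10) * 10
        result["age_band"] = f"{lower}-{lower + 9}"
    return result


# Invented example record.
person = {"name": "Example Person", "phone": "000 000", "age": 37, "municipality": "A"}
anonymised = anonymise(person)
```

In this sketch the name and phone number are removed, and the exact age is replaced with the band it falls into.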
sv Öppna data
fi Avoin data
sv Öppen källkod
fi Avoin lähdekoodi
Open source is a method for developing and distributing computer software in such a way that users can freely use, copy, modify, and distribute it. In the open source development model, both ideas and implementations can be inspected and utilised by everyone.
sv Öppet gränssnitt
fi Avoin rajapinta
An API whose features are fully public and can be used without restrictive conditions. Open APIs can be used free of charge, and the user need not ask the API holder for permission or inform them in advance of the purpose for which they intend to use the API. The description and documentation of an open API must be freely accessible to everyone. In addition, the API must be available for testing purposes.
The Comprehensive Knowledge Archive Network (CKAN) is an open source information management system published by the Open Knowledge Foundation in 2007 which has since been developed further. It was designed particularly for publishing and finding open data. See data catalogue.
Source: CKAN website.
sv Creative Commons
fi Creative Commons
Creative Commons is a non-profit organisation that promotes the sharing and use of creativity and information. CC licences are a standardised and internationally recognised way of granting rights to the use, further processing and sharing of data.
Source: Creative Commons website.
Data means information in a machine-readable, communicable, or processed format.
In everyday speech, the terms “data”, “information” and “knowledge” are often used interchangeably.
Source: Tietotekniikan termitalkoot: data
A data catalogue is a structured metadata register in which the metadata for data held by more than one public organisation is combined.
Data catalogues can be:
- national (including Open Data and data.gov.uk),
- regional (Washington D.C. or Helsinki Region Infoshare),
- maintained by cities (San Francisco and Tampere),
- maintained privately (Sunlight Foundation - National datacatalog).
Harvesting refers to the automatic collection of data from different websites to a single location, such as the Open Data service. For example, Open Data harvests Paikkatietohakemisto, which means that Paikkatietohakemisto data can also be found automatically on Open Data. Harvesting makes it easier to find data, as you can search for it centrally in one location rather than visiting many different sites.
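The principle of harvesting can be sketched as collecting metadata records from several sources into one searchable index. In the example below, the `harvest` helper, the source names and the records are all hypothetical, and the fetch functions are offline stand-ins. A real harvester would call each source catalogue's own interface instead.

```python
def harvest(sources):
    """Collect metadata records from each source into one index keyed by identifier."""
    index = {}
    for source_name, fetch in sources.items():
        for record in fetch():
            # Record where each entry came from so users can trace it back.
            index[record["id"]] = {**record, "harvested_from": source_name}
    return index


# Hypothetical stand-ins for remote catalogues.
def fetch_spatial_catalogue():
    return [{"id": "roads-2020", "title": "Road network"}]


def fetch_city_catalogue():
    return [{"id": "parks", "title": "City parks"}]


central_index = harvest({"spatial": fetch_spatial_catalogue, "city": fetch_city_catalogue})

# A single search now covers records from both sources.
matches = [r for r in central_index.values() if "park" in r["title"].lower()]
```

The search at the end runs against the central index only, which is the practical benefit described above.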
Information refers to interpretable data that can be refined into knowledge.
Source: Finto: information
Machine-readable data is data content structured in a way that enables a computer to process it.
A licence sets out the terms and conditions applicable to the use of intellectual property rights, the right to data protection, or another object protected by rights.
Metadata is information that describes the context, content or structure of a dataset and guides and documents its processing and management.
Source: Finto: metadata
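As a small illustration, a metadata record for a dataset might look like the following when expressed in a machine-readable format. The field names and values are invented for this sketch and do not follow any particular metadata standard.

```python
import json

# Hypothetical metadata record; field names are illustrative, not a standard schema.
dataset_metadata = {
    "title": "Example dataset",
    "description": "Invented record used only to illustrate metadata.",
    "format": "CSV",
    "licence": "CC BY 4.0",
    "updated": "2024-01-01",
}

# Serialising to JSON gives a form that other programs can read back.
serialised = json.dumps(dataset_metadata, indent=2, sort_keys=True)
restored = json.loads(serialised)
```

The record describes the dataset (its content, format and licence) without containing any of the dataset's actual data.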
Application programming interface
An interface is a standard practice or connection point that allows data to be transferred between devices, programs, or users. The interface between two devices can consist of, for example, a physical connection and the specifications required for data transfer. The interface between a program and its user is the user interface. An application programming interface (API) is an interface between two programs, such as the operating system and an application, that provides ready-made services to make programmers' work easier.
MyData is data on a person that can be managed and utilised by that person. For example, MyData can be health and well-being data recorded by the person or their family, such as information on vaccinations, doctor's visits, laboratory tests, and exercise performance.
Source: TEPA-termipankki: omadata
Pseudonymisation means processing personal data so that it can no longer be connected to a certain person without additional information.
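One possible way to do this, shown here only as a sketch, is to replace an identifier with a keyed hash. The secret key then acts as the "additional information": without it, the pseudonym cannot readily be linked back to the person. The `pseudonymise` helper, the key and the identifier below are hypothetical, and the glossary does not prescribe this or any other technique.

```python
import hashlib
import hmac

# Assumed secret key; in practice it would be stored separately from the data.
SECRET_KEY = b"example-key-kept-separately"


def pseudonymise(identifier: str, secret_key: bytes) -> str:
    """Return a stable pseudonym for the identifier using HMAC-SHA256."""
    return hmac.new(secret_key, identifier.encode("utf-8"), hashlib.sha256).hexdigest()


# Invented identifier; the same input and key always give the same pseudonym.
pseudonym = pseudonymise("person-123", SECRET_KEY)
```

Because the same identifier always maps to the same pseudonym, records about one person can still be combined. Unlike anonymised data, pseudonymised data can be linked back to the person by whoever holds the key.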
Raw data refers to data (machine-readable information) that is raw or unedited from the perspective of a specific information processing phase.
Source: Finto: raw data
Data sharing ecosystem
sv Ekosystem för datadelning
fi Tiedon jakamisen ekosysteemi
A data sharing ecosystem refers to a network consisting of data-sharing organisations and parties using shared data. It encompasses all actors and systems involved in the sharing, maintenance, and utilisation of data.
sv Ledning genom information
fi Tiedolla johtaminen
A part of information and knowledge management that aims to enable and facilitate knowledge-based decision-making.
Information management is the act of arranging knowledge processes in such a way that the availability, discoverability and utilisation of data for different purposes can be ensured throughout the lifespan of the data.
Source: Finto: information management
Knowledge is information that has been comprehended and incorporated into the knower's intellectual understanding of the subject.
A dataset is an entity formed by a selection of storage media and the data stored in them. A dataset can be provided in both electronic and paper form.
Source: Termipankki: tietoaineisto
Knowledge management is management that promotes an organisation's ability to create value with information, knowledge and expertise.
Source: Finto: knowledge management
An information system means an overall arrangement comprising data processing equipment, software and other data processing.
Data balance sheet
A report that is used to support an organisation's information and knowledge management by describing the state of its data processing and data management practices.
Source: Finto: data balance sheet
An information resource is a dataset or collection of datasets that contains logically or physically related information and that has been formed for a specific purpose.
Source: Finto: information resource
Interoperability is the ability of the actors, processes and information systems connected to an activity to act and communicate together in such a way and to such an extent that they can routinely use and understand each other's data.
Source: Finto: interoperability