JP-2023183313-A - DATA DISTRIBUTION INTERMEDIATION SYSTEM AND DATA DISTRIBUTION INTERMEDIATION METHOD
Abstract
To provide a data distribution intermediation system and method which can efficiently monitor data use. SOLUTION: A data distribution intermediation system 10 for mediating data provided from a data providing system 20 to a service providing system 30 is configured to: generate data collections in a predetermined unit from time-series data provided from the data providing system; generate metadata for monitoring data use, to be stored for each of the generated data collections; and provide the generated data collections to the service providing system. The system detects a service generated by the service providing system using the data collections, updates the metadata, and determines whether the detected service conforms to a predetermined purpose set in advance. SELECTED DRAWING: Figure 1
Inventors
- PUNIT GALAV
- BANDARA SYAFRIL
- OKADA AKEMASA
Assignees
- HITACHI LTD
Dates
- Publication Date
- 20231227
- Application Date
- 20220615
- Priority Date
- 20220615
Claims (13)
- A data distribution intermediary system that mediates data provided from a data providing system to a service providing system, comprising: a processor section; a memory section used by the processor section; and a communication section used by the processor section and connected to the data providing system and the service providing system, wherein the processor section: generates data collections of a predetermined unit from time-series data provided by the data providing system; generates and stores metadata for monitoring data usage for each generated data collection; provides the generated data collections to the service providing system; detects a service generated by the service providing system using the data collections and updates the metadata; and determines whether the detected service is compatible with a predetermined purpose set in advance.
- The data distribution intermediary system according to claim 1, wherein the processor section: when provision of data is requested by the service providing system, judges whether to provide the data to the service providing system based on a reliability indicating that the service providing system uses the data for the predetermined purpose; when the reliability of the service providing system is determined to be equal to or higher than a predetermined threshold, requests the data providing system to provide data; stores the data provided from the data providing system via the communication section in the memory section to generate the data collections of the predetermined unit; generates and stores the metadata for monitoring the data usage for each of the generated data collections; transmits the generated data collections to the service providing system via the communication section; updates the metadata according to a service provided by the service providing system based on the data collections; and determines, based on the metadata, whether the service provided by the service providing system is suitable for the predetermined purpose, and updates the reliability according to the determination result.
- The data distribution intermediary system according to claim 1, wherein the processor section warns the data providing system when a service provided by the service providing system does not meet the predetermined purpose.
- The data distribution intermediary system according to claim 1, wherein the processor section restricts provision of the data collection to the service providing system when the service provided by the service providing system does not meet the predetermined purpose.
- The data distribution intermediary system according to claim 1, wherein the processor section, in a predetermined case, approves the service providing system providing an exceptional service that is not compatible with the predetermined purpose.
- The data distribution intermediary system according to claim 1, wherein the predetermined unit of data collection is a collection of time-series data for each predetermined time window having a preset time length.
- The data distribution intermediary system according to claim 6, wherein the processor section generates a data collection for each of the predetermined time windows based on time information included in the time-series data provided by the data providing system.
- The data distribution intermediary system according to claim 1, wherein the predetermined unit of data collection is a set of time-series data provided for each of the service providing systems.
- The data distribution intermediary system according to claim 1, wherein the metadata includes information indicating the purpose of the service provided by the service providing system.
- The data distribution intermediary system according to claim 9, wherein the metadata further includes information specifying the service providing system that uses the data collection.
- The data distribution intermediary system according to claim 5, wherein the processor section also monitors the exceptional service provided by the service providing system.
- The data distribution intermediary system according to claim 1, wherein, when receiving a request for data provision from a new service providing system, the processor section acquires data from the data providing system at least once to generate a data collection requested by the new service providing system, and sends the data to the new service providing system.
- A data distribution intermediary method for mediating, by a data distribution intermediary system, data provided from a data providing system to a service providing system, the method comprising, by the data distribution intermediary system: generating data collections of a predetermined unit from time-series data provided by the data providing system; generating and storing metadata for monitoring data usage for each generated data collection; providing the generated data collections to the service providing system; detecting a service generated by the service providing system using the data collections and updating the metadata; and determining whether the detected service is compatible with a predetermined purpose set in advance.
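
The time-window collection recited in claims 6 and 7, together with the per-collection metadata of claims 1, 9, and 10, can be illustrated with a minimal Python sketch. It is not taken from the patent: the names `Collection`, `build_collections`, `consumer`, `purpose`, and `services`, the integer-second timestamps, and the sample readings are all assumptions made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Collection:
    window_start: int   # start of the time window, in seconds (assumed unit)
    records: list       # (timestamp, value) pairs that fall inside the window
    metadata: dict = field(default_factory=dict)


def build_collections(records, window_seconds, consumer_id, purpose):
    """Group (timestamp, value) records into fixed-length time windows
    and attach one metadata record to each resulting collection."""
    buckets = defaultdict(list)
    for ts, value in records:
        # Use the time information in each record to find its window.
        start = (ts // window_seconds) * window_seconds
        buckets[start].append((ts, value))

    collections = []
    for start in sorted(buckets):
        # Hypothetical metadata layout: who receives the collection,
        # the agreed purpose, and the services later built from it.
        meta = {"consumer": consumer_id, "purpose": purpose, "services": []}
        collections.append(Collection(start, buckets[start], meta))
    return collections


# Hypothetical second-level readings grouped into 60-second windows.
readings = [(0, 36.5), (1, 36.6), (59, 36.4), (60, 36.7), (125, 36.9)]
collections = build_collections(readings, 60, "provider-A", "health-monitoring")
print([(c.window_start, len(c.records)) for c in collections])
```

In this sketch one metadata record is kept per window instead of one per sample, which is one plausible reading of how collecting data in a predetermined unit reduces the volume of metadata to be monitored.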
Description
The present invention relates to a data distribution intermediary system and a data distribution intermediary method.
In the so-called IoT (Internet of Things) era, a large amount of digital data is generated for each user. This data is distributed across domains to improve existing services and create new ones. The background of this disclosure is a data distribution governance framework that circulates data among data providers, service providers, and consumers of data-driven services so that data owners can manage their data. This governance framework is provided by a data distribution intermediary system. A data distribution intermediary system provides a trusted environment for participants in digital marketplaces and allows them to collaborate so that all accepted rules and legal standards are mutually and transparently respected. It consists of various functions that address problems related to data distribution, such as safety, privacy, and security risks.
Patent Documents 1 and 2 disclose data distribution intermediary systems for data distribution. Patent Document 1 relates to efficient access to medical records. Patent Document 2 relates to reliability of data distribution. Patent Document 3 relates to generation of metadata for a collection of items.
One of the functions of the data distribution intermediary system is to ensure data traceability and reliability. Metadata is monitored to ensure data traceability: it is created for each data transfer and updated for each data use. Blockchain is used to record metadata in data distribution. Patent Document 1 provides a method for recording data distribution and consent in a ledger that implements a blockchain mechanism, and uses this mechanism to create a record of data distribution for each patient.
Patent Document 2 updates data tracking information (a blockchain or the like) that indicates the storage, processing, and distribution of data, and measures the reliability of an entity using this data tracking information.
Metadata is monitored to understand data usage. Service providers acquire data through the data distribution intermediary system and provide services based on pre-agreed data usage conditions. If a service provider violates these conditions and uses data in an unintended manner, this method allows data owners to track data usage. To monitor the metadata of data usage, a metadata model is designed with the items "service" and "owner", which record the service creator and the current data owner. The data monitoring method determines whether there is any unintended use of data by comparing the created "service" with the service that the "owner" has agreed to in advance, and alerts data owners in case of unintended use of data.
The data monitoring method is applied to time-series data. Since this data is generated at short time intervals on the order of seconds, metadata for distributing the corresponding data to service providers is also generated at a frequency on the order of seconds. Monitoring metadata records are updated according to data usage.
US Patent Application Publication No. 2018/0082023; US Patent No. 9,928,290; US Patent No. 8,321,456
- FIG. 1 is an overall configuration diagram of a data distribution system that confirms whether data is being used as intended.
- FIG. 2 is a configuration diagram of a data distribution intermediary system that monitors data usage and includes a TWP (time window processor unit) that collects data based on a time window.
- FIG. 3 is a sequence diagram showing the process of monitoring data usage.
- FIG. 4 is a flowchart showing a process for evaluating reliability.
- FIG. 5 is a flowchart illustrating a process of collecting data based on a time window value specified by a service provider.
- FIG. 6 is a flowchart showing the process of creating metadata for data transfer.
- FIG. 7 shows an example of monitored distribution data.
- FIG. 8 shows an example structure of a metadata model configured to record data usage.
- FIG. 9 is a flowchart showing metadata updating when creating a service.
- FIG. 10 is a flowchart showing the determination result for the intended service.
- FIG. 11 is a storage table showing the intended data usage for each service provider.
- FIG. 12 is a flowchart for making a flow control determination when accessing a service generated by a service provider.
- FIG. 13 is a configuration diagram of a data distribution intermediary system according to the second embodiment.
- FIG. 14 is a flowchart showing the process of collecting data depending on the data owner.
- FIG. 15 is a flowchart showing the determination result regarding emergency services according to the third embodiment.
- FIG. 16 is a table that stores data usage in an emergency by each service provider.
Embodiments of the present invention will be described below based on the drawings.
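
As a rough illustration of the monitoring method outlined in the background above, the following Python sketch compares the services recorded in a metadata record against the services the data owner agreed to in advance. The record layout, the function `check_usage`, the `agreed` table, and the service names are hypothetical; the text states only that the metadata model has "service" and "owner" items and that owners are alerted on unintended use.

```python
def check_usage(metadata, agreed_services):
    """Return one alert message per recorded service that the data owner
    did not agree to in advance (an empty list means no unintended use)."""
    owner = metadata["owner"]
    allowed = agreed_services.get(owner, set())
    return [
        f"alert {owner}: unintended service '{service}'"
        for service in metadata["service"]
        if service not in allowed
    ]


# Hypothetical agreement table: owner -> services agreed to in advance.
agreed = {"owner-1": {"health-monitoring"}}

# The metadata record is updated each time a service is created from the data.
meta = {"owner": "owner-1", "service": []}
meta["service"].append("health-monitoring")      # agreed use
meta["service"].append("targeted-advertising")   # not agreed

alerts = check_usage(meta, agreed)
for message in alerts:
    print(message)
```

Here only the second service triggers an alert, because it does not appear among the services the owner agreed to.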