Search

CN-116881250-B - Data index identification method, device, equipment and storage medium

CN116881250BCN 116881250 BCN116881250 BCN 116881250BCN-116881250-B

Abstract

The application relates to a data index identification method, a device, equipment and a storage medium, and relates to the field of data processing. The method comprises the steps of analyzing and processing each data field in a plurality of data fields based on historical data to determine at least one first data field, analyzing and processing each data field based on a preset dimension matching rule to determine at least one second data field, analyzing and processing each data field based on a preset measurement matching rule to determine at least one third data field, and determining the type of each data field based on the at least one first data field, the at least one second data field and the at least one third data field. Therefore, the dimension and the measurement of the data index can be automatically identified based on the historical data, the preset dimension matching rule and the preset measurement matching rule, and the technical problems that the dimension and the measurement of the manual identification data index consume more manpower and have low efficiency are solved.

Inventors

  • WANG BIN

Assignees

  • 重庆长安汽车股份有限公司

Dates

Publication Date
20260508
Application Date
20230628

Claims (10)

  1. 1. A method for identifying a data index, comprising: Analyzing each data field in a plurality of data fields included in a target business detail data table based on historical data, determining at least one first data field from the plurality of data fields, wherein the historical data comprises a plurality of data fields with common dimensions, and the at least one first data field is a data field with common dimensions, and the common dimensions comprise at least one of a time dimension, an organization dimension and a product dimension; Analyzing each data field in the plurality of data fields based on a preset dimension matching rule, and determining at least one second data field from the plurality of data fields, wherein the at least one second data field is a data field with a private dimension, and the private dimension comprises at least one of an order state, an order type and an access channel; analyzing each data field in the plurality of data fields based on a preset measurement matching rule, and determining at least one third data field from the plurality of data fields, wherein the at least one third data field is a data field of a preset type, and the preset type comprises at least one of a numerical value type and a time type; And sending the at least one first data field, the at least one second data field and the at least one third data field to a data index definition module, so that when the data index definition module receives the at least one first data field, the at least one second data field and the at least one third data field, a user is assisted in determining the type of each data field in the plurality of data fields through a guided interactive interface to finish data index definition.
  2. 2. The method of claim 1, wherein the analyzing each of a plurality of data fields included in the target service detail data table based on the history data, and determining at least one first data field from the plurality of data fields, comprises: Comparing any one of a plurality of data fields included in the target service detail data table with each data field of each common dimension included in the historical data one by one; And under the condition that any one of the data fields of the plurality of common dimensions included in the historical data exists, determining the any one of the data fields as a first data field, and marking the any one of the data fields with the label of the common dimension.
  3. 3. The method of claim 1 or 2, wherein the preset dimension matching rules comprise a plurality of dimension matching rules, each of the plurality of dimension matching rules corresponding to a different priority; the analyzing, based on a preset dimension matching rule, each data field in the plurality of data fields, and determining at least one second data field from the plurality of data fields, including: And based on the priority corresponding to each dimension matching rule in the plurality of dimension matching rules, sequentially analyzing and processing each data field in the plurality of data fields included in the target business detail data table according to the order of the priority from high to low, and determining the at least one second data field from the plurality of data fields.
  4. 4. The method of claim 1 or 2, wherein the preset metric matching rules comprise a plurality of metric matching rules, each metric matching rule of the plurality of metric matching rules corresponding to a different priority; The analyzing each data field of the plurality of data fields based on the preset metric matching rule, and determining at least one third data field from the plurality of data fields includes: And based on the priority corresponding to each metric matching rule in the plurality of metric matching rules, sequentially analyzing and processing each data field in the plurality of data fields included in the target service detail data table according to the order of the priority from high to low, and determining the at least one third data field from the plurality of data fields.
  5. 5. A data index recognition device, characterized in that the data index recognition device comprises a determination module; The determining module is used for analyzing and processing each data field in a plurality of data fields included in the target business detail data table based on historical data, determining at least one first data field from the plurality of data fields, wherein the historical data comprises a plurality of data fields with common dimensions, and the at least one first data field is a data field with common dimensions, and the common dimensions comprise at least one of a time dimension, an organization dimension and a product dimension; The determining module is further configured to analyze each data field of the plurality of data fields based on a preset dimension matching rule, determine at least one second data field from the plurality of data fields, where the at least one second data field is a data field of a private dimension, and the private dimension includes at least one of an order status, an order type, and an access channel; the determining module is further configured to analyze each data field of the plurality of data fields based on a preset metric matching rule, determine at least one third data field from the plurality of data fields, where the at least one third data field is a data field of a preset type, and the preset type includes at least one of a numerical value type and a time type; The determining module is further configured to send the at least one first data field, the at least one second data field, and the at least one third data field to a data index definition module, so that when the data index definition module receives the at least one first data field, the at least one second data field, and the at least one third data field, a user is assisted in determining a type of each of the plurality of data fields through a guided interactive interface, so as to complete data index definition.
  6. 6. The data index identification device of claim 5, further comprising a processing module; the processing module is configured to compare, for any one of a plurality of data fields included in the target service detail data table, the any one data field with each data field of a common dimension included in the history data one by one; The determining module is further configured to determine, when it is determined that any one of the data fields of the plurality of common dimensions included in the history data exists, that the any one of the data fields is a first data field; The processing module is further configured to tag the arbitrary data field with the tag of the common dimension.
  7. 7. The data index recognition device according to claim 5 or 6, wherein the preset dimension matching rules include a plurality of dimension matching rules, each of the plurality of dimension matching rules corresponding to a different priority; The determining module is further configured to sequentially analyze each data field in the plurality of data fields included in the target service detail data table according to a priority corresponding to each dimension matching rule in the plurality of dimension matching rules, and determine the at least one second data field from the plurality of data fields according to a sequence from high priority to low priority.
  8. 8. The data index recognition device according to claim 5 or 6, wherein the preset metric matching rules comprise a plurality of metric matching rules, each metric matching rule of the plurality of metric matching rules corresponding to a different priority; The determining module is further configured to sequentially analyze each data field of the plurality of data fields included in the target service detail data table according to a priority corresponding to each metric matching rule in the plurality of metric matching rules, and determine the at least one third data field from the plurality of data fields according to a sequence from high priority to low priority.
  9. 9. An electronic device is characterized by comprising a processor; a memory for storing the processor-executable instructions; Wherein the processor is configured to execute the instructions to implement the method of any one of claims 1 to 4.
  10. 10. A computer readable storage medium, characterized in that, when computer-executable instructions stored in the computer readable storage medium are executed by a processor of an electronic device, the electronic device is capable of performing the method of any one of claims 1 to 4.

Description

Data index identification method, device, equipment and storage medium Technical Field The present invention relates to the field of data processing technologies, and in particular, to a method, an apparatus, a device, and a storage medium for identifying a data index. Background In the big data age, data processing has become an important foundation for enterprise decision and management, and it is important to correctly define data indexes for measuring the development of various services (such as marketing service, manufacturing service, financial service, etc.). The dimension of the data index (such as time, geographic location, product type, organization, etc.) is an attribute index for describing the category to which the data belongs, and the measure of the data index (such as sales, access, conversion, etc.) is a numerical index for measuring the value of a particular attribute of the data. The dimensions and metrics serve as key elements of the data metrics that can help users understand and analyze the data to make informed decisions. The dimensions of the data index and the metrics of the data index may be manually identified, such that the definition of the data index is based on the dimensions of the data index and the metrics of the data index. However, in the above method, with the rapid growth of service data, the rapid adjustment of the service mode and the association relationship between service data are more and more complex, and the dimension of the data index and the measurement of the data index need to be manually identified by manpower, which requires more manpower, has low efficiency and high cost, so that the identification efficiency of the data index is poor. Disclosure of Invention The application aims to provide a data index identification method, a device, equipment and a storage medium, which are used for solving the technical problems that the dimension of a data index and the measurement of the data index are manually identified by manpower, the efficiency is low and the cost is high. The technical scheme of the application is as follows: according to a first aspect of the application, a data index identification method is provided, which comprises the steps of analyzing each data field in a plurality of data fields included in a target service detail data table based on historical data, determining at least one first data field from the plurality of data fields, wherein the historical data comprises a plurality of data fields with common dimensions, the at least one first data field is a data field with common dimensions, the common dimensions comprise at least one of a time dimension, an organization dimension and a product dimension, analyzing each data field in the plurality of data fields based on a preset dimension matching rule, determining at least one second data field from the plurality of data fields, the at least one second data field is a data field with private dimensions, the private dimensions comprise at least one of an order state, an order type and an access channel, analyzing each data field in the plurality of data fields based on a preset measurement matching rule, determining at least one third data field from the plurality of data fields, wherein the at least one third data field is a data field with a preset type, and the preset type comprises at least one data field with the at least one data field, the at least one first data field and the at least one data field with the at least one data field is determined based on the data field with the data type. According to the technical means, the method and the device can determine the first data field with the public dimension from the plurality of data fields included in the target business detail data table based on the historical data, determine the second data field with the private dimension from the plurality of data fields based on the preset dimension matching rule and determine the third data field with the preset type from the plurality of data fields based on the preset measurement matching rule, namely, determine the data field with the data index with the public dimension, the data field with the data index with the private dimension and the data field with the data index as the measurement from the plurality of data fields included in the target business detail data table, solve the problems of low efficiency and high cost when the data indexes of the data fields in the business detail data table are manually identified in the prior art, and further improve the efficiency of identifying the data indexes. In one possible implementation, each data field of the plurality of data fields included in the target service detail data table is analyzed and processed based on the historical data, at least one first data field is determined from the plurality of data fields, the method comprises the steps of comparing any data field one by one with each data field of the common dimension included in