
CN-121998444-A - Standard development contribution degree analysis method and system based on standard bibliographic database


Abstract

The invention provides a standard development contribution degree analysis method and system based on a standard bibliographic database, relating to the field of aviation standard analysis. The method comprises the following steps: S1, constructing a database; S2, performing multidimensional cleaning of the acquired raw standard bibliographic data and completing missing standard labels; S3, quantitatively evaluating the contribution of each region and each drafting unit to standard development activities; and S4, visually presenting the statistical analysis results. The system comprises a database generation module, a data cleaning module, a custom tag creation module, a standard information retrieval module, a standard development contribution analysis module and a result demonstration module. The method automatically completes missing ICS/CCS classification labels using a conditional probability model and a semantic similarity matching algorithm, which addresses the limited statistical dimensions caused by missing labels in conventional approaches and improves the completeness and accuracy of the analysis.
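
The label completion by conditional probability mentioned above can be illustrated with a minimal sketch. It assumes that records are simple (ICS, CCS) pairs, that only complete records are used to estimate P(CCS | ICS), and that the most probable CCS number is used as the completed value. The function names, the record layout and the sample classification numbers are hypothetical and do not come from the patent text.

```python
from collections import Counter

def build_ics_to_ccs(records):
    """Estimate P(CCS | ICS) from complete records; keep the most probable CCS per ICS."""
    pair_counts = Counter()  # N(ICS, CCS): co-occurrence counts
    ics_counts = Counter()   # N(ICS): occurrences of each ICS number in complete records
    for ics, ccs in records:
        if ics and ccs:      # only complete records contribute to the statistics
            pair_counts[(ics, ccs)] += 1
            ics_counts[ics] += 1
    best = {}
    for (ics, ccs), n in pair_counts.items():
        p = n / ics_counts[ics]
        if ics not in best or p > best[ics][1]:
            best[ics] = (ccs, p)
    return best

def fill_missing_ccs(records, mapping):
    """Fill a missing CCS number from the mapping; leave it empty if no mapping exists."""
    return [(ics, ccs or mapping.get(ics, (None, 0.0))[0]) for ics, ccs in records]

# Illustrative records only (assumed values, not taken from the patent).
records = [
    ("49.020", "V04"),
    ("49.020", "V04"),
    ("49.020", "V10"),
    ("49.020", None),   # CCS missing, ICS present
    ("03.120", None),   # no complete record exists for this ICS number
]
mapping = build_ics_to_ccs(records)
filled = fill_missing_ccs(records, mapping)
```

In this sketch a record whose ICS number never appears in a complete record stays unfilled; that is the case the semantic similarity matching is described as handling.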

Inventors

  • LI SHANG
  • ZHANG GUANGDA
  • DONG SHIMING
  • FU TIAN

Assignees

  • 中国航空综合技术研究所

Dates

Publication Date
20260508
Application Date
20251125

Claims (9)

  1. A standard development contribution degree analysis method based on a standard bibliographic database, characterized by comprising the following steps:
     S1, constructing a database, wherein the database comprises a standard bibliographic sub-database, a drafting unit information sub-database, a region code sub-database and a label mapping sub-database;
     S2, cleaning the raw standard bibliographic data in the standard bibliographic sub-database and completing the standard labels;
     S3, analyzing the contribution degree of each region and each drafting unit in standard development based on the completed standard bibliographic sub-database, specifically comprising:
     S31, quantifying the contribution degree with a position weight attenuation model by calculating, for the k-th drafting unit of a standard t within a given range of standards, a contribution weight $w_{t,k}$ [formula not reproduced in the source text], where the parameters of the formula are the reference weight coefficient of the lead drafting unit, the position attenuation factor, and the sequence number $k$ of each drafting unit, the first unit ($k = 1$) being the lead drafting unit;
     S32, calculating the standard contribution index of a drafting unit within the given range of standards as the sum of the contribution weights of that drafting unit over the standards it helped draft: $C_u = \sum_{t=1}^{n} w_{t,k_u(t)}$, where $k_u(t)$ is the position of drafting unit $u$ in standard $t$ and $n$ is the number of standards within the given range that the drafting unit participated in drafting;
     S33, calculating the standard development contribution value of a region within the given range of standards as the sum of the standard contribution indices of all drafting units located within the administrative division of that region: $R_r = \sum_{u \in U_r} C_u$, where $U_r$ is the set of drafting units belonging to region $r$;
     and S4, visually presenting the obtained standard contribution indices of the drafting units and the regional standard development contribution values.
  2. The standard development contribution degree analysis method based on a standard bibliographic database according to claim 1, wherein completing the standard labels in step S2 specifically comprises the following sub-steps:
     S21, performing statistical analysis on all complete standard records in the standard bibliographic sub-database;
     S22, when the CCS classification number is missing and the ICS classification number is present in the standard bibliographic sub-database, establishing a one-to-one mapping between ICS and CCS classification numbers with a conditional probability model, and calculating the probability value $P(\mathrm{CCS}_j \mid \mathrm{ICS}_i) = \dfrac{N(\mathrm{ICS}_i, \mathrm{CCS}_j)}{N(\mathrm{ICS}_i)}$, where $N(\mathrm{ICS}_i, \mathrm{CCS}_j)$ is the number of times the ICS classification number and the CCS classification number co-occur and $N(\mathrm{ICS}_i)$ is the total number of occurrences of the ICS classification number alone; querying the probability values for the target ICS classification number and taking the CCS classification number with the highest probability as the completed value: $\mathrm{CCS}^{*} = \arg\max_{j} P(\mathrm{CCS}_j \mid \mathrm{ICS}_i)$;
     S23, when the ICS classification number is missing in the standard bibliographic sub-database, or the ICS classification number corresponds to several CCS classification numbers, introducing a semantic similarity matching algorithm: extracting the standard description texts of the ICS and CCS classification numbers, performing sentence and word segmentation on the texts with a natural language processing tool, generating text feature vectors $\mathbf{v}_{\mathrm{ICS}}$ and $\mathbf{v}_{\mathrm{CCS}}$ by TF-IDF vectorization, and calculating their cosine similarity: $\cos\theta = \dfrac{\mathbf{v}_{\mathrm{ICS}} \cdot \mathbf{v}_{\mathrm{CCS}}}{\lVert \mathbf{v}_{\mathrm{ICS}} \rVert \, \lVert \mathbf{v}_{\mathrm{CCS}} \rVert}$; when the calculated cosine similarity is higher than a preset threshold, establishing a one-to-one mapping between the corresponding ICS and CCS classification numbers and completing the ICS and CCS classification numbers accordingly.
  3. The standard development contribution degree analysis method based on a standard bibliographic database according to claim 1, wherein in step S33 the registered address field of each drafting unit is retrieved from the drafting unit information sub-database, semantic matching is performed against the region code sub-database, and a mapping table between drafting units and administrative division codes is generated.
  4. The standard development contribution degree analysis method based on a standard bibliographic database according to claim 2, wherein the probability value obtained in step S22 and the one-to-one mapping between ICS and CCS classification numbers obtained in step S23 are stored in a relation table of the label mapping sub-database.
  5. A standard development contribution degree analysis system based on a standard bibliographic database, for implementing the standard development contribution degree analysis method according to claim 1, characterized by comprising a database generation module, a data cleaning module, a custom tag creation module, a standard information retrieval module, a standard development contribution analysis module and a result demonstration module;
     the database generation module is used for generating a database comprising a standard bibliographic sub-database, a drafting unit information sub-database, a region code sub-database and a label mapping sub-database;
     the data cleaning module is used for performing multidimensional cleaning of the acquired raw standard bibliographic data, and comprises a standard label completion sub-module, a drafting unit cleaning sub-module and a region code cleaning sub-module;
     the custom tag creation module is used for selecting relevant standards from the search results on the standard information retrieval page and adding tags to them;
     the standard information retrieval module is used for retrieval based on standard bibliographic information, wherein the standard bibliographic information comprises the standard number, standard name, standard status, responsible unit, drafting unit, release date, implementation date, ICS classification, CCS classification and custom tags;
     the standard development contribution analysis module is based on a multidimensional data fusion architecture, is used for quantitatively evaluating the contribution of each region and each drafting unit to standard development, and comprises a standard quantity statistics sub-module and a standard contribution index statistics sub-module;
     the result demonstration module is used for visually presenting the obtained standard contribution indices of the drafting units and the regional standard development contribution values.
  6. The standard development contribution degree analysis system based on a standard bibliographic database according to claim 5, wherein the standard label completion sub-module constructs a one-to-one mapping between ICS and CCS classification numbers from the existing ICS and CCS classification number data of national and industry standards, stores the mapping in the label mapping sub-database, and automatically completes standard records that lack an ICS or CCS classification number based on the mapping.
  7. The standard development contribution degree analysis system based on a standard bibliographic database according to claim 5, wherein the drafting unit cleaning sub-module is used for establishing a three-level mapping rule library for standard drafting units, normalizing drafting unit names and associating them with credit codes, so as to eliminate ambiguity in drafting unit naming; and the region code cleaning sub-module converts textual region descriptions into 6-digit standard numeric codes according to the national administrative division coding standard.
  8. The standard development contribution degree analysis system based on a standard bibliographic database according to claim 5, wherein the standard quantity statistics sub-module constructs a multidimensional matrix of participation in drafting national and industry standards from the standard release time, drafting unit registration place and entity name fields in the standard bibliographic sub-database and the drafting unit information sub-database, and is used for counting participation in drafting national and industry standards by year, region, industry field and drafting unit.
  9. 9. The standard development contribution degree analysis system based on the standard bibliographic database of claim 5, wherein the standard contribution index statistics sub-module is used for calculating standard contribution indexes of the grass units and the regional standard development contribution values according to the sequence of the standard grass units in the standard bibliographic sub-database based on a position weight attenuation algorithm.

Description

Standard development contribution degree analysis method and system based on standard bibliographic database

Technical Field

The invention relates to the field of standard analysis, in particular to a standard development contribution degree analysis method and system based on a standard bibliographic database.

Background

Against the background of globalization and the coordinated development of standardization work, standard development has become a core link supporting industrial upgrading, technological innovation and market regulation, and its participants include enterprises, research institutions, universities, industry associations and other organizations and individuals. During standard development, the contributions of the participants to technical development, opinion collection, test verification, text drafting and other stages directly affect the quality, applicability and adoption of the standard. Accurately identifying and quantitatively analyzing the participants' contributions is therefore not only key to ensuring fairness in standard development and motivating participation, but also an important basis for standardization management departments in allocating resources, evaluating results and building credit systems.

Currently, standard development contribution analysis relies mainly on manual statistics or simple data aggregation, and has clear technical limitations:

First, the data foundation is scattered and inconsistent, and lacks the support of a standardized bibliographic database.
The data related to existing standards (such as drafting units, release dates, implementation dates and classification numbers) are scattered across the internal systems, documents or offline records of individual units. The data formats are inconsistent and the data are weakly linked, which makes it difficult to integrate and reuse data at scale across standards, domains and time periods.

Second, contribution analysis is inefficient and error-prone. Manual statistics require substantial labor to trace participation records and calculate contribution shares. This is inefficient, the results are easily distorted by subjective judgment and data omissions, and the approach cannot meet the need for rapid analysis of large numbers of standard development projects.

Third, the evaluation dimensions are limited and lack comprehensiveness and objectivity. When the number of standards and the contribution indices are compared by the field to which a standard belongs, the analysis can rely only on existing ICS and CCS classification numbers. On the one hand, some standards lack ICS and CCS classification numbers, which affects the accuracy of the analysis results; on the other hand, the ICS and CCS classification systems cannot cover users' fine-grained analysis needs, so the analysis results are of limited use.

Finally, the analysis results lack visualization and traceability. The prior art cannot produce a structured contribution analysis report and makes it difficult to trace the source and calculation of contribution data, so the results can hardly support standardization management decisions or dispute resolution.
In summary, the prior art cannot solve the core problems of scattered data, low efficiency, limited dimensions and poor traceability in standard development contribution analysis. A systematic, intelligent analysis scheme built on a unified standard bibliographic database is needed to integrate, quantify, evaluate along multiple dimensions and visually present contribution data, thereby providing technical support for the scientific management of standardization work.

Disclosure of Invention

To overcome the defects of the prior art, the invention aims to provide a standard development contribution degree analysis method and system based on a standard bibliographic database, which can integrate, quantify, evaluate along multiple dimensions and visually present contribution data, and provide technical support for the scientific management of standardization work. Missing ICS/CCS classification labels are completed automatically by a conditional probability model and a semantic similarity matching algorithm, which addresses the limited statistical dimensions caused by missing labels in conventional approaches and improves the completeness and accuracy of the analysis.

In one aspect, the invention provides a standard development contribution degree analysis method based on a standard bibliographic database, which comprises the following steps: S1, constructing a database, wherein the database comprises a standard bibliographic sub-database, a