Search

CN-122019478-A - Automatic file single-sleeve-making and winding method and system based on template matching

CN122019478ACN 122019478 ACN122019478 ACN 122019478ACN-122019478-A

Abstract

The invention discloses a template matching-based archive single-layer automatic assembly method and system, wherein the method comprises the steps of constructing an archive category template system, including a preset data organization form and a hierarchical directory type hierarchical file organization structure, defining the association of categories and item levels, and defining the path and node configuration rules of each level of a hierarchical directory; according to the template classification requirement, organizing the electronic archive files by taking the items as units, intensively storing the electronic archive files in a corresponding item node container, standardizing the storage path and the folder names, finishing classification filing pretreatment, displaying the files to be assembled by using a directory tree according to the category items, selecting target categories and item nodes, generating temporary group files, distributing identity information, finishing formal group files according to the preset requirement, distributing unique file identity information and recording the unique file identity information in an attribute table, collecting the attribute information of the formal group files, and supporting manual single entry or file number related batch import to form complete electronic archive group file data. The file management efficiency and standardization are improved.

Inventors

  • CHEN FEI
  • Zong Qiangwen
  • WANG YUNFENG
  • ZHANG YUNFENG
  • WANG JIE
  • Gao Shiduan
  • WANG JIAN
  • JIANG YANG

Assignees

  • 三峡高科信息技术有限责任公司

Dates

Publication Date
20260512
Application Date
20251225

Claims (10)

  1. 1. The automatic file sheet sleeving and winding method based on template matching is characterized by comprising the following steps of: S1, constructing an archive category template system which comprises a preset data organization form and a hierarchical file organization structure, wherein the hierarchical file organization structure is presented in a hierarchical directory form and defines the hierarchical association relation between categories and items; S2, organizing the electronic archive files by taking the project as a unit according to the classification requirement of the archive category template, enabling the electronic archive files of the same project to be stored in a file storage container of a corresponding project node in a centralized manner, and meanwhile, standardizing the storage path and the folder name of the electronic archive files according to the file organization structure specified by the template to finish classification filing pretreatment of the electronic archive files; S3, displaying the electronic archive files to be assembled in a directory tree mode according to category sub-items, selecting target archive categories and corresponding item nodes, screening the electronic archive files to be assembled under the nodes to generate temporary assembly and distributing temporary assembly identification information; S4, aiming at the files and the cases after the formal files are assembled, corresponding attribute information is collected for writing, and the writing information is supported to be recorded and stored in a manual single recording mode or a batch importing mode based on file number association, so that complete electronic file assembly data are formed.
  2. 2. The method for automatically assembling and rolling file list based on template matching according to claim 1, wherein the file type templates in S1 are configured according to file management actual requirements and support parallel creation of multiple templates, the templates corresponding to each type of files can be repeatedly called in the follow-up implementation process after the configuration is completed, and the file types are presented in a hierarchical directory form after the configuration is completed.
  3. 3. The automatic file grouping method based on template matching according to claim 1, wherein the preset data organization form in S1 includes a preset for at least one of file classification number, class name, parent class classification number, final class, file organization structure, file attribute information, and case attribute information, and the file organization structure is a tree structure.
  4. 4. The automatic file list preparation and winding method based on template matching according to claim 1, wherein the node configuration rule in S1 includes the identification mode of each level node and the storage specification of the file storage carrier corresponding to the node; wherein, the identification of each level node in the node configuration rule cooperates with the validity Based on topological ordering and matrix operation construction, the core realizes quantization through the product operation of the hierarchical dependency matrix and the identification compatible matrix, and the specific calculation process is as follows: First define Is a hierarchical dependency matrix with dimensions of , For the total number of layers of the hierarchical directory, matrix elements Hierarchical association logic assignment of category-item-sub-items in a hierarchical file organization structure according to an archive category template, if Hierarchical node and the first If the hierarchy nodes have direct dependency relationships, then Otherwise ; Redefinition of To identify compatible matrices, the dimensions are the same Matrix elements Presetting core information assignment in a data organization form according to the archive category template, if the core information assignment is the first Identification element of hierarchical node and the first The identification elements of the hierarchical nodes are collision-free and pass mutual authentication, then Otherwise ; Finally, through matrix multiplication operation Obtaining an identification synergistic effectiveness matrix, wherein elements in the matrix Represent the first Hierarchy and the first The degree of co-validity of the hierarchy node identification, The value range is To the point of The larger the value is, the stronger the dependence suitability and compatibility consistency of the two-level identifiers are, and the quantitative characterization of closed loop verification between node identifiers of different levels is completed through the calculation; Meanwhile, the storage dynamic adaptation degree of the file storage carrier corresponding to the node Based on Markov chain model construction, the core solves steady-state distribution through a state transition probability matrix and completes adaptation degree calculation, and the specific process is as follows: Firstly, dividing the storage state type and setting Defining a state type number for file storage Is that Dimensional state transition probability matrix, matrix element Representing the storage carrier from the first The seed state transitions to the first The probability of the seed state is calculated by counting the historical data of the single sleeve group volume of the archive, namely, the transition frequency duty ratio between any two states is calculated Wherein For slave state in historical data Transition to State Is a total frequency of (2); re-solving the steady state distribution vector of the Markov chain Dimension is Satisfies the following conditions And is also provided with Vector elements Indicating that the storage carrier is in the first place after long-term operation Probability of seed status; Finally by matrix multiplication Obtaining a stored dynamic adaptation degree vector, wherein elements in the vector Indicating that the storage carrier is maintained at steady state The adaptation capability of the seed status, the load demand threshold of the whole flow business of the file single sleeve system volume are combined, The closer the threshold interval is, the better the dynamic adaptability of the storage configuration is, and the quantitative pre-judgment of the full life cycle state change trend of the storage carrier is completed.
  5. 5. The automatic archival single system and scroll method based on template matching according to claim 1, wherein the process of classifying, archiving and preprocessing the electronic archive file in S2 is as follows: Normalization validity of electronic archive classification archiving preprocessing Path matching and set equivalence determination construction based on graph theory, path isomorphism through hierarchy Double-shot degree with project file collection Is characterized by complex operations, i.e ; Hierarchical path isomorphism The core calculation process of (1) comprises firstly converting hierarchical file organization structure defined by archive class template into directed acyclic graph The node in the graph corresponds to each level of catalogue, the directed edge corresponds to the hierarchical dependency relationship among the catalogues, and the actual storage path of the electronic archive file is converted into the directed acyclic graph Defining nodes and directed edges according to the same rule, traversing by graph isomorphism judging algorithm And (3) with Counting the sum of the number of nodes and edges which are completely consistent in structure in two graphs Statistics of Total number of intermediate nodes and edges By operation of Isomorphism quantification of path structure and template hierarchy is accomplished, When the value is 1, the actual path and the template path are completely isomorphic, and when the value is lower than 1, the path level missing or redundancy problem is indicated; Project file collection double-shot degree The core calculation process of (1) is as follows, constructing standard set of single item should gather file according to classification requirement of archive category template Constructing a set of actual files in the node storage container corresponding to the item Counting by collecting bijective judgment And also belong to Number of files of (a) Statistics of And (3) with Number of union elements of (2) By operation of Completing the bijective quantification of the project file collection, When the value is 1, the actual collection file and the standard file collection are completely double-shot, the accurate collection without omission and redundancy is realized, and when the value is lower than 1, the problem of file missing collection or error collection is indicated.
  6. 6. The method for automatically assembling files based on template matching according to claim 1, wherein S3 further comprises checking the file integrity of the temporary assembled files, determining the file range directly in units of items if the file is complete, and pre-assembling files and recording the pre-assembled file information in units of items if the file is incomplete, wherein the method comprises the following steps: constructing a temporary group volume file integrity check model based on a project file list preset by an archive category template Actual file collection by temporary group volumes And preset file set The symmetry difference operation of (2) completes the integrity determination, i.e The core calculation process is as follows; Firstly, extracting all file information corresponding to the item in an archive category template to construct a preset file set The elements in the set are unique identification information of each file; extracting all file information in the temporary group volume to construct an actual file set Element in collection adopts AND And then executing a symmetric difference operation, wherein the core logic of the symmetric difference operation is that Wherein For a set of redundant files that exist within the temporary group volume but are not recorded in the preset file list, A file set which exists in a preset file list but is missing in the temporary group volume is obtained; When the operation result When the temporary group file is empty, judging that the temporary group file is complete, directly locking all files corresponding to the item as the file range of the formal group file, synchronously associating the temporary group file identification information to the follow-up formal group file flow, and when the operation result is obtained When the set is not empty, judging that the temporary group volume file is incomplete, and extracting The method comprises the steps of establishing a pre-standing volume, establishing a corresponding project node association map of the pre-standing volume, forming a pre-standing volume tracking account, directly updating a temporary group volume based on the pre-standing volume information after the missing file is completed, restarting an integrity checking flow, and entering a formal group volume link after checking.
  7. 7. The automatic archival single system grouping method based on template matching according to claim 1, wherein the preset grouping requirement in S3 includes grouping at least one of a file storage container, the same file name and a single archive file as a basis.
  8. 8. The automatic archival sheet composition and winding method based on template matching according to claim 1, wherein the attribute information in S4 at least includes one of contract number, contract name, project category, engineering code, file format, and data size.
  9. 9. Automatic archival list system of making group of package system based on template matching, characterized by comprising: The file category template system construction unit is used for constructing a file category template system and comprises a preset data organization form and a hierarchical file organization structure, wherein the hierarchical file organization structure is presented in a hierarchical directory form and defines the hierarchical association relationship between categories and items; The electronic archive classification and archiving preprocessing unit is used for organizing the electronic archive files by taking an item as a unit according to the classification requirement of an archive class template, so that the electronic archive files of the same item are intensively stored in the file storage containers of the corresponding item nodes, and meanwhile, the storage paths and the folder names of the electronic archive files are normalized and arranged according to the file organization structure specified by the template, so that the classification and archiving preprocessing of the electronic archive files is completed; The system comprises a temporary file generation unit, a file grouping attribute table and a file grouping attribute table, wherein the temporary file generation unit is used for generating a temporary file and completing a file grouping unit, displaying an electronic file to be grouped according to category and category items in a directory tree form, selecting a target file category and a corresponding item node, screening the electronic file to be grouped under the node to generate a temporary file and distributing temporary file grouping identification information; The recording unit is used for recording the corresponding attribute information of the files and the files after the formal assembly, and supporting the recording and the storage of the recorded information by manual single recording or batch importing mode based on file number association to form complete electronic file assembly data.
  10. 10. A computer readable storage medium having stored thereon a computer program, characterized in that the computer program is executed by a processor to perform a machine learning modeling method of dynamic prediction of the progress of a building project according to any of claims 1-8.

Description

Automatic file single-sleeve-making and winding method and system based on template matching Technical Field The invention belongs to the technical field of archives preparation and winding, and particularly relates to an automatic archives sheet sleeving and winding method and system based on template matching. Background In the process of converting electronic file management into digital conversion, a file single sleeve management mode is widely applied, and the core aim is to realize the whole electronic management of the electronic file by canceling the synchronous filing of paper files, reduce the file management cost and improve the file utilization efficiency. However, the current file group rolling work still faces a plurality of real pain points, and restricts the floor effect of a single set of management modes. Traditional archives are assembled and rolled up and rely on manual operation more, and the standard lacks unified standard of group, and different managers exist the difference to the understanding of archives classification, lead to the storage route confusion of electronic archives, and the file naming rule is not unified, and the archives dispersion storage of same project, follow-up inquiry and call the degree of difficulty great. Meanwhile, the file integrity is difficult to accurately control in the manual file assembling process, the condition of file omission or redundancy is easy to occur, the formed files are uneven in quality, and the standardized requirements of file management cannot be met. In part of the existing winding methods, although a digital tool is introduced, a systematic template system is not constructed, only a simple file collection function is realized, the hierarchical association of file categories and projects cannot be realized, a large amount of manual intervention is still required in the winding process, and the automation degree is low. In addition, the file writing link lacks an efficient data input mode, manual single input efficiency is low, batch import also has the problem of inaccurate data association, so that writing information is incomplete, and the follow-up management and utilization of files are affected. The above problems not only increase the workload of file manager and reduce the efficiency of the group volume, but also affect the normalization and traceability of the electronic files, and the advantages of a single set of management mode can not be fully exerted. Therefore, a standardized and automatic file grouping method is needed to solve the pain problem in the current grouping process, and promote the standardized development of electronic file list manufacturing management. Disclosure of Invention The invention aims to solve the problems of non-uniform standard, low automation degree, disordered file collection, poor writing efficiency and the like in the traditional file single-sleeve assembly process, and realizes the standardization and automation of electronic file assembly and improves the file management efficiency and standardization by constructing a file category template system and standardizing the classified filing pretreatment, assembly and writing processes. In view of the above-mentioned drawbacks or improvements of the prior art, as a first aspect of the present invention, the present invention provides a method for automatically assembling and winding archives on the basis of template matching, including: S1, constructing an archive category template system which comprises a preset data organization form and a hierarchical file organization structure, wherein the hierarchical file organization structure is presented in a hierarchical directory form and defines the hierarchical association relation between categories and items; S2, organizing the electronic archive files by taking the project as a unit according to the classification requirement of the archive category template, enabling the electronic archive files of the same project to be stored in a file storage container of a corresponding project node in a centralized manner, and meanwhile, standardizing the storage path and the folder name of the electronic archive files according to the file organization structure specified by the template to finish classification filing pretreatment of the electronic archive files; S3, displaying the electronic archive files to be assembled in a directory tree mode according to category sub-items, selecting target archive categories and corresponding item nodes, screening the electronic archive files to be assembled under the nodes to generate temporary assembly and distributing temporary assembly identification information; S4, aiming at the files and the cases after the formal files are assembled, corresponding attribute information is collected for writing, and the writing information is supported to be recorded and stored in a manual single recording mode or a batch importing mode based on file number association,