CN-121981675-A - Engineering electronic archive resource integrated management method and system
Abstract
The invention relates to the technical field of resource management, in particular to an integrated management method and system for engineering electronic archive resources. The method comprises: acquiring engineering electronic archives, detecting timestamps and geographic coordinates, and collecting a unique identifier to generate an archive identification code; analyzing the matching degree between PDF page-number labels and XML node paths, and optimizing the offset calculation to form coordinate mapping parameters; performing spatial positioning with the mapping parameters to generate a three-dimensional positioning matrix; analyzing the hierarchical depth deviation of XML nodes and correcting the node weights; and calculating the similarity between the weights and dynamically adjusting the cache to form an integrated electronic archive management system.
Inventors
- CHEN YAN
- GUO LINGJIAN
- FENG BINBIN
- ZHANG LUSHA
- ZHANG ZESHENG
- YI YUJIE
Assignees
- 绍兴市柯诸高速公路有限公司
- 微柏(杭州)软件科技有限公司
Dates
- Publication Date
- 20260505
- Application Date
- 20260121
Claims (10)
- 1. An integrated management method for engineering electronic archive resources, characterized by comprising the following steps: S1, acquiring electronic archive data of an engineering project, detecting the archive timestamp and geographic coordinate parameters, collecting the unique identification field of the project, and inputting it into a hash mapping algorithm for index coding to generate an archive identification code; S2, based on the archive identification code, analyzing the matching degree between the page-number label parameters of the corresponding PDF format file and the node path parameters of the XML structured file, and, if the matching degree is lower than a preset matching degree threshold, optimizing the path offset calculation rule to generate coordinate mapping parameters; S3, calling the coordinate mapping parameters, inputting the archive identification code, the page-number label parameters, and the node path parameters into a least-squares method to perform spatial positioning calculation, and performing three-dimensional coordinate mapping among relational database records, PDF file page numbers, and XML nodes to generate a three-dimensional positioning matrix; S4, based on the three-dimensional positioning matrix, analyzing the hierarchical depth deviation of the XML structured file nodes, and adjusting the node weights by calculating the difference between each node's hierarchical depth and the standard hierarchical depth to form a weight correction sequence; and S5, calculating the similarity between node weights according to the weight correction sequence, comparing the similarity with a preset synchronization threshold, and, if the similarity exceeds the threshold, dynamically adjusting the mapping cache state to form an integrated electronic archive management system.
- 2. The integrated management method for engineering electronic archive resources according to claim 1, wherein the archive identification code comprises a coding sequence, a check digest, and an index association key; the coordinate mapping parameters comprise an offset correction factor, a scale mapping rule, and a layout association weight; the three-dimensional positioning matrix comprises a spatial coordinate vector, a cross-medium association relationship, and a positioning confidence interval; the weight correction sequence comprises a correction coefficient set, a hierarchical weight distribution, and a deviation compensation amount; and the integrated electronic archive management system comprises a synchronization state set, a cache scheduling strategy, and a unified management view.
- 3. The integrated management method for engineering electronic archive resources according to claim 1, wherein the specific steps of S1 are as follows: S101, acquiring electronic archive data of the engineering project through engineering field acquisition equipment, checking the correspondence between the timestamp field and the acquisition order, checking whether the geographic coordinate values fall within the site coordinate boundary, and logically combining the timestamp judgment result and the coordinate judgment result to generate an archive spatiotemporal validity parameter set; S102, based on the archive spatiotemporal validity parameter set, collecting the unique identification field of the engineering project, normalizing the length of the character sequence of the identification field, cutting the normalized character coding sequence into fixed-length segments, and splicing in the timestamp value segment in order to obtain an engineering archive index coding sequence; S103, inputting the engineering archive index coding sequence into a hash mapping algorithm, executing the hash mapping operation on the plurality of segments, checking the index values of the mapping result for repetition, and executing sequential solidification processing on the non-repeated index values to generate the archive identification code.
- 4. The integrated management method for engineering electronic archive resources according to claim 3, wherein the specific steps of S2 are as follows: S201, based on the archive identification code, calling the page-number label parameters of the PDF format file, analyzing the index position of each page-number label in the page sequence, calculating the index difference between adjacent page-number labels, judging the increment relation, and summarizing the continuity state to generate a page-number label sequence judgment quantity; S202, according to the page-number label sequence judgment quantity, acquiring the node path parameters of the XML structured file, extracting the node path level index sequence, calculating the offset between each level index and the corresponding position in the page sequence, and judging the offset against the preset matching degree threshold to obtain a path matching offset; S203, for the path matching offset, calling the node path index and the page sequence index, adjusting the path offset calculation rule according to the distribution of the offsets, and rearranging the correspondence between the page position index and the node path level index to generate the coordinate mapping parameters.
- 5. The integrated management method for engineering electronic archive resources according to claim 4, wherein the preset matching degree threshold is based on an offset distribution parameter obtained by counting the results of matching page-number labels against XML structured file node paths in previously completed archive processing; the offset distribution parameter is calculated from a sample set formed by a plurality of groups of offsets between level indexes and the page sequence; the matching degree threshold is specifically the lower limit of the proportion of offsets in the sample set that fall within a preset confidence interval; and the preset confidence interval is determined according to the cumulative distribution of the occurrence frequency of the offsets in the sample set.
- 6. The integrated management method for engineering electronic archive resources according to claim 4, wherein the specific steps of S3 are as follows: S301, calling the coordinate mapping parameters, aligning the coordinate dimensions of the archive identification code, the PDF format file page-number label parameters, and the XML structured file node path parameters, extracting the spatial coordinate components according to the index correspondence, and taking the difference of the coordinate components that share the same index to generate a coordinate residual association set; S302, based on the coordinate residual association set, performing a weight-constrained accumulation of the residual components under each index association, judging the index association combinations according to the constraint that the aggregate residual be minimal, screening the index combinations that satisfy the constraint, and reconstructing the corresponding coordinate components to obtain an index convergence coordinate set; S303, according to the index convergence coordinate set, mapping the relational database record index, the PDF file page index, and the XML node path index to a unified numbering, and performing matrix arrangement and index mapping on the numbered coordinate components to generate the three-dimensional positioning matrix.
- 7. The integrated management method for engineering electronic archive resources according to claim 6, wherein the specific steps of S4 are as follows: S401, based on the three-dimensional positioning matrix, detecting the XML structured file node sequence, extracting the node path level marks, expanding them into level vectors, executing order verification and depth indexing, and recording the correspondence between node indexes and levels to generate a node level depth sequence; S402, according to the node level depth sequence, obtaining a standard level depth configuration sequence, aligning the two item by item, calculating the level difference for each aligned item while preserving its sign, and binding the differences to their indexes and arranging them as a vector to obtain a level depth deviation vector; S403, for the level depth deviation vector, mapping the deviation values onto weights according to the initial node weight set, aligning the mapping results by node index, and serializing them to generate the weight correction sequence.
- 8. The integrated management method for engineering electronic archive resources according to claim 7, wherein the specific steps of S5 are as follows: S501, according to the weight correction sequence, obtaining the node weight vectors, aligning the weight components in the same order by node index, normalizing the components according to a unified dimension rule, performing item-by-item difference operations on the normalized components, and binding the results to the node indexes to generate a node weight difference sequence; S502, based on the node weight difference sequence, acquiring the dimension and arrangement order of the node difference vectors, multiplying the difference values of the same dimension item by item and accumulating the products to form association quantities, and rearranging the association quantities into a matrix according to the symmetry relation between nodes to generate a node weight similarity matrix; S503, according to the node weight similarity matrix, obtaining a preset synchronization threshold parameter set, comparing the matrix elements with the synchronization threshold item by item, and performing mapping cache state transition and unified identification for the node pairs that exceed the threshold to establish the integrated electronic archive management system.
- 9. The integrated management method for engineering electronic archive resources according to claim 8, wherein the preset synchronization threshold is a quantitative parameter ranging from 0.1 to 0.9, and its specific value is adaptively adjusted according to the similarity and difference between nodes in the node weight difference sequence.
- 10. An integrated management system for engineering electronic archive resources, characterized in that the system is used for implementing the integrated management method for engineering electronic archive resources according to any one of claims 1 to 9, and the system comprises: an archive identification generation module, used for acquiring electronic archive data of an engineering project, detecting the archive timestamp and geographic coordinate parameters, collecting the unique identification field of the project, and inputting it into a hash mapping algorithm for index coding to generate an archive identification code; a layout path matching module, used for analyzing, based on the archive identification code, the matching degree between the page-number label parameters of the corresponding PDF layout file and the node path parameters of the XML structured file, and, if the matching degree is lower than a preset matching degree threshold, optimizing the path offset calculation rule to generate coordinate mapping parameters; a spatial mapping calculation module, which calls the coordinate mapping parameters, inputs the archive identification code, the page-number label parameters, and the node path parameters into a least-squares method to perform spatial positioning calculation, and performs three-dimensional coordinate mapping among relational database records, PDF file page numbers, and XML nodes to generate a three-dimensional positioning matrix; a hierarchical weight correction module, used for analyzing, based on the three-dimensional positioning matrix, the hierarchical depth deviation of the XML structured file nodes, and adjusting the node weights by calculating the difference between each node's hierarchical depth and the standard hierarchical depth to form a weight correction sequence; and a synchronization state management module, which calculates the similarity between node weights according to the weight correction sequence, compares the similarity with a preset synchronization threshold, and, if the similarity exceeds the threshold, dynamically adjusts the mapping cache state to form an integrated electronic archive management system.
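As an illustration of step S1 as detailed in claim 3, the following is a minimal Python sketch, not the patented implementation. The claims do not name a hash function, a normalized length, or a segment length; SHA-256, `NORMALIZED_LEN`, `SEGMENT_LEN`, and all function names are assumptions chosen only to make the example concrete.

```python
import hashlib
from typing import Optional

NORMALIZED_LEN = 16  # assumption: the claims do not fix a normalized length
SEGMENT_LEN = 4      # assumption: the claims do not fix a segment length


def is_spatiotemporally_valid(timestamps, coord, bounds):
    """S101 (sketch): timestamps follow acquisition order AND the
    coordinate lies inside the site boundary (xmin, ymin, xmax, ymax)."""
    in_order = all(a <= b for a, b in zip(timestamps, timestamps[1:]))
    x, y = coord
    xmin, ymin, xmax, ymax = bounds
    in_bounds = xmin <= x <= xmax and ymin <= y <= ymax
    return in_order and in_bounds


def index_coding_sequence(project_id, timestamp):
    """S102 (sketch): normalize the identifier length, cut it into
    fixed-length segments, then append the timestamp digits."""
    padded = project_id.strip().upper().ljust(NORMALIZED_LEN, "0")
    normalized = padded[:NORMALIZED_LEN]
    segments = [normalized[i:i + SEGMENT_LEN]
                for i in range(0, NORMALIZED_LEN, SEGMENT_LEN)]
    segments.append(str(timestamp))
    return segments


def archive_identification_code(segments, seen) -> Optional[str]:
    """S103 (sketch): hash every segment, fold the digests together in
    order, and reject a code that was already issued."""
    combined = hashlib.sha256()
    for segment in segments:
        combined.update(hashlib.sha256(segment.encode("utf-8")).digest())
    code = combined.hexdigest()[:32]
    if code in seen:
        return None  # repeated index value: not solidified
    seen.add(code)
    return code
```

A caller would keep one `seen` set per archive store, so that a second submission of the same identifier and timestamp is reported as a duplicate instead of receiving a new code.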
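Step S2, as detailed in claim 4, can be sketched in the same hedged way. The claims do not define how the matching degree is computed or how the offset rule is "optimized"; the sketch below assumes the matching degree is the share of page/node pairs with zero offset and that the correction is the most frequent offset. The function names and the default threshold of 0.8 are hypothetical.

```python
from collections import Counter


def page_labels_continuous(labels):
    """S201 (sketch): adjacent page-number labels increase by one.
    The step size of one is an assumption."""
    return all(b - a == 1 for a, b in zip(labels, labels[1:]))


def path_offsets(page_indexes, node_level_indexes):
    """S202 (sketch): offset between each XML node level index and
    the page sequence index it is paired with."""
    return [n - p for p, n in zip(page_indexes, node_level_indexes)]


def matching_degree(offsets):
    """Assumed definition: fraction of pairs whose offset is zero."""
    if not offsets:
        return 0.0
    return sum(1 for o in offsets if o == 0) / len(offsets)


def coordinate_mapping_parameters(page_indexes, node_level_indexes,
                                  threshold=0.8):
    """S203 (sketch): if the matching degree is below the threshold,
    take the most common offset as the correction factor."""
    offsets = path_offsets(page_indexes, node_level_indexes)
    degree = matching_degree(offsets)
    correction = 0
    if offsets and degree < threshold:
        correction = Counter(offsets).most_common(1)[0][0]
    return {"matching_degree": degree, "offset_correction": correction}
```

For example, if every XML node index runs two positions ahead of its page index, the matching degree is zero and the sketch proposes a correction of two.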
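Step S5, as detailed in claims 8 and 9, describes normalizing weight components, multiplying and accumulating them, and arranging the results in a symmetric matrix. One possible reading of that description is cosine similarity between per-node weight vectors, which is what the sketch below assumes; the claims do not name the measure. The state labels and function names are hypothetical; only the 0.1 to 0.9 threshold range comes from claim 9.

```python
import math


def normalize(vector):
    """Scale a weight vector to unit length (assumed normalization)."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector] if norm else list(vector)


def similarity_matrix(node_weights):
    """Item-by-item multiply and accumulate the normalized components
    of every node pair, giving a symmetric matrix."""
    unit = [normalize(v) for v in node_weights]
    n = len(unit)
    return [[sum(a * b for a, b in zip(unit[i], unit[j]))
             for j in range(n)] for i in range(n)]


def pairs_to_synchronize(matrix, threshold=0.5):
    """Return node pairs (i, j), i < j, whose similarity exceeds the
    synchronization threshold; claim 9 bounds it to 0.1..0.9."""
    if not 0.1 <= threshold <= 0.9:
        raise ValueError("threshold must lie between 0.1 and 0.9")
    n = len(matrix)
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if matrix[i][j] > threshold]


def update_cache_state(cache, pairs):
    """Mark both nodes of each qualifying pair as synchronized."""
    for i, j in pairs:
        cache[i] = "synchronized"
        cache[j] = "synchronized"
    return cache
```

With weight vectors `[1.0, 0.0]`, `[2.0, 0.0]`, and `[0.0, 3.0]`, the first two nodes point in the same direction and are marked as synchronized, while the third is left unchanged.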
Description
Engineering electronic archive resource integrated management method and system
Technical Field
The invention relates to the technical field of resource management, in particular to an integrated management method and system for engineering electronic archive resources.
Background
The technical field of resource management covers the planning, configuration, registration, storage, retrieval, maintenance, and supervision of many classes of resources. Its core concerns include formulating resource classification and coding rules, resource life-cycle management, resource information acquisition and arrangement, resource state updating, resource permission control, and resource sharing and scheduling. Against the background of informatization and digitalization, the field relies on database technology, network communication technology, and information system integration to manage scattered paper or electronic resources centrally, forming a unified resource catalog and management flow that supports long-term preservation, standardized use, and security control of the resources. It is widely applied in scenarios such as engineering construction archive management, public-institution resource management, and internal information management in enterprises and public institutions.
In the traditional integrated management method for engineering electronic archive resources, the project files, construction files, acceptance files, and related electronic data formed around an engineering project are managed manually or semi-automatically by project stage or file type. File classification levels and file numbering rules are set by hand, and electronic files are entered into the system one by one and associated with the project number, storage path, and responsible-person information. During management, file receipt and registration, version replacement, permission allocation, and borrowing-record maintenance are completed by manual operation; the association relations among engineering archive resources are described by manually filling in association fields or tables; archive data are scattered across different directories or servers; and managers gather, sort, and maintain them according to established procedures.
Existing engineering electronic archive management relies on manually set classification levels and file numbering rules. The identifier serves only as an external number and has no internal association with the file contents. Files in different formats are only weakly related through paths or fields, without support for the time and space dimensions. Because directories are scattered, state updates depend on manual summarization, association relations are easily filled in incorrectly, and version replacement and permission adjustment are difficult to synchronize. Structural changes require repeated tidying and correction, retrieval and positioning accuracy is limited, and management cost keeps rising.
Disclosure of Invention
Existing engineering electronic archive management relies on manually set classification levels and file numbering rules; the identifier serves only as an external number and has no internal association with the file contents; files in different formats are only weakly related through paths or fields, without support for the time and space dimensions; scattered directories make state updates depend on manual summarization; association relations are easily filled in incorrectly; version replacement and permission adjustment are difficult to synchronize; structural changes require repeated tidying and correction; retrieval and positioning accuracy is limited; and management cost keeps rising. To solve these technical problems, an embodiment of the invention provides an integrated management method for engineering electronic archive resources.
To achieve the above purpose, the invention adopts an integrated management method for engineering electronic archive resources, comprising the following steps: S1, acquiring electronic archive data of an engineering project, detecting the archive timestamp and geographic coordinate parameters, collecting the unique identification field of the project, and inputting it into a hash mapping algorithm for index coding to generate an archive identification code; S2, based on the archive identification code, analyzing the matching degree between the page-number label parameters of the corresponding PDF format file and the node path parameters of the XML structured file, and, if the matching degree is lower than a preset matching degree threshold, optimizing the path offset calculation rule to generate coordinate mapping parameters; S3, calling the coordinate mapping parameters, inputting the archive identification code, the page-number label parameters, and the node path parameters into a least-squares method to perform spatial positioning calcu