Search

CN-122019469-A - Hydrologic and geoprime intelligent archive system for full life cycle management and control

CN122019469ACN 122019469 ACN122019469 ACN 122019469ACN-122019469-A

Abstract

Aiming at the problems of management fragmentation, low retrieval accuracy, stiff borrowing flow, low resource utilization rate and the like in the management of the hydrogeology archives, the invention constructs the intelligent management system covering the whole life cycle, the system adopts a three-layer B/S architecture, realizes the automatic traceability of archives through the whole life cycle management module, improves the retrieval accuracy by utilizing a multi-mode retrieval algorithm, designs the flexible borrowing approval flow based on secret-related classification, and develops an intelligent recommendation module by combining user behaviors. The invention realizes multi-mode retrieval, flow approval, intelligent pushing and full life cycle state tracing of the hydrogeologic archive resource, improves archive retrieval efficiency, shortens borrowing approval period, improves archive resource utilization rate, and provides powerful support for efficient decision and data sharing of hydrogeologic work.

Inventors

  • WANG XIA
  • CAO WANJUN

Assignees

  • 中国煤炭地质总局水文地质局

Dates

Publication Date
20260512
Application Date
20251218

Claims (10)

  1. 1. The hydrologic and geoprime intelligent archive system controlled by the full life cycle is characterized by comprising a system architecture module, a full life cycle control module, a multi-mode retrieval module, a flexible borrowing approval module and an intelligent recommendation module; The system architecture module adopts a three-layer B/S architecture, and comprises an interface expression layer, a business logic layer and a data access layer, and is used for: The interface presentation layer provides a responsive interface, supports access of a PC end and a mobile end, integrates a file retrieval center, a borrowing management, a personal center and a system management inlet, and comprises a project file thematic page special for hydrologic industry; The business logic layer comprises a full life cycle management and control engine, a multi-mode search engine, a flexible approval engine and an intelligent recommendation engine, and is used for processing core logic of file management; the data access layer adopts a mixed storage mode of a relational database and a non-relational database, the relational database stores archive metadata, the non-relational database stores unstructured data, and data operation is realized through a unified data access interface; The full life cycle management and control module is used for: Managing the full life cycle state of the hydrogeologic archives from collection, storage, retrieval, borrowing and archiving to destruction, and realizing automatic traceability and data linkage of the archives through a state transfer function, wherein the archives life cycle state comprises unaddressed, in-store, borrowing, overdue and destroying the state to be audited and destroyed, and the state transfer is triggered by an event; the multi-mode search module is used for: The file retrieval is carried out based on metadata, text semantics and associated dimensions, the multi-mode retrieval module adopts a multi-layer retrieval model, comprises a metadata retrieval layer, a text depth retrieval layer and an associated dimension retrieval layer, and sorts retrieval results through a comprehensive scoring algorithm; the flexible borrowing approval module is used for: Designing a differential approval process according to the file secret-related classification, wherein the secret-related classification comprises public, secret and confidential levels, and the approval duration is dynamically calculated based on the file secret level and the approval level; the intelligent recommendation module is used for: Based on the historical borrowing behavior of the user and the project association degree, recommending archive resources by adopting a collaborative filtering algorithm.
  2. 2. The full life cycle managed hydrogeoprime intelligence archive system of claim 1, wherein the state transfer functions in the full life cycle management module are: Wherein S i is the current state of the file, E j is a file state transition trigger event, and S k is a target state; The file state transfer triggering event comprises file archiving, borrowing initiation, file returning, application destruction, audit passing and audit rejection; the system automatically records a state transfer log, which contains trigger time, operators and related events, and realizes the traceability of the whole life cycle.
  3. 3. The full life cycle managed hydrologic prime number intelligence archive system of claim 1, wherein the metadata retrieval layer of the multimodal retrieval module performs a combined retrieval based on archive type, project name, borehole number and formation time field, the retrieval score formula being: Wherein, the In order to retrieve the number of fields, For the weight of the field, Is a field And query terms Matching degree of (3); The text depth retrieval layer adopts a BERT-based hydrologic professional corpus model to carry out semantic matching, and a retrieval score formula is as follows: Wherein, the And Semantic vectors of query words and text fragments respectively, and judging effective matching when Score 2 is more than or equal to 0.6; the association dimension retrieval layer establishes an item-drilling-water damage data association map, and an association degree formula is as follows: the final search result is as follows: The composite scores are arranged in descending order.
  4. 4. The full life cycle managed hydrographic prime number intelligence archival system of claim 1, wherein the flexible borrowing approval module's approval duration prediction formula is: Wherein, the As the basic approval duration, the public file is 1 hour, the secret file is 4 hours, the secret file is 24 hours, alpha is the approval level coefficient, and L is the approval level; the public file adopts single-layer approval, the secret file adopts double-layer approval, and the secret file adopts three-layer approval; The system displays the approval progress in real time and automatically sends a reminder before the borrowing expires.
  5. 5. The full life cycle managed hydrologic and geoprime intelligent archive system of claim 1, wherein the recommendation similarity formula of the intelligent recommendation module is: Wherein, the For the current user to be present, In order to recommend the profile, For users The history is borrowed for a set of files, For users For files Is used to determine the frequency of borrowing, Is files And (3) with Is a degree of association of (1); the intelligent recommendation module calculates the association degree by combining the project, the area and the data type based on the borrowing record of the user.
  6. 6. The full life cycle controlled hydrologic and geoprime intelligent archive system of claim 1, further comprising an archive collection module supporting archive uploading, format verification and automatic archiving, uploading of an automatic file verification format, including PDF, JPEG, TIFF, MP4; And after the verification is passed, automatically distributing a unique number for the file based on the full life cycle management and control module, and entering a warehouse state.
  7. 7. The full life cycle managed hydrologic geoprime intelligence archives system of claim 6, wherein the unique number format is XM-year-project number-archive type, XM represents item prefix, year is four digit number year formed by archive, project number is unique identifier of project, archive type is predefined archive classification code for distinguishing archive category; the unique number is automatically generated during file uploading and is used for full life cycle state tracing and retrieval association.
  8. 8. The full life cycle managed hydrologic geoprime intelligence archive system of claim 1, wherein the system hardware environment comprises a server and a client; the server is configured to be not lower than a 16-core CPU and a 64GB memory, the storage adopts an SSD and HDD hybrid architecture, SSD is used for database files, HDD is used for archiving and storing; the client supports a PC end and a mobile end, the PC end requires an Intel Core i5 and above CPU and an 8GB memory, and the mobile end supports an Android 10.0 and above or iOS 14.0 and above system; the network environment requires that the bandwidth of the server is not lower than 100Mbps, and the network delay is controlled within 50 ms.
  9. 9. The full life cycle controlled hydrologic and geoprime intelligence archive system of claim 1, wherein the system software environment comprises a Linux operating system at a server side and a Windows, macOS, android, iOS system at a client side; the database adopts a MySQL relational database and a MongoDB non-relational database, and realizes a unified data access interface through a Hibernate framework; The development framework comprises a front-end responsive framework and a back-end lightweight framework, and the middleware comprises a Web server and a buffering middleware.
  10. 10. The full life cycle controlled hydrologic and geoprime intelligent archive system of claim 1, wherein the system interface presentation layer provides a temporary storage clamp function, a user can add files into the temporary storage clamp in batches and borrow the files, and interaction is realized through JavaScript codes; the data access interface of the business logic layer comprises a query archive by item and an archive storage method.

Description

Hydrologic and geoprime intelligent archive system for full life cycle management and control Technical Field The invention belongs to the technical field of archive management and hydrogeologic application, and relates to a hydrogeoprime intelligent archive system for full life cycle management and control. Background The hydrogeologic archives are used as data carriers for core works such as hydrologic investigation, coal mine water control engineering, ecological water retention engineering and the like, and the management efficiency directly determines the scientificity and safety of hydrographic engineering decisions. The files cover key information such as drilling parameters, mine hydrogeology type reports, water damage foundation monitoring data, water control engineering design schemes and the like, and are foundation supports for water resource management, disaster prevention and ecological protection. With the shift of hydrogeology to digitization and refinement, the traditional file management mode is difficult to adapt to the development demands of the industry, the core pain point is increasingly prominent, and the modern upgrading of file management is urgently realized through technical innovation. The current hydrogeology archive management still takes paper files or scattered electronic files as the main materials, and lacks a unified full life cycle management and control standard, so that the key problems of (1) data splitting and version confusion are caused, namely, the files are mutually independent from each other from the formation and archiving to the destruction of the files, and the problems of repeated storage, inconsistent versions and the like are easy to occur. For example, in mine water control projects, investigation scheme reports and matching drawings are often stored in different positions (such as project group computers and archive servers), and are required to be manually checked and associated, so that the efficiency is low and errors are prone to occur. (2) The searching efficiency is low, the existing searching mode depends on shallow layer matching of the title-key words, and the accurate searching cannot be performed in deep file text contents (such as drilling depth, water chemistry components and the like). The average searching time is more than 30 minutes, and the quick response requirement of emergency scenes such as mine water burst and the like is difficult to meet. (3) The borrowing process is complicated, the borrowing approval is mainly offline, the period is as long as 3-5 working days, and the file state (such as borrowers and residual time) cannot be tracked in real time. Meanwhile, a hierarchical security mechanism aiming at security files (such as water burst emergency data) is lacking, and the risk of data leakage exists. (4) The resource multiplexing is insufficient, the file sharing capability is weak, a large amount of high-value data (such as old mining area drilling records and historical water damage cases) are in an idle state, and the lack of an intelligent pushing mechanism based on user behaviors or project association restricts the trans-regional experience multiplexing. The domestic research is gradually deepened from 2000 to form the following representative achievements, but the general system limitation is that the digital archive management system basic edition promoted by the national archive office realizes basic functions such as electronic archiving, but is not optimized for hydrogeology professional characteristics (such as unstructured data processing and project association requirements), and cannot effectively support dynamic management of data such as drilling curves, monitoring maps and the like. (2) The industrialized system has single function, and a special system developed by a part of hydrologic bureau (such as a hydrologic file digital management system of a yellow river hydrologic bureau) focuses on data storage and simple inquiry, but does not cover the full life cycle of the file, and has single retrieval dimension (only supports site-time inquiry), so that complex scenes such as water damage prevention and treatment are difficult to associate. (3) Theoretical research and practice are disjointed, namely, a workflow engine scheme proposed by a learner in journals such as archives communication optimizes approval efficiency, but the security and applicability are limited because the security classification characteristics (such as imperative, secret and public data differentiation requirements) of hydrologic files are not combined. Foreign researches are guided by long-term preservation and open sharing, have higher maturity, but are difficult to directly apply to the field of hydrogeology, and (1) the framework universality is too strong, namely the OAIS standard of the International archives Congress (ICA) provides a full life cycle management framework, but dynamic update characteristics of the hydroarchive (su