CN-122020624-A - Anti-interference standard data traceability fingerprint embedding and identifying method and system
Abstract
The invention relates to the technical field of data processing and information security, and discloses an anti-interference standard data traceability fingerprint embedding and identifying method and system. The method comprises the steps of firstly modeling layout boundaries, partition boundaries, field areas, table unit areas or predefined anchor areas of standard data to obtain a plurality of standardized target areas and corresponding area indexes, then generating area-level synchronous index reference information according to the spatial distribution relation of each standardized target area, binding the area-level synchronous index reference information with corresponding fingerprint fragments, and then respectively embedding the area-level synchronous index reference information into the corresponding areas. During identification, interference analysis is carried out on standard data to be identified, the corresponding relation between the regional indexes and the spatial positions of each standardized target region is restored according to the regional synchronous index reference information, corresponding fingerprint fragments are extracted after regional reconstruction is completed, index consistency check and source matching are carried out, and therefore stability and accuracy of traceability identification under cutting, rotation, scaling, resampling, compression, screenshot or printing scanning conditions are improved.
Inventors
- Hua Ruzhong
Assignees
- 润申标准化技术服务(上海)有限公司
Dates
- Publication Date
- 20260512
- Application Date
- 20260414
Claims (10)
- 1. An anti-interference standard data traceability fingerprint embedding and identifying system, which is characterized by comprising: The standard data modeling module is used for acquiring standard data to be processed, and establishing a regional structure model for the standard data according to layout boundaries, partition boundaries, field areas, table unit areas or predefined anchor areas of the standard data so as to acquire a plurality of standardized target areas and area indexes of the standardized target areas; the region-level synchronization index reference generation module is used for generating region-level synchronization index reference information according to the spatial distribution relation of the plurality of standardized target regions, wherein the region-level synchronization index reference information is used for representing the corresponding relation between the region index and the spatial position of each standardized target region; The fingerprint segment construction module is used for generating traceable fingerprint information according to the source identification and splitting the traceable fingerprint information into a plurality of fingerprint segments respectively corresponding to different area indexes; The binding embedding module is used for binding the region-level synchronous index reference information with the corresponding fingerprint fragments according to the region index and then respectively embedding the region-level synchronous index reference information and the corresponding fingerprint fragments into the corresponding standardized target regions so as to obtain embedded standard data; The interference analysis and region recovery module is used for carrying out interference type analysis on the standard data to be identified and recovering the corresponding relation between the region index and the space position of each standardized target region in the standard data to be identified according to the region-level synchronous index reference information so as to finish reconstruction of the standardized target region; the segment extraction module is used for extracting corresponding fingerprint segments from the corresponding standardized target region according to the region index obtained by recovery after the standardized target region is rebuilt; The consistency recognition module is used for carrying out index consistency check and source matching on the plurality of extracted fingerprint fragments so as to output a traceability recognition result; The region-level synchronous index reference information and the fingerprint segments establish a binding relation based on the same region index, so that after standard data to be identified are disturbed by cutting, rotation, scaling, resampling, compression, screenshot or printing scanning, the region index of a standardized target region can be recovered first, and then the corresponding fingerprint segments are extracted for source identification.
- 2. The system of claim 1, wherein the modular data modeling module creates a localized structural model comprising at least one of a page bounding region, a header footer region, a field bearing region, a table cell region, a teletext column region, or a predefined anchor region.
- 3. The system of claim 1, wherein the region-level synchronization index reference information generated by the region-level synchronization index reference generation module comprises at least one of region boundary reference information, region center reference information, region adjacency relation reference information, region scale reference information, or region level reference information.
- 4. The system of claim 1, wherein the binding embedding module is configured to set different embedding strengths, embedding densities, redundancy levels, or priorities for the standardized target regions corresponding to different region indexes, so that when a portion of the standardized target regions of the standard data to be identified is damaged, fingerprint segments in the remaining standardized target regions can still be used for source identification.
- 5. The system of claim 1, wherein the consistency recognition module is configured to perform region index consistency verification, segment integrity evaluation, and cross-region matching result fusion on the extracted fingerprint segments before source matching, and output a final traceability recognition result according to the fusion result.
- 6. An anti-interference standard data traceability fingerprint embedding and identifying method is characterized by comprising the following steps: s1, acquiring standard data to be processed, and establishing a regional structure model according to layout boundaries, partition boundaries, field areas, table unit areas or predefined anchor areas of the standard data to obtain a plurality of standardized target areas and area indexes of the standardized target areas; S2, generating region-level synchronous index reference information for representing the corresponding relation between the region index and the spatial position of each standardized target region according to the spatial distribution relation of the plurality of standardized target regions; S3, generating traceable fingerprint information according to the source identification, and splitting the traceable fingerprint information into a plurality of fingerprint fragments respectively corresponding to different area indexes; S4, binding the region-level synchronous index reference information with the corresponding fingerprint segments according to the region index, and then respectively embedding the region-level synchronous index reference information and the corresponding fingerprint segments into the corresponding standardized target regions to obtain embedded standard data; s5, obtaining standard data to be identified, and analyzing the interference type of the standard data to be identified; S6, recovering the corresponding relation between the region index and the space position of each standardized target region in the standard data to be identified according to the region-level synchronous index reference information so as to complete reconstruction of the standardized target region; S7, extracting corresponding fingerprint fragments from the corresponding standardized target areas according to the restored area indexes; s8, carrying out index consistency check sum source matching on the extracted fingerprint fragments, and outputting a tracing identification result; The method comprises the steps of establishing a binding relation between region-level synchronous index reference information and fingerprint fragments based on the same region index, enabling standard data to be identified to recover the region index of a standardized target region after being interfered, and extracting the corresponding fingerprint fragments for source identification.
- 7. The method of claim 6, wherein the creating a localized structural model in S1 based on the standard data comprises: And identifying page boundaries, partition boundaries, field boundaries, table cell boundaries, graphic column boundaries or predefined anchor region boundaries in the standard data, and generating a standardized target region and a corresponding region index based on the position relation between the boundaries.
- 8. The method of claim 6, wherein generating the region-level synchronization index reference information in S2 from the spatial distribution relationship of the plurality of normalized target regions comprises: And generating corresponding region boundary reference information, region center reference information, region adjacency relation reference information, region scale reference information or region hierarchy reference information according to the center positions, boundary positions, adjacent relations, scale relations or hierarchy relations of the plurality of standardized target regions.
- 9. The method according to claim 6, wherein the performing interference type analysis on the standard data to be identified in S5 includes: and identifying at least one of clipping, rotation, scaling, resampling, compression, screenshot or printing scanning interference suffered by the standard data to be identified, and determining a corresponding region recovery processing path according to an identification result.
- 10. The method according to claim 6, wherein the performing index consistency checksum source matching on the extracted fingerprint segments in S8, outputting a traceability recognition result, includes: and respectively judging whether the region index corresponding to each fingerprint fragment is consistent with the region index obtained by recovery, and carrying out fusion judgment according to the consistency result, the fragment integrity and the fragment matching result so as to output a final source identification result.
Description
Anti-interference standard data traceability fingerprint embedding and identifying method and system Technical Field The invention relates to the technical field of data processing and information security, in particular to an anti-interference standard data traceability fingerprint embedding and identifying method and system. Background Along with the wide application of standard data in scenes such as bill circulation, license management, form transmission, report archiving, format file distribution, electronic document exchange and the like, how to embed traceability information which can be used for source identification and responsibility tracing into the standard data becomes an important technical requirement in data security management and data circulation control. In order to distinguish the propagation paths of standard data and the use sources or distribution objects, in the prior art, specific identification information is generally embedded into a data carrier in a digital watermark, a digital fingerprint or an implicit identification mode, and the identification information is extracted and judged in a subsequent identification stage, so that the data tracing is realized. Most of the existing tracing information embedding and identifying schemes are developed around images, watermark documents or security documents. According to the partial scheme, a synchronous mark, a synchronous block, a calibration pattern or a reference signal is arranged in the carrier, so that the identification end corrects rotation, scaling, cutting or other geometric distortion suffered by the carrier before detecting the traceability information, and then the identification is completed after the embedding position of the watermark or the fingerprint is restored. The scheme improves the detection capability of the carrier after geometric transformation to a certain extent, and becomes a common technical route in the identification of the anti-geometric interference watermark. However, in standard data scenarios, there are still significant limitations to the prior art. Standard data usually has predefined layout boundaries, field areas, table cell areas, graphic column areas or other regularized structure areas, and each area inside the standard data has not only a spatial position relationship but also a relatively stable structure correspondence relationship. The synchronous marks or calibration signals in the prior art are mostly focused on restoring the geometric gestures of the whole image or the whole page, but the corresponding relation among a plurality of structural areas in the standard data is not fully restored. After standard data is transmitted, compressed, resampled, screenshot, printed and scanned or partially cut, even if the geometric correction of the whole layer can be completed, the situation that the local target area is misplaced, the area boundary is misplaced or the embedded area is not identified can still occur, so that the trace-source fingerprint extraction is failed, the omission ratio is increased, or erroneous judgment is generated when the local area is damaged. Furthermore, most of the tracing schemes in the prior art embed the identification information as a whole into the carrier, and extract the whole identification information during identification. In the mode, after the standard data is subjected to local shielding, local replacement, region cutting or multiple times of processing, the whole recognition result is easily influenced by the information distortion of the local region, and the recognition stability is insufficient. Especially, under the condition that the standard data has multiple areas, regular distribution, fixed fields or fixed formats, if the embedded positions and the identification positions of the traceability information cannot be established with more stable corresponding relations by combining the regional structural characteristics of the standard data, the anti-interference capability and the identification accuracy are difficult to be considered. Therefore, how to construct an anti-interference trace fingerprint embedding and identifying scheme capable of recovering the corresponding relation between the trace fingerprint embedding position and the identifying position under the interference conditions of cutting, rotating, zooming, resampling, compressing, screenshot or printing scanning and the like aiming at the structural region characteristics of the standard data becomes a technical problem to be solved in the field. Disclosure of Invention The invention aims to provide an anti-interference standard data trace-source fingerprint embedding and identifying method and system, which are used for solving the technical problems that the trace-source fingerprint embedding position and the identifying position are out of synchronization, a target area is difficult to stably reconstruct, the accuracy of fingerprint extraction is reduced and the stability