CN-121997356-A - Desensitization method for geographic sensitive data
Abstract
The invention discloses a desensitization method for geographically sensitive data, comprising the following steps: obtaining an original data set and a prior geographic information map; determining the land use type and the geological background unit type of each data acquisition point from the prior geographic information map, and determining a maximum offset based on the corresponding geographically sensitive data; generating a random offset that is bounded by the maximum offset and confined to the geological background unit of the data acquisition point, thereby obtaining desensitized coordinates; applying a controlled random perturbation to the geographically sensitive data of each data acquisition point and, after logarithmic transformation, performing a lining value transformation using the regional background value to obtain lining value data; generating an initial grid by interpolation from the desensitized coordinates and the lining value data; outputting a final grid after multiple smoothing filter iterations; and providing a graded visual display based on the final grid.
Inventors
- CHEN HUAZHU
- Xiu Linfang
- MO WENBO
- WANG XIA
- DENG GUANYU
Assignees
- 海南省生态环境地质调查院
Dates
- Publication Date
- 2026-05-08
- Application Date
- 2026-01-08
Claims (10)
- 1. A method for desensitizing geographically sensitive data, the method comprising the steps of: acquiring an original data set and a prior geographic information map, wherein the prior geographic information map comprises a land use type map and a geological background unit map, and the original data set comprises the geographic coordinates of each data acquisition point and the corresponding geographically sensitive data; determining the land use type and the geological background unit type of each data acquisition point from the prior geographic information map, and determining a maximum offset based on the geographically sensitive data corresponding to that point; generating a random offset that is bounded by the maximum offset and confined to the geological background unit of the data acquisition point, thereby obtaining desensitized coordinates; applying a controlled random perturbation to the geographically sensitive data of each data acquisition point and, after logarithmic transformation, performing a lining value transformation using the regional background value to obtain lining value data; and generating an initial grid by interpolation from the desensitized coordinates and the lining value data, outputting a final grid after multiple smoothing filter iterations, and providing a graded visual display based on the final grid.
- 2. The method for desensitizing geographically sensitive data according to claim 1, wherein the geographically sensitive data comprises environmental monitoring data and pollution distribution survey data; the environmental monitoring data includes at least radionuclide content and radiation dose rate; the pollution distribution survey data includes at least soil heavy metal concentration, toxic pollutant content of water bodies, and atmospheric pollutant concentration; and each data acquisition point collects any one type of geographically sensitive data.
- 3. The method for desensitizing geographically sensitive data according to claim 1, wherein determining the land use type and the geological background unit type of each data acquisition point from the prior geographic information map comprises: performing a spatial overlay analysis of the geographic coordinates of each data acquisition point against the land use type map and the geological background unit map in the prior geographic information map, respectively; extracting from the land use type map the land use type attribute corresponding to the geographic coordinates, and thereby determining the land use type of the data acquisition point; and extracting from the geological background unit map the geological zoning attribute corresponding to the geographic coordinates, and thereby determining the geological background unit type to which the data acquisition point belongs.
- 4. The method for desensitizing geographically sensitive data according to claim 3, wherein, before the original data set and the prior geographic information map are acquired, a correspondence between every land use type and its basic maximum offset is established, yielding a land use type-basic jitter amplitude lookup table.
- 5. The method for desensitizing geographically sensitive data according to claim 4, wherein determining the maximum offset based on the corresponding geographically sensitive data comprises: querying the land use type-basic jitter amplitude lookup table with the land use type of the data acquisition point to determine the basic maximum offset corresponding to that land use type; setting a scaling factor according to the value of the type of geographically sensitive data collected at the data acquisition point; and multiplying the basic maximum offset by the scaling factor to obtain the maximum offset.
- 6. The method for desensitizing geographically sensitive data according to claim 5, wherein generating the random offset that is bounded by the maximum offset and confined to the geological background unit of the data acquisition point, thereby obtaining the desensitized coordinates, comprises: generating an initial offset with a random function within the spatial range centered on the geographic coordinates of the data acquisition point and having the maximum offset as its radius, to obtain candidate desensitized coordinates; judging whether the candidate desensitized coordinates fall within the geological background unit of the data acquisition point, and if so, taking the candidate desensitized coordinates as the desensitized coordinates of the data acquisition point, and if not, discarding the initial offset and regenerating the random offset; and repeating the judging step, and if no candidate desensitized coordinates satisfying the condition have been generated when a preset maximum number of attempts is reached, selecting the point within the boundary of the geological background unit that is closest to the geographic coordinates as the desensitized coordinates.
- 7. The method for desensitizing geographically sensitive data according to claim 6, wherein the maximum amplitude of the controlled random perturbation is a preset percentage of the value of the geographically sensitive data, and the higher the data value, the smaller the preset percentage.
- 8. The method for desensitizing geographically sensitive data according to claim 7, wherein performing the lining value transformation using the regional background value after logarithmic transformation to obtain the lining value data comprises: adding the controlled random perturbation to the geographically sensitive data to obtain intermediate data, and applying a logarithmic transformation to the intermediate data; obtaining the regional background value corresponding to the type of geographically sensitive data, and calculating the logarithm of the regional background value; and dividing the logarithm of the intermediate data by the logarithm of the regional background value to obtain the lining value data.
- 9. The method for desensitizing geographically sensitive data according to claim 8, wherein generating the initial grid by interpolation from the desensitized coordinates and the lining value data, and outputting the final grid after multiple smoothing filter iterations, comprises: associating the desensitized coordinates of each data acquisition point one-to-one with the corresponding lining value data to construct a spatial interpolation data set; interpolating the interpolation data set with a spatial interpolation algorithm, and generating an initial grid file at a preset pixel size; and iteratively smoothing the initial grid file with a filtering algorithm at a preset filter window size, ending the iterative smoothing when the difference between the cumulative frequency distribution curves of two successive iterations is smaller than a preset threshold, and taking the filtered grid data as the final grid and outputting it.
- 10. The method for desensitizing geographically sensitive data according to claim 9, wherein the graded visual display based on the final grid comprises: defining grading thresholds based on the numerical distribution of the lining value data of the grid cells in the final grid, and dividing the values into grading intervals; generating a first type of visualization layer and a second type of visualization layer based on the grading intervals; and displaying the first or the second type of visualization layer to a user according to the user's permissions.
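The coordinate desensitization in claims 4 to 6 can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the lookup table `BASE_MAX_OFFSET_M` and its values, the scaling factor passed in by the caller, the representation of a geological background unit as a simple polygon in projected coordinates (metres), and the reading of the fallback in claim 6 as the nearest point on the unit boundary. The claims do not fix any of these details.

```python
import math
import random

# Hypothetical land use type -> basic maximum offset in metres (illustrative values only).
BASE_MAX_OFFSET_M = {"farmland": 300.0, "forest": 500.0, "urban": 100.0}

def max_offset(land_use, scale):
    """Claim 5: basic maximum offset for the land use type times a scaling factor."""
    return BASE_MAX_OFFSET_M[land_use] * scale

def point_in_polygon(x, y, poly):
    """Ray-casting test; poly is a list of (x, y) vertices of one geological unit."""
    inside = False
    n = len(poly)
    for i in range(n):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge straddles the horizontal ray through y
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside

def nearest_boundary_point(x, y, poly):
    """Closest point to (x, y) on the polygon boundary (assumed fallback of claim 6)."""
    best, best_d2 = None, math.inf
    n = len(poly)
    for i in range(n):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % n]
        dx, dy = x2 - x1, y2 - y1
        seg2 = dx * dx + dy * dy
        t = 0.0 if seg2 == 0 else max(0.0, min(1.0, ((x - x1) * dx + (y - y1) * dy) / seg2))
        px, py = x1 + t * dx, y1 + t * dy
        d2 = (px - x) ** 2 + (py - y) ** 2
        if d2 < best_d2:
            best, best_d2 = (px, py), d2
    return best

def desensitize_coordinate(x, y, unit_poly, r_max, max_tries=100, rng=random):
    """Claim 6: rejection-sample an offset within r_max that stays inside the unit."""
    for _ in range(max_tries):
        r = r_max * math.sqrt(rng.random())   # sqrt gives uniform density over the disk
        theta = 2.0 * math.pi * rng.random()
        cx, cy = x + r * math.cos(theta), y + r * math.sin(theta)
        if point_in_polygon(cx, cy, unit_poly):
            return cx, cy
    return nearest_boundary_point(x, y, unit_poly)

# Usage: a square unit 1 km on a side and a point in an "urban" cell.
unit = [(0.0, 0.0), (1000.0, 0.0), (1000.0, 1000.0), (0.0, 1000.0)]
new_xy = desensitize_coordinate(500.0, 500.0, unit, max_offset("urban", 1.5))
```

Rejection sampling keeps the accepted offsets uniformly distributed over the part of the disk that lies inside the unit, which is why a candidate is discarded rather than clipped to the boundary. A production implementation would more likely rely on a GIS geometry library than on hand-written polygon tests.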
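The stopping rule for the iterative smoothing in claim 9 can be sketched as follows. The claim names neither the filter nor the measure of difference between cumulative frequency curves, so this sketch assumes a square mean filter and the maximum absolute difference between the two curves sampled at fixed bin edges. The function names, the default threshold, the bin count and the iteration cap are all hypothetical.

```python
import bisect

def mean_filter(grid, window=3):
    """Square moving-average filter; edge cells average the neighbours that exist."""
    h = window // 2
    rows, cols = len(grid), len(grid[0])
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            vals = [grid[a][b]
                    for a in range(max(0, i - h), min(rows, i + h + 1))
                    for b in range(max(0, j - h), min(cols, j + h + 1))]
            out[i][j] = sum(vals) / len(vals)
    return out

def cumulative_frequency(grid, edges):
    """Fraction of cells with value <= each edge (empirical cumulative frequency curve)."""
    flat = sorted(v for row in grid for v in row)
    return [bisect.bisect_right(flat, e) / len(flat) for e in edges]

def smooth_until_stable(grid, window=3, threshold=0.01, bins=20, max_iter=50):
    """Repeat the filter until successive cumulative frequency curves nearly coincide."""
    flat = [v for row in grid for v in row]
    lo, hi = min(flat), max(flat)
    # Fixed edges over the initial range; a mean filter cannot leave that range.
    edges = [lo + (hi - lo) * k / bins for k in range(1, bins + 1)]
    prev = cumulative_frequency(grid, edges)
    for iteration in range(1, max_iter + 1):
        grid = mean_filter(grid, window)
        cur = cumulative_frequency(grid, edges)
        if max(abs(a - b) for a, b in zip(prev, cur)) < threshold:
            return grid, iteration
        prev = cur
    return grid, max_iter

# Usage: a 5 x 5 grid of lining values with one isolated high cell.
initial = [[0.0] * 5 for _ in range(5)]
initial[2][2] = 9.0
final_grid, n_iter = smooth_until_stable(initial)
```

Tying the stopping rule to the value distribution rather than to a fixed iteration count means smoothing ends once further passes no longer change how many cells fall in each value range, which is what the graded display in claim 10 depends on.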
Description
Desensitization method for geographic sensitive data
Technical Field
The invention relates to the technical field of geographic information, in particular to a desensitization method for geographically sensitive data.
Background
With the continuous development of environmental survey and monitoring technology and the growing demand for open sharing of government data, the safe and efficient release of geospatially sensitive data has become a key topic at the intersection of environmental science, geographic information systems, and data security. Such data are often directly tied to core sensitive information such as the locations of sensitive facilities and the precise extents of anomalous pollution areas. Releasing the original geographic coordinates and survey or monitoring values directly can leak sensitive information and can also cause unnecessary public panic, creating security hazards and social governance risks.
In the prior art, there are three main ways of processing geospatially sensitive data for release. The first is overall blurring, in which the survey and monitoring data in the target area are blurred as a whole. This reduces the risk of information leakage to some extent, but it severely degrades the spatial resolution and gradient features of the data, so the processed data can no longer objectively reflect the real regional distribution trend and lose much of their scientific value in scenarios such as environmental analysis and decision support. The second is simple coordinate offsetting, in which position desensitization is achieved by applying a fixed distance or a random offset to all data acquisition points. Such offsets are relatively easy to reverse, and the method readily destroys the inherent spatial structure of the data and its inherent association with geographic elements such as terrain and land use type, which seriously affects the reliability of subsequent applications of the data. The third is direct graded display, in which the data are simply divided into grades and released together with the original geographic coordinates. This cannot avoid the risk of exposing the locations of sensitive points, so its protective effect is limited. A desensitization method for geographically sensitive data is therefore needed that integrates prior geographic knowledge into the desensitization process and preserves as much of the scientific value and application reliability of the data as possible while effectively protecting sensitive information.
Disclosure of Invention
In view of the prior art, the invention aims to provide a desensitization method for geographically sensitive data that mainly solves the technical problems described in the background.
To achieve the above purpose, the technical scheme of the embodiment of the invention comprises the following steps: acquiring an original data set and a prior geographic information map, wherein the prior geographic information map comprises a land use type map and a geological background unit map, and the original data set comprises the geographic coordinates of each data acquisition point and the corresponding geographically sensitive data; determining the land use type and the geological background unit type of each data acquisition point from the prior geographic information map, and determining a maximum offset based on the geographically sensitive data corresponding to that point; generating a random offset that is bounded by the maximum offset and confined to the geological background unit of the data acquisition point, thereby obtaining desensitized coordinates; applying a controlled random perturbation to the geographically sensitive data of each data acquisition point and, after logarithmic transformation, performing a lining value transformation using the regional background value to obtain lining value data; and generating an initial grid by interpolation from the desensitized coordinates and the lining value data, outputting a final grid after multiple smoothing filter iterations, and providing a graded visual display based on the final grid.
Optionally, the geographically sensitive data comprises environmental monitoring data and pollution distribution survey data; the environmental monitoring data includes at least radionuclide content and radiation dose rate; the pollution distribution survey data includes at least soil heavy metal concentration, toxic pollutant content of water bodies, and atmospheric pollutant concentration; and each data acquisition point collects any one type of geographically sensitive data.
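The perturbation and lining value steps summarized above, which claims 7 and 8 spell out, can be sketched as follows. Claim 8 gives the transformation itself: the logarithm of the perturbed value divided by the logarithm of the regional background value. The rest is assumed for illustration: the tier table `PERTURB_TIERS`, the choice to tier by the ratio of the value to the background, the uniform distribution of the perturbation, and the function names.

```python
import math
import random

# Hypothetical tiers: (upper bound of value / background, maximum perturbation fraction).
# Higher values get a smaller percentage, as claim 7 requires; the numbers are illustrative.
PERTURB_TIERS = [(1.0, 0.10), (3.0, 0.05), (math.inf, 0.02)]

def max_perturb_fraction(value, background):
    """Maximum perturbation amplitude as a fraction of the measured value."""
    ratio = value / background
    for upper, fraction in PERTURB_TIERS:
        if ratio <= upper:
            return fraction

def lining_value(value, background, rng=random):
    """Claim 8: log(perturbed value) / log(regional background value)."""
    if value <= 0 or background <= 0 or background == 1:
        raise ValueError("needs value > 0 and a positive background other than 1")
    fraction = max_perturb_fraction(value, background)
    perturbed = value * (1.0 + rng.uniform(-fraction, fraction))  # intermediate data
    return math.log(perturbed) / math.log(background)

# Usage: a measured value of 50 against a regional background of 20 (same units).
lv = lining_value(50.0, 20.0)
```

Because the result is a ratio of two logarithms, it does not depend on the logarithm base, and a measurement equal to the background maps to a lining value of 1 before perturbation. The ratio is undefined for a background of exactly 1, and it changes sign for backgrounds below 1, so the units in which values are expressed matter in practice.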
Optionally, judging the land utilization type and the geological bac