CN-121996658-A - Data processing method
Abstract
The present application relates to a data processing method. The method comprises the steps of obtaining a facility data set of an entity facility and an identification template, wherein the facility data set comprises coordinate information, entity type, time information and environment information of the entity facility, generating a space identification of the entity facility based on the coordinate information and an earth subdivision grid algorithm, generating a time identification according to the time information, determining a basic identification of the entity facility according to a preset identification generation rule and the entity type, determining an index identification of the entity facility based on the environment information, and filling the basic identification, the space identification, the time identification and the index identification into the identification template to obtain the identification of the entity facility. By adopting the method, the accuracy of the data processing method can be improved.
Inventors
- ZENG YANYAN
- LIN ZHIYONG
- HOU MENGYING
- Tao Yingchun
- ZHANG KUI
- Liang Hanmei
- XU ZONGXIA
- ZHOU YANDI
- ZHANG XUPING
- TIAN HUIMIN
Assignees
- 北京市测绘设计研究院
Dates
- Publication Date
- 20260508
- Application Date
- 20260127
Claims (10)
- 1. A method of data processing, the method comprising: Acquiring a facility data set of an entity facility, and acquiring an identification template, wherein the facility data set comprises coordinate information, entity category, time information and environment information of the entity facility; Generating a space identifier of the entity facility based on the coordinate information and an earth subdivision grid algorithm, and generating a time identifier according to the time information; Determining a basic identifier of the entity facility according to a preset identifier generation rule and the entity category, and determining an index identifier of the entity facility based on the environment information; And filling the basic identifier, the space identifier, the time identifier and the index identifier into the identifier template to obtain the identifier of the entity facility.
- 2. The method of claim 1, wherein generating a spatial identification of the physical facility based on the coordinate information and an geostationary grid algorithm, and generating a temporal identification from the temporal information, comprises: Judging whether the facility data set meets preset data processing conditions or not, wherein the facility data set also comprises geometric types; If the facility data set meets the data processing conditions, generating a space identifier of the entity facility according to the geometric type, the coordinate information and an earth subdivision grid algorithm; And generating the time identifier of the entity facility according to the generation time or the current time in the time information.
- 3. The method of claim 2, wherein said determining whether the facility data set satisfies a preset data processing condition comprises: performing air verification processing on each facility data in the facility data set to obtain an air verification result; determining a verification condition corresponding to each facility data under the condition that the facility data are not empty as the verification result; performing verification processing on the facility data based on the verification conditions to obtain a verification result; And under the condition that each verification result is verification passing, determining that the facility data set meets the preset data processing condition.
- 4. The method of claim 2, wherein the coordinate information is a set of coordinate characters, and the generating the spatial identification of the physical facility according to the geometry type, the coordinate information, and an earth-dissected grid algorithm comprises: If the geometric type is a multi-organization type, segmenting the coordinate character set to obtain coordinate character groups of all organizations in the entity facility; Generating a space identifier of each organization according to the coordinate character set and the earth subdivision grid algorithm of each organization; And if the geometric type is a single organization type, generating a space identifier of the entity facility according to the coordinate character set and an earth subdivision grid algorithm.
- 5. The method of claim 1, wherein the determining an index identity of the entity facility based on the environmental information comprises: judging whether the entity facilities have similar entity facilities in the same category and in the same time and space according to the environment information; when the similar entity facilities exist, determining index identifiers of the entity facilities according to the time identifiers, the space identifiers and the basic identifiers; And when the similar entity facilities do not exist, determining index identification of the entity facilities based on the identification template.
- 6. The method of claim 5, wherein determining the index identity of the entity facility based on the temporal identity, the spatial identity, and the base identity comprises: Splicing the space identifier, the time identifier and the basic identifier to obtain an initial space-time identifier; And inquiring the occurrence times of the initial space-time identifier, and determining the index identifier of the entity facility according to the identifier template and the occurrence times.
- 7. The method of claim 6, wherein after determining the index identification of the entity facility based on the environmental information, the method further comprises: acquiring a geographic entity data set of a geographic entity corresponding to the entity facility, wherein the geographic entity data set comprises unique space identity codes of the geographic entity; judging whether the geographic entity meets preset association conditions or not based on the space identifier and the geographic entity data set; And associating the unique spatial identity code with the initial space-time identification in the case that the geographic entity meets the association condition.
- 8. The method of claim 7, wherein the set of geographic entity data includes a geospatial identification of the geographic entity, and wherein the determining whether the geographic entity satisfies a preset association condition based on the spatial identification and the set of geographic entity data comprises: If the geographic space identifier is the same as the space identifier, determining that the geographic entity meets a preset association condition; If the geographic space identifier is different from the space identifier, acquiring the entity geometric area of the entity facility and the geographic geometric area of the geographic entity; And under the condition that the area similarity between the entity geometric area and the geographic geometric area reaches a preset first similarity threshold value, determining that the geographic entity meets a preset association condition.
- 9. The method of claim 1, wherein the populating the base identifier, the spatial identifier, the temporal identifier, and the index identifier into the identifier template results in the identity of the entity facility, the method further comprising: Receiving an identification update request of an entity facility to be updated, which is sent by a user terminal, wherein the identification update request comprises an update data set of the entity facility to be updated; Judging whether the historical identification of the entity facility to be updated meets a preset updating condition according to the updating data set; and under the condition that the history identifier meets the updating condition, executing the step of acquiring the facility data set of the entity facility according to the updating data set until the identifier of the entity facility to be updated is obtained, and updating the history identifier according to the identifier.
- 10. The method according to claim 9, wherein the update data set includes new geometric data and a history identifier of the entity facility to be updated, and the determining whether the history identifier of the entity facility to be updated meets a preset update condition according to the update data set includes: according to the history identification, inquiring the history geometric data of the history entity facilities corresponding to the entity facilities to be updated in a database; determining geometrical similarity between the entity facility to be updated and the historical entity facility according to the new geometrical data and the historical geometrical data; And under the condition that the geometric similarity reaches a preset second similarity threshold, determining that the history identifier meets a preset updating condition.
Description
Data processing method Technical Field The present application relates to the field of data processing technologies, and in particular, to a data processing method. Background Urban physical facilities are one of key objects for urban fine governance. In the city construction process, the city entity facility result with tens of millions of data volume is formed. In order to be able to manage various urban entity facilities (entity facilities for short), it is necessary to set the identity of the entity facility to manage the entity facility. In the traditional technology, a unified address library is established manually according to experience. For each entity facility, determining an address corresponding to the entity facility in an address library, and taking the address as an identification of the entity facility. However, in the conventional technology, a lot of time is required for building the address library, and meanwhile, some urban entity facilities (such as manhole covers) cannot be identified by the address uniquely and accurately, so that the accuracy of the identification is low. Therefore, the current data processing method has lower accuracy and lower efficiency. Disclosure of Invention In view of the foregoing, it is desirable to provide a data processing method, apparatus, computer device, computer readable storage medium, and computer program product. In a first aspect, the present application provides a data processing method, including: Acquiring a facility data set of an entity facility, and acquiring an identification template, wherein the facility data set comprises coordinate information, entity category, time information and environment information of the entity facility; Generating a space identifier of the entity facility based on the coordinate information and an earth subdivision grid algorithm, and generating a time identifier according to the time information; Determining a basic identifier of the entity facility according to a preset identifier generation rule and the entity category, and determining an index identifier of the entity facility based on the environment information; And filling the basic identifier, the space identifier, the time identifier and the index identifier into the identifier template to obtain the identifier of the entity facility. In one embodiment, the generating the spatial identifier of the entity facility based on the coordinate information and the geostationary grid algorithm, and generating the time identifier according to the time information, includes: Judging whether the facility data set meets preset data processing conditions or not, wherein the facility data set also comprises geometric types, If the facility data set meets the data processing conditions, generating a space identifier of the entity facility according to the geometric type, the coordinate information and an earth subdivision grid algorithm; And generating the time identifier of the entity facility according to the generation time or the current time in the time information. In one embodiment, the determining whether the facility data set meets a preset data processing condition includes: performing air verification processing on each facility data in the facility data set to obtain an air verification result; determining a verification condition corresponding to each facility data under the condition that the facility data are not empty as the verification result; performing verification processing on the facility data based on the verification conditions to obtain a verification result; And under the condition that each verification result is verification passing, determining that the facility data set meets the preset data processing condition. In one embodiment, the coordinate information is a coordinate character set, and the generating the spatial identifier of the entity facility according to the geometric type, the coordinate information and the geostationary grid algorithm includes: If the geometric type is a multi-organization type, segmenting the coordinate character set to obtain coordinate character groups of all organizations in the entity facility; Generating a space identifier of each organization according to the coordinate character set and the earth subdivision grid algorithm of each organization; And if the geometric type is a single organization type, generating a space identifier of the entity facility according to the coordinate character set and an earth subdivision grid algorithm. In one embodiment, the determining, based on the environmental information, an index identification of the entity facility includes: judging whether the entity facilities have similar entity facilities in the same category and in the same time and space according to the environment information; when the similar entity facilities exist, determining index identifiers of the entity facilities according to the time identifiers, the space identifiers and the basic identifier