KR-20260066772-A - Method and system for integrating and processing geospatial and location data from different classification systems
Abstract
Techniques for geospatial data processing are disclosed. In one aspect, a data processing system stores an integrated hierarchical and spatial database comprising all geospatial units of each geospatial entity stored in multiple datasets from one or more data publishers, and stores integrated hierarchical and spatial relationship information for each geospatial unit with respect to other geospatial units, receives a query to the integrated hierarchical and spatial database, wherein the query specifies a geospatial place, geospatial unit, time period, search topic, or any combination thereof applicable across one or more data publishers, multiple datasets, and multiple layers of multiple datasets, transmits a response to the query, wherein the response comprises one or more search results linked together across one or more data publishers, multiple datasets, multiple layers, or any combination thereof.
Inventors
- 후튼, 크리스토퍼 알렉스
Assignees
- 후튼, 크리스토퍼 알렉스
Dates
- Publication Date
- 20260512
- Application Date
- 20240930
- Priority Date
- 20231003
Claims (20)
- As a data processing system, One or more memories; One or more network interfaces; and It includes one or more processors coupled to the one or more memories and the one or more network interfaces, The above one or more processors, either alone or in combination: To store in the above one or more memories an integrated hierarchical and spatial database comprising all geospatial units of each geospatial entity stored in multiple datasets from one or more data publishers—the multiple datasets are arranged according to corresponding multiple layers, and the integrated hierarchical and spatial database stores integrated hierarchical and spatial relationship information for each geospatial unit with respect to other geospatial units of each geospatial entity stored in the multiple datasets—; To receive a query for the integrated hierarchical and spatial database through the one or more network interfaces above—the query specifies a value for a geographic place, a value for a geospatial unit, a value for a time period, a value for a search topic, or any combination thereof to be applied across the one or more data publishers, the plurality of datasets, and the plurality of layers—; and It is configured to transmit a response to the query through one or more of the above network interfaces, and The above response includes one or more search results connected together across the one or more data publishers, the plurality of datasets, the plurality of layers, or any combination thereof, Data processing system.
- In paragraph 1, The above one or more processors, either alone or in combination: To retrieve the plurality of datasets arranged according to the corresponding plurality of layers from the one or more data publishers through the one or more network interfaces; To store a geospatial unit database comprising all geospatial units of each geographic entity stored in the plurality of datasets in the above one or more memories — said geospatial unit database stores geospatial unit identification variables for each geospatial unit —; To store in the one or more memories a geospatial intersection database comprising all geospatial units of each geospatial entity stored in the plurality of datasets—the geospatial intersection database stores spatial features of each geospatial unit representing the spatial relationships of said geospatial unit to other geospatial units of each geospatial entity stored in the plurality of datasets—; and The above one or more memories are configured to store a hierarchical relational database comprising all geospatial units of each geospatial entity stored in the plurality of datasets, wherein the hierarchical relational database stores hierarchical information for each geospatial unit representing the hierarchical and non-hierarchical spatial relationships of the geospatial unit with respect to other geospatial units of each geospatial entity stored in the plurality of datasets. Data processing system.
- In paragraph 2, The geospatial unit identification variables for each geospatial unit include the name, type, code, and geometric shape of the geospatial unit, Data processing system.
- In paragraph 1, The types of the aforementioned integrated hierarchical and spatial relationship information for each geospatial unit are: Hierarchical, Spatial, Complete spatial overlap, Partial spatial overlap, or No spatial overlap including one or more of, Data processing system.
- In paragraph 1, The above-mentioned integrated hierarchical and spatial relationship information includes one or more variables indicating that a pair of geospatial units are spatially compatible, Data processing system.
- In paragraph 5, The above pair of geospatial units are determined to be spatially compatible based on the determination that the pair of geospatial units originate from the same data publisher, that the pair of geospatial units originate from the same hierarchy, that they share a basic sub-unit, that they have spatial overlap, or that any combination thereof; or The above pair of geospatial units are determined to be spatially compatible based on the determination that the above pair of geospatial units originate from different data publishers, that the above pair of geospatial units originate from the same hierarchy, share a basic sub-unit, have spatial overlap, or any combination thereof. Data processing system.
- In paragraph 1, The above one or more processors, either alone or in combination, The above one or more memories are additionally configured to store a geospatial unit-based database of geo-tagged items, wherein the geospatial unit-based database of geo-tagged items stores connections between non-geospatial unit geo-tagged items and each geospatial unit of each geographic entity stored in the plurality of datasets. Data processing system.
- In Paragraph 7, The geospatial unit-based database of the above-mentioned geo-tagged items comprises a 4×4 dimensional matrix that identifies four dimensions of location information of non-geospatial unit geo-tagged items for four dimensions of geospatial space, Data processing system.
- In paragraph 8, The four dimensions of the above location information include the geographic place, the geospatial unit, the time period, and the search topic, and the four dimensions of the geospatial space include length, width, height, and time, Data processing system.
- In Paragraph 9, Search results matching the value of the above geographical place, the value of the above geospatial unit, the value of the above time period, the value of the above search topic, or a combination thereof are dynamically filtered based on the value of the length, the value of the width, the value of the height, and the value of the time to determine the one or more search results. Data processing system.
- In Paragraph 10, The above query additionally specifies the values of the length, the width, the height, and the time, Data processing system.
- As a method performed by a data processing system, A step of storing an integrated hierarchical and spatial database comprising all geospatial units of each geographic entity stored in multiple datasets from one or more data publishers in one or more memories coupled to the data processing system above—the multiple datasets are arranged according to a plurality of corresponding layers, and the integrated hierarchical and spatial database stores integrated hierarchical and spatial relationship information for each geospatial unit with respect to other geospatial units of each geographic entity stored in the multiple datasets—; A step of receiving a query for the integrated hierarchical and spatial database through one or more network interfaces coupled to the data processing system—the query specifies values for geographic places, values for geospatial units, values for time periods, values for search topics, or any combination thereof to be applied across the one or more data publishers, the plurality of datasets, and the plurality of layers—; and The method includes the step of transmitting a response to the query through one or more of the above network interfaces, The above response includes one or more search results connected together across the one or more data publishers, the plurality of datasets, the plurality of layers, or any combination thereof, A method performed by a data processing system.
- In Paragraph 12, A step of retrieving the plurality of datasets arranged according to the corresponding plurality of layers from the one or more data publishers through the one or more network interfaces; A step of storing a geospatial unit database comprising all geospatial units of each geographic entity stored in the plurality of datasets in the one or more memories above — the geospatial unit database stores geospatial unit identification variables for each geospatial unit —; A step of storing a geospatial intersection database comprising all geospatial units of each geospatial entity stored in the plurality of datasets in the one or more memories above—the geospatial intersection database stores spatial features of each geospatial unit representing spatial relationships of the geospatial unit to other geospatial units of each geospatial entity stored in the plurality of datasets—; and The method further includes the step of storing a hierarchical relational linkage database comprising all geospatial units of each geographic entity stored in the plurality of datasets in the above one or more memories, and The above hierarchical relationship linkage database stores hierarchical information for each geospatial unit representing the hierarchical and non-hierarchical spatial relationships of the geospatial unit with respect to other geospatial units of each geospatial entity stored within the plurality of datasets. A method performed by a data processing system.
- In Paragraph 13, The geospatial unit identification variables for each geospatial unit include the name, type, code, and geometric shape of the geospatial unit, A method performed by a data processing system.
- In Paragraph 12, The types of the aforementioned integrated hierarchical and spatial relationship information for each geospatial unit are: Hierarchical, Spatial, Complete spatial overlap, Partial spatial overlap, or No spatial overlap including one or more of, A method performed by a data processing system.
- In Paragraph 12, The above-mentioned integrated hierarchical and spatial relationship information includes one or more variables indicating that a pair of geospatial units are spatially compatible, A method performed by a data processing system.
- In Paragraph 16, The above pair of geospatial units are determined to be spatially compatible based on the determination that the pair of geospatial units originate from the same data publisher, that the pair of geospatial units originate from the same hierarchy, that they share a basic sub-unit, that they have spatial overlap, or that any combination thereof; or The above pair of geospatial units are determined to be spatially compatible based on the determination that the above pair of geospatial units originate from different data publishers and that the above pair of geospatial units originate from the same hierarchy, share a basic sub-unit, have spatial overlap, or any combination thereof. A method performed by a data processing system.
- In Paragraph 12, The method further includes the step of storing a geospatial unit-based database of geo-tagged items in one or more of the above memories, and The geospatial unit-based database of the above geo-tagged items stores connections between non-geospatial unit geo-tagged items and each geospatial unit of each geographic entity stored within the plurality of datasets, A method performed by a data processing system.
- In Paragraph 18, The geospatial unit-based database of the above-mentioned geo-tagged items comprises a 4×4 dimensional matrix that identifies four dimensions of location information of non-geospatial unit geo-tagged items for four dimensions of geospatial space, A method performed by a data processing system.
- As a device, Means for storing an integrated hierarchical and spatial database comprising all geospatial units of each geographic entity stored in multiple datasets from one or more data publishers—said that the multiple datasets are arranged according to corresponding multiple layers, and said integrated hierarchical and spatial database stores integrated hierarchical and spatial relationship information for each geospatial unit with respect to other geospatial units of each geographic entity stored in said multiple datasets—; Means for receiving a query for the integrated hierarchical and spatial database above—the query specifies a value for a geographic place, a value for a geospatial unit, a value for a time period, a value for a search topic, or any combination thereof applicable across the one or more data publishers, the plurality of datasets, and the plurality of layers—; and It includes means for transmitting a response to the above query, The above response includes one or more search results connected together across the one or more data publishers, the plurality of datasets, the plurality of layers, or any combination thereof, device.
Description
Method and system for integrating and processing geospatial and location data from different classification systems Cross-reference regarding related applications [0001] This patent application claims priority to U.S. Regular Application No. 18/480,383, filed on October 3, 2023, titled “METHOD AND SYSTEM FOR INTEGRATING AND PROCESSING GEOSPATIAL AND LOCATIONAL DATA FROM DIFFERENT TAXONOMIES”, the entirety of which is expressly incorporated herein by reference. [0002] The aspects of the present disclosure generally relate to integrating and processing geospatial and location data from different classification systems. [0003] Current geospatial and location data processing and analysis systems face two major obstacles that significantly slow down the speed of data processing and limit the ability to perform comprehensive and sophisticated analyses. First, geospatial data is siloed across data publishers, formal hierarchy of relationships, and individual datasets. Second, current systems for geospatial data processing and analysis require significant technical expertise to perform any processing or analysis, especially when the processing or analysis involves multiple classification schemes. These obstacles hinder users from seamlessly switching between data from different publishers, hierarchy (including different geographic units), and datasets while performing analysis, and from analyzing data across these classification schemes. [0004] The first issue, data silos, creates significant barriers to processing and analysis by hindering the connection of geospatial units to the full range of their related data through a whole set of formal, hierarchical, and informal spatial relationships of units, with the exception of individual, manual transformations performed afterward by the user. In general, each individual geospatial data—whether it is unit data or geotagged/location-informed item data—can be placed within one or more hierarchical systems and within one or more spatial systems. [0005] Across these hierarchical and spatial systems, there are various types of relationships possible between geospatial units, which must first be manually documented in order to fully analyze the entire set of relevant data of the geospatial units. Specifically, there are two types of formal relationships and three types of informal relationships. First, there are formal, hierarchical relationships between units from the same hierarchical track system, where a unit is a direct component of one or more other units, and one or more other units are, in turn, direct components of yet another one or more other units (e.g., Jackson County is a formal hierarchical sub-unit of Missouri). This application refers to this type of formal relationship as a hierarchical relationship. Second, there are formal, organizational system relationships where geospatial units do not overlap in space and do not come from the same hierarchical track but belong to the same organizational system (e.g., Jackson County is in the same system as Nebraska, but is not a hierarchical component of Nebraska and has no spatial overlap with Nebraska). These geospatial units may coexist in space and be informally connected, but not due to hierarchical or spatial relationships. This application refers to this type of relationship as organizational system relationships. Third, there are informal, spatial relationships (e.g., zip code x overlaps in space with census area y but comes from different hierarchical and/or organizational systems). This application refers to this type of relationship as informal, spatial relationships. Fourth, there is a subtype of informal, spatial relationships in which different hierarchies and/or organizational systems use the same geospatial unit in their respective hierarchical systems. In this case, there may also be informal, spatial relationships between the units; However, there are additional considerations because direct relationships exist between shared units (since they are the same entities), and informal hierarchical relationships from different units of each level to units of other levels can be derived based on approximate aggregation levels (e.g., Census Block x is a formal sub-unit of both Zip Code y and Census Area z; consequently, Zip Code y and Census Area z may also have an informal relationship). This application refers to this type of relationship as a basic unit relationship. Finally, there are also geospatial units that lack hierarchical, organizational, and spatial relationships. These exist together in space but can be grouped only arbitrarily (e.g., New York City is often compared to London, but they are not formally or informally related). [0006] Each geospatial data observation can be connected to various different geospatial units through these relationships. However, the siloed nature of geospatial data means that one cannot immediately identify such relationships or derive analytical conclusions from the