Search

US-20260127184-A1 - AUTOMATED LINKAGE OF ANCHOR PAGES

US20260127184A1US 20260127184 A1US20260127184 A1US 20260127184A1US-20260127184-A1

Abstract

Example implementations relate to automated modification of interface pages to include relevant link information. In an example, an anchor page and a plurality of candidate pages are obtained and an embedding is generated for each page by applying a page embedding model to a representative keyword construction extracted from each of the at least one anchor page or a candidate page of the plurality of candidate pages. A set of similar candidate pages is selected using a comparison of the embeddings and a relevance score is determined for each candidate page. The relevance score represents a relevance of a candidate page to the anchor page. A diversification score is generated for each candidate page and a linked set of candidate pages is selected based on the relevance scores and diversification scores. The anchor page is modified to include a link to each page in the linked set.

Inventors

  • Xia Zhao
  • Silu Wang
  • Bingjie HUANG
  • Wei Shen

Assignees

  • WALMART APOLLO, LLC

Dates

Publication Date
20260507
Application Date
20241107

Claims (20)

  1. 1 . A system, comprising: a processor; and a non-transitory memory storing instructions that, when executed, cause the processor to: obtain at least one anchor page and a plurality of candidate pages; generate an embedding for the at least one anchor page and each of the plurality of candidate pages by applying a page embedding model to a representative keyword construction extracted from the at least one anchor page or a candidate page of the plurality of candidate pages; select a set of similar candidate pages from the plurality of candidate pages using a comparison of the embedding for the at least one anchor page and the embedding for each candidate page in the plurality of candidate pages; determine a relevance score for each candidate page in the set of similar candidate pages, wherein the relevance score is representative of a relevance of a corresponding candidate page to the at least one anchor page; generate a diversification score for each candidate page in the set of similar candidate pages; select a linked set of candidate pages based on the relevance score and diversification score for each candidate page in the set of similar candidate pages; and modify the at least one anchor page to include a link to each page in the linked set of candidate pages.
  2. 2 . The system of claim 1 , wherein responsive to receiving a request for the at least one anchor page, the processor executes the instructions to display a seasonal anchor page associated with the at least one anchor page.
  3. 3 . The system of claim 1 , wherein the page embedding model comprises a two tower Bidirectional Encoder Representations from Transformers (BERT) model.
  4. 4 . The system of claim 1 , wherein the relevance score is generated by an Approximate Nearest Neighbor (ANN) search.
  5. 5 . The system of claim 1 , wherein the linked set of candidate pages is selected by ranking each candidate page in the set of similar candidate pages based on the relevance score, the diversification score, and a page type.
  6. 6 . The system of claim 1 , wherein the representative keyword construction includes text data extracted from the at least one anchor page or a candidate page of the plurality of candidate pages and an item identifier.
  7. 7 . The system of claim 1 , wherein the at least one anchor page comprises a seasonal anchor page.
  8. 8 . A computer-implemented method, comprising obtaining at least one anchor page and a plurality of candidate pages; generating an embedding for the at least one anchor page and each of the plurality of candidate pages by applying a page embedding model to a representative keyword construction extracted from each of the at least one anchor page or a candidate page of the plurality of candidate pages; selecting a set of similar candidate pages from the plurality of candidate pages using a comparison of the embedding for the at least one anchor page and the embedding for each candidate page in the plurality of candidate pages; determining a relevance score for each candidate page in the set of similar candidate pages, wherein the relevance score is representative of a relevance of a corresponding candidate page to the at least one anchor page; generating a diversification score for each candidate page in the set of similar candidate pages; selecting a linked set of candidate pages based on the relevance score and diversification score for each candidate page in the set of similar candidate pages; and modifying the at least one anchor page to include a link to each page in the linked set of candidate pages.
  9. 9 . The computer-implemented method of claim 8 , comprising displaying a seasonal anchor page associated with the at least one anchor page responsive to receiving a request for the at least one anchor page.
  10. 10 . The computer-implemented method of claim 8 , wherein the page embedding model comprises a two tower Bidirectional Encoder Representations from Transformers (BERT) model.
  11. 11 . The computer-implemented method of claim 8 , wherein the relevance score is generated by an Approximate Nearest Neighbor (ANN) search.
  12. 12 . The computer-implemented method of claim 8 , wherein the linked set of candidate pages is selected by ranking each candidate page in the set of similar candidate pages based on the relevance score, the diversification score, and a page type.
  13. 13 . The computer-implemented method of claim 8 , wherein the representative keyword construction includes text data extracted from the at least one anchor page or a candidate page of the plurality of candidate pages and an item identifier.
  14. 14 . The computer-implemented method of claim 8 , wherein the at least one anchor page comprises a seasonal anchor page.
  15. 15 . A non-transitory computer-readable medium having instructions stored thereon, wherein the instructions, when executed by at least one processor, cause at least one device to perform operations comprising: obtaining at least one anchor page and a plurality of candidate pages; generating an embedding for the at least one anchor page and each of the plurality of candidate pages by applying a page embedding model to a representative keyword construction extracted from each of the at least one anchor page or a candidate page of the plurality of candidate pages; selecting a set of similar candidate pages from the plurality of candidate pages using a comparison of the embedding for the at least one anchor page and the embedding for each candidate page in the plurality of candidate pages; determining a relevance score for each candidate page in the set of similar candidate pages, wherein the relevance score is representative of a relevance of a corresponding candidate page to the at least one anchor page; generating a diversification score for each candidate page in the set of similar candidate pages; selecting a linked set of candidate pages based on the relevance score and diversification score for each candidate page in the set of similar candidate pages; and modifying the at least one anchor page to include a link to each page in the linked set of candidate pages.
  16. 16 . The non-transitory computer-readable medium of claim 15 , wherein the instructions cause the device to perform operations comprising displaying a seasonal anchor page associated with the at least one anchor page responsive to receiving a request for the at least one anchor page.
  17. 17 . The non-transitory computer-readable medium of claim 15 , wherein the page embedding model comprises a two tower Bidirectional Encoder Representations from Transformers (BERT) model.
  18. 18 . The non-transitory computer-readable medium of claim 15 , wherein the relevance score is generated by an Approximate Nearest Neighbor (ANN) search.
  19. 19 . The non-transitory computer-readable medium of claim 15 , wherein the linked set of candidate pages is selected by ranking each candidate page in the set of similar candidate pages based on the relevance score, the diversification score, and a page type.
  20. 20 . The non-transitory computer-readable medium of claim 15 , wherein the representative keyword construction includes text data extracted from the at least one anchor page or a candidate page of the plurality of candidate pages and an item identifier.

Description

TECHNICAL FIELD This application relates generally to network linkage, and more particularly, to linking network pages to improve interface navigability. BACKGROUND Automated indexing and search functions, such as search engines, rely on multiple factors to identify relevant interface pages in response to user queries, such as content relevancy, sitemaps, content quality evaluations, internal interlinking, and user experience feedback. Although these systems utilize multiple factors, one or more factors, such as internal interlinking, are more significant weighting factors for determining search results. An interface page, such as a webpage, that scores low on a significant factor, such as internal interlinking, may not be returned in search results despite being highly relevant, high quality, and of interest to a user. When adding new interface pages to existing interface sets (e.g., websites), the added interface pages may have limited, if any, links to other pages within the existing interface set. Some current systems utilize manual processes to update some existing interface pages that have content related to the added pages, such as topic pages, summary pages, etc., but these manual processes are cumbersome and cannot be applied to systems having a large volume of interface pages. Similarly, if a system relies on rotating pages, such as seasonal pages, the newly generated seasonal pages are manually generated to include linkages to all relevant pages, presenting similar issues to manual updates of existing pages but additionally adding a time constraint. BRIEF DESCRIPTION OF THE DRAWINGS Various examples will be described below with reference to the following figures. FIG. 1 depicts an example system for automatically generating network linkages for an interface page, in accordance with some embodiments. FIG. 2 depicts an example system for generating linked evergreen anchor pages and linked seasonal anchor pages, in accordance with some embodiments. FIG. 3 depicts an example system for selecting a set of candidate anchor pages for link generation, in accordance with some embodiments. FIG. 4 depicts an example system for page embedding generation, in accordance with some embodiments. FIG. 5 depicts an example system for page diversification, in accordance with some embodiments. FIG. 6 is a flowchart depicting an example method for automatically generating linked anchor pages, in accordance with some embodiments. FIG. 7 depicts an example system with a machine-readable medium that includes instructions for automatically generating linked anchor pages, in accordance with some embodiments. FIG. 8 depicts an example computer system that implements one or more of the disclosed processes, in accordance with some embodiments. DETAILED DESCRIPTION The disclosed systems and methods for generating network linkages provide an automated process that allows for automatic generation of interface linkages for inclusion in an interface page to enhance the corresponding interface page interlinking in a network environment without manual interaction. In some embodiments, the identification of a set of candidate pages for linking within a corresponding anchor page based on generated embeddings, a relevance score, and/or a corresponding diversification score allows for automatic identification of relevant pages that enhance interlinking tasks, such as mapping for automated traversal of network structures by search engines or other indexing processes. In some embodiments, the identification of seasonal pages and corresponding linkage based on one or more seasonal features provides for enhancement of short-term or rotated interface pages within usable timeframes to enhance the automated traversal of network structures by search engines or other indexing processes corresponding to seasonal updates or refreshes. This description of the example embodiments is intended to be read in connection with the accompanying drawings that are to be considered part of the entire written description. Terms concerning data connections, coupling and the like, such as “connected” and “interconnected,” and/or “in signal communication with” refer to a relationship wherein systems or elements are electrically connected (e.g., wired, wireless) to one another, either directly or indirectly, through intervening systems, unless expressly described otherwise. The term “operatively coupled” is such a coupling or connection that allows the pertinent structures to operate as intended by virtue of that relationship. In the following, various embodiments are described with respect to the claimed systems as well as with respect to the claimed methods. Features, advantages, or alternative embodiments herein may be assigned to the other claimed objects and vice versa. In other words, claims for the systems may be improved with features described or claimed in the context of the methods. In this case, the functional features of the method are embodied by obj