WO-2026092949-A1 - REWARD SIGNAL REQUIREMENT PARAMETER

Abstract

A user device may transmit, to a network node, a request for at least one reward signal associated with a supported artificial intelligence and machine learning (AIML) functionality. The request may include a requirement parameter associated with the at least one reward signal of the supported AIML functionality. The user device may receive the at least one reward signal based on the requirement parameter.

Inventors

  • SAHIN, Taylan

Assignees

  • NOKIA TECHNOLOGIES OY

Dates

Publication Date
2026-05-07
Application Date
2025-10-02
Priority Date
2024-10-29

Claims (20)

  1. An apparatus comprising: at least one processor; and at least one memory storing instructions that, when executed by the at least one processor, cause the apparatus at least to: transmit, to a network node, a request for at least one reward signal associated with a supported artificial intelligence and machine learning (AIML) functionality, the request comprising a requirement parameter associated with the at least one reward signal of the supported AIML functionality; and receive the at least one reward signal based on the requirement parameter.
  2. The apparatus of claim 1, wherein the requirement parameter associated with the at least one reward signal comprises at least one of: a time parameter of a delay requirement associated with the at least one reward signal of the supported AIML functionality, wherein the time parameter comprises at least one of: a time value; a minimum time value; a maximum time value; or a range of time values; a period associated with the at least one reward signal of the supported AIML functionality; a starting time associated with the at least one reward signal of the supported AIML functionality; an ending time associated with the at least one reward signal of the supported AIML functionality; a time duration associated with the at least one reward signal of the supported AIML functionality; a triggering condition associated with the at least one reward signal of the supported AIML functionality; a total number of the at least one reward signal; or a batch size of the at least one reward signal, wherein the batch size corresponds to a number of instances of each reward signal.
  3. The apparatus of claim 1 or 2, wherein the apparatus is further caused to transmit an indication of support for one or more artificial intelligence and machine learning (AIML) functionalities.
  4. The apparatus of any of claims 1 to 3, wherein the supported AIML functionality comprises at least one of: one or more functionalities related to beam management; one or more functionalities related to beamforming; one or more functionalities related to positioning; one or more functionalities related to allocation of radio resources; one or more functionalities related to scheduling; one or more functionalities related to power control; one or more functionalities related to link adaptation; one or more functionalities related to mobility; or one or more functionalities related to selection of a modulation and coding scheme (MCS).
  5. The apparatus of claim 4, wherein the apparatus is further caused to receive a request to activate one of the one or more AIML functionalities.
  6. The apparatus of claim 5, wherein the apparatus is further caused to: select the one of the one or more AIML functionalities; and transmit an indication of the selected one of the one or more AIML functionalities.
  7. The apparatus of any of claims 1 to 6, wherein the supported AIML functionality is associated with an AIML model, and wherein the AIML model uses an AIML algorithm that is based on reinforcement learning (RL).
  8. The apparatus of any of claims 1 to 7, wherein the apparatus is further caused to: transmit an outcome of the supported AIML functionality; and receive the at least one reward signal based on the outcome of the supported AIML functionality.
  9. The apparatus of any of claims 1 to 8, wherein the apparatus is further caused to receive an indication of whether a requirement can be met based on the requirement parameter associated with the at least one reward signal of the supported AIML functionality.
  10. The apparatus of any of claims 1 to 9, wherein the apparatus is further caused to receive an indication of deactivating a first supported AIML functionality and activating a second supported AIML functionality.
  11. The apparatus of any of claims 1 to 10, wherein the apparatus is further caused to: determine to deactivate a first supported AIML functionality and transmit an indication of deactivating the first supported AIML functionality; and determine to activate a second supported AIML functionality and transmit an indication of activating the second supported AIML functionality.
  12. The apparatus of any of claims 1 to 11, wherein the apparatus is further caused to transmit an update of the requirement parameter associated with the at least one reward signal of the supported AIML functionality.
  13. The apparatus of any of claims 1 to 12, wherein the transmitting the request for the at least one reward signal is based on at least one of: determining that an outcome of the supported AIML functionality does not meet a performance requirement; or a triggering event associated with the supported AIML functionality.
  14. The apparatus of any of claims 1 to 13, wherein the request comprises an identifier of an AIML model associated with the supported AIML functionality.
  15. The apparatus of any of claims 1 to 14, wherein the apparatus is further caused to perform at least one of a training, re-training, monitoring, or update for a reinforcement learning model based on the at least one reward signal.
  16. The apparatus of any of claims 1 to 15, wherein the apparatus is further caused to: transmit a first request indicating start of transmitting reward signals, wherein the first request comprises a time interval between transmission of the reward signals; and transmit a second request indicating stop of transmitting the reward signals.
  17. The apparatus of any of claims 1 to 16, wherein the apparatus is further caused to transmit a first request indicating start of transmitting reward signals for a time duration, wherein the first request comprises a value of the time duration.
  18. The apparatus of any of claims 1 to 17, wherein each reward signal of the at least one reward signal is associated with a corresponding action of the AIML functionality and a corresponding requirement parameter.
  19. The apparatus of any of claims 1 to 18, wherein the request for the at least one reward signal comprises at least one of: one request for the at least one reward signal per AIML functionality; one request for one reward signal, wherein the one reward signal corresponds to one action of the AIML functionality; or one request for at least one reward signal, wherein each reward signal of the at least one reward signal corresponds to at least one action of the AIML functionality.
  20. The apparatus of any of claims 1 to 19, wherein the receiving the at least one reward signal comprises receiving at least one of: at least one reward signal per AIML functionality; one reward signal, wherein the one reward signal corresponds to one action of the AIML functionality; or at least one reward signal, wherein each reward signal of the at least one reward signal corresponds to at least one action of the AIML functionality.
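
The request of claim 1, the optional fields of claim 2, and the feasibility indication of claim 9 can be pictured with a small data model. The following is only an illustrative sketch: the class names, field names, millisecond units, and the comparison logic in `can_meet` are assumptions made for this example and are not defined by the claims or by any 3GPP specification.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class RequirementParameter:
    """Hypothetical container for some of the optional fields listed in claim 2."""
    max_delay_ms: Optional[int] = None   # maximum time value of the delay requirement
    period_ms: Optional[int] = None      # period between reward signals
    duration_ms: Optional[int] = None    # time duration of reward delivery
    total_count: Optional[int] = None    # total number of reward signals
    batch_size: Optional[int] = None     # instances of each reward signal


@dataclass(frozen=True)
class RewardSignalRequest:
    """Hypothetical request sent by the user device (claims 1 and 14)."""
    functionality: str                   # e.g. "beam_management" (claim 4)
    requirement: RequirementParameter
    model_id: Optional[str] = None       # optional AIML model identifier


def can_meet(req: RequirementParameter,
             node_min_delay_ms: int,
             node_min_period_ms: int) -> bool:
    """Assumed network-side check behind the indication of claim 9.

    The node is taken to be unable to deliver rewards faster than
    node_min_delay_ms or more often than every node_min_period_ms.
    """
    if req.max_delay_ms is not None and req.max_delay_ms < node_min_delay_ms:
        return False
    if req.period_ms is not None and req.period_ms < node_min_period_ms:
        return False
    return True
```

In this sketch, fields left as `None` place no constraint on the network node, which mirrors the "at least one of" wording of claim 2.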

Description

REWARD SIGNAL REQUIREMENT PARAMETER

TECHNICAL FIELD

[0001] This description relates to wireless communications.

BACKGROUND

[0002] A communication system may be a facility that enables communication between two or more nodes or devices, such as fixed or mobile communication devices. Signals can be carried on wired or wireless carriers.

[0003] An example of a cellular communication system is an architecture that is being standardized by the 3rd Generation Partnership Project (3GPP). Long Term Evolution (LTE) is referred to as the 4G radio-access technology of the Universal Mobile Telecommunications System (UMTS). E-UTRA (Evolved UMTS Terrestrial Radio Access) is the air interface of 3GPP's LTE upgrade path for mobile networks. In LTE, base stations or access points (APs), which are referred to as evolved Node Bs (eNBs), provide wireless access within a coverage area or cell. In LTE, mobile devices or mobile stations are referred to as user equipment (UE). LTE has included a number of improvements or developments. Aspects of LTE are also continuing to improve.

[0004] 5G New Radio (NR) development is part of a continued mobile broadband evolution process to meet the requirements of 5G, similar to the earlier evolution of 3G and 4G wireless networks. In addition to mobile broadband, 5G also targets emerging use cases. A goal of 5G is to provide significant improvement in wireless performance, which may include new levels of data rate, latency, reliability, and security. 5G NR may also scale to efficiently connect the massive Internet of Things (IoT) and may offer new types of mission-critical services. For example, ultra-reliable and low-latency communications (URLLC) devices may require high reliability and very low latency. 6G and other networks are also being developed.
SUMMARY

[0005] In some aspects, the techniques described herein relate to a method including: transmitting, by a user device to a network node, a request for at least one reward signal associated with a supported artificial intelligence and machine learning (AIML) functionality, the request including a requirement parameter associated with the at least one reward signal of the supported AIML functionality; and receiving the at least one reward signal based on the requirement parameter.

[0006] In some aspects, the techniques described herein relate to a method including: receiving, by a network node from a user device, a request for at least one reward signal associated with a supported artificial intelligence and machine learning (AIML) functionality, the request including a requirement parameter associated with the at least one reward signal of the supported AIML functionality; and transmitting the at least one reward signal based on the requirement parameter.

[0007] In some aspects, the techniques described herein relate to an apparatus including: at least one processor; and at least one memory storing instructions that, when executed by the at least one processor, cause the apparatus at least to perform: transmitting, to a network node, a request for at least one reward signal associated with a supported artificial intelligence and machine learning (AIML) functionality, the request including a requirement parameter associated with the at least one reward signal of the supported AIML functionality; and receiving the at least one reward signal based on the requirement parameter.
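
As one way to picture a network node "transmitting the at least one reward signal based on the requirement parameter", the sketch below turns a starting time, a period, and either a time duration or a total number of reward signals into a list of transmission instants. The function name, the millisecond units, and the rule that the schedule stops at whichever bound is reached first are assumptions for illustration only; the description does not prescribe a scheduling policy.

```python
from typing import List, Optional


def reward_times_ms(start_ms: int,
                    period_ms: int,
                    duration_ms: Optional[int] = None,
                    total_count: Optional[int] = None) -> List[int]:
    """Instants at which a network node might send reward signals (assumed policy)."""
    if period_ms <= 0:
        raise ValueError("period_ms must be positive")
    if duration_ms is None and total_count is None:
        raise ValueError("need duration_ms or total_count to bound the schedule")

    times: List[int] = []
    t = start_ms
    while True:
        # Stop once the requested time duration has elapsed.
        if duration_ms is not None and t >= start_ms + duration_ms:
            break
        # Stop once the requested total number of reward signals is reached.
        if total_count is not None and len(times) >= total_count:
            break
        times.append(t)
        t += period_ms
    return times
```

For example, a request with a 100 ms period and a 350 ms duration starting at time 0 would yield transmissions at 0, 100, 200, and 300 ms under these assumptions.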
[0008] In some aspects, the techniques described herein relate to an apparatus including: at least one processor; and at least one memory storing instructions that, when executed by the at least one processor, cause the apparatus at least to perform: receiving, from a user device, a request for at least one reward signal associated with a supported artificial intelligence and machine learning (AIML) functionality, the request including a requirement parameter associated with the at least one reward signal of the supported AIML functionality; and transmitting the at least one reward signal based on the requirement parameter.

[0009] In some aspects, the techniques described herein relate to a method including: transmitting, by a user device to a network node, an indication of support for one or more artificial intelligence and machine learning (AIML) models, the indication including a requirement parameter associated with at least one reward signal of a supported AIML model of the one or more AIML models; transmitting a request for the at least one reward signal associated with the supported AIML model; and receiving the at least one reward signal based on the indicated requirement parameter.

[0010] In some aspects, the techniques described herein relate to a method including: receiving, by a network node from a user device, an indication of support for one or more artificial intelligence and machine learning (AIML) models, the indication including a requirement parameter associated with at least one reward signal of a support