CN-121985151-A - Low-delay video stream-partial frame
Abstract
The present disclosure relates to low-latency video stream-portion frames. In an example, a device may include logic to receive a plurality of video frames from a video source, logic to process the plurality of video frames, logic to provide at least some of the plurality of video frames to a video receiver, and logic to reduce display latency of each of the video frames provided to the video receiver by decoding and/or processing only portions of some frames.
Inventors
- RAJESH MAMIDWAR
- CHEN XUEMIN
- BRIAN HENG
Assignees
- 安华高科技股份有限公司
Dates
- Publication Date
- 20260505
- Application Date
- 20251021
- Priority Date
- 20241030
Claims (20)
- 1. An apparatus, comprising: logic for receiving a plurality of video frames from a video source; Logic for processing the plurality of video frames to generate a plurality of processed video frames, the plurality of processed video frames comprising one or more processed partial video frames, and Logic for providing at least some of the plurality of processed video frames to a video receiver; Wherein the one or more processed partial frames reduce display latency of one or more of the video frames provided to the video receiver.
- 2. The apparatus of claim 1, wherein: receiving the plurality of video frames includes receiving one or more encoded partial video frames.
- 3. The apparatus of claim 1, wherein: processing the plurality of video frames includes: the one or more partial video frames are created by one of the plurality of received video frames.
- 4. The apparatus of claim 1, wherein: The plurality of video frames received from the video source comprises a plurality of encoded video frames; processing the plurality of video frames includes: Decoding at least some of the plurality of encoded video frames to generate a plurality of decoded video frames, the plurality of decoded video frames comprising one or more decoded partial video frames; providing at least some of the plurality of processed video frames to the video receiver includes: providing at least some of the plurality of decoded video frames to the video receiver, and Reducing the display latency of each of the decoded video frames provided to the video receiver further comprises: The decoded partial video frames are provided to the video receiver.
- 5. The apparatus of claim 4, wherein Providing at least some of the plurality of decoded video frames to the video receiver includes: Each of the plurality of decoded video frames including the decoded partial video frame is provided to the video receiver via a High Definition Multimedia Interface (HDMI) connection.
- 6. The apparatus of claim 4, wherein: Providing at least some of the plurality of decoded video frames to the video receiver includes: providing at least some of the plurality of decoded video frames to the video receiver via a high definition multimedia interface, HDMI, connection, and The decoded partial video frames are provided to the video receiver via an alternative path separate from the HDMI connection.
- 7. The device of claim 4, further comprising: Logic for providing metadata to the video receiver to enable the video receiver to create a decoded full video frame using the decoded partial video frame.
- 8. The apparatus of claim 4, wherein: The plurality of encoded video frames comprising one or more encoded partial video frames encoded at an adaptive resolution, and Decoding the one or more encoded partial video frames includes: The one or more encoded partial video frames encoded at an adaptive resolution are decoded.
- 9. The apparatus of claim 4, wherein: The plurality of encoded video frames comprises: A first encoded full video frame encoded at a first resolution, and An encoded partial video frame that follows the encoded full video frame, the encoded partial video frame encoded at a second resolution, and Decoding at least some of the plurality of encoded video frames includes: decoding the encoded full video frames to produce decoded full video frames, and The encoded partial video frames are decoded to produce decoded partial video frames.
- 10. The apparatus of claim 9, wherein: Providing at least some of the plurality of decoded video frames to the video receiver includes: The decoded partial video frames are provided to the video receiver.
- 11. The apparatus of claim 9, wherein: processing the plurality of video frames includes: generating a second decoded full video frame from the first decoded full video frame and the decoded partial video frame, and Providing at least some of the plurality of decoded video frames to the video receiver includes: the second decoded full video frame is provided to the video receiver.
- 12. The apparatus of claim 9, wherein: The encoded partial video frame includes content predicted from a portion of the encoded full video frame corresponding to the encoded partial video frame using inter-coding.
- 13. The apparatus of claim 9, wherein: processing the plurality of video frames includes: Store the first decoded full video frame in a reference buffer; receiving a second encoded full video frame subsequent to the encoded partial video frame, the second encoded full video frame encoded from the first encoded full video frame with inter-frame encoding, and Decoding the second encoded full video frame using the first decoded full video frame to generate a second decoded full video frame, and Providing at least some of the plurality of decoded video frames to the video receiver includes: the second decoded full video frame is provided to the video receiver.
- 14. The device of claim 9, further comprising: logic for receiving information from the video source to indicate a position of the encoded partial video frame within a full frame.
- 15. The apparatus of claim 9, wherein: processing the plurality of video frames includes: Calculating the position of the decoded partial video frame within the full frame, and Information is provided to the video receiver to indicate the position of the decoded partial video frame within the full frame.
- 16. The apparatus of claim 1, wherein the apparatus is a set-top box, a component of a set-top box, or a system-on-a-chip SoC.
- 17. The apparatus of claim 1, wherein the apparatus is a television.
- 18. The device of claim 1, further comprising: logic for selectively disabling the logic for processing the plurality of video frames based on: An application associated with the video frame; Configuration setting or And (5) controlling a user.
- 19. A method, comprising: Receiving a plurality of video frames from a video source; processing the plurality of video frames to generate a plurality of processed video frames, the plurality of processed video frames comprising one or more processed partial video frames, and Providing at least some of the plurality of processed video frames to a video receiver; Wherein the one or more processed partial frames reduce display latency of one or more of the video frames provided to the video receiver.
- 20. A set top box, comprising: an input interface for receiving a plurality of video frames from a video source; A decoder for decoding the plurality of video frames to generate a plurality of decoded video frames, the plurality of decoded video frames comprising one or more decoded partial video frames; an output interface for providing at least some of the plurality of decoded video frames to a video receiver, and A processor for reducing display latency of the at least some of the plurality of decoded video frames provided to the video receiver using the one or more decoded partial video frames.
Description
Low-delay video stream-partial frame Cross reference to related applications This disclosure may be related to U.S. patent application Ser. No. 18/932,328 (attorney docket No. 5009.220197US01), entitled "Low latency video stream-reduced frame buffer (Low Latency Video Streaming-Reducing Frame Buffers)", and U.S. patent application Ser. No. 18/932,369 (attorney docket No. 5009.220197US03), entitled "Low latency video stream-Modified frame rate (Low Latency Video Streaming-Modified FRAME RATES)", each filed herewith by its inventor. The respective disclosures of each of these applications are incorporated herein by reference for all purposes. Technical Field This document relates generally to video streaming and, more particularly, to low latency video streaming by reducing the display latency of video frames provided to a video receiver. Background Traditional video streaming, whether broadcast or IP-based, has in the past relied on complex network infrastructure to provide a smooth video experience to end users. This typically involves the video player decoding each video and audio frame and sending it timely to the TV. Such conventional approaches rely on complex network architectures and extensive buffering to ensure consistent frame delivery. This generally results in higher costs as bandwidth and storage requirements increase. To address these challenges, various video coding standards (e.g., MPEG2, AVC, HEVC, VP, AV 1) have been developed for efficient compression, but this is typically at the cost of increased computational complexity. Furthermore, to ensure smooth video playback, buffering mechanisms have been implemented at different stages of the video pipeline, including cloud servers, networks, and video players (e.g., set top boxes or on-the-fly (OTT) clients). For example, popular streaming services like YouTube and Netflix typically buffer 10 to 40 seconds of video frames. Emerging applications such as cloud gaming, video conferencing, and virtual reality require low latency performance. These applications are pushing new network, encoder and system standards to evolve. As network speeds increase, cloud games have begun to become popular. In this model, the actual game server resides in the cloud, while the local game controller sends commands to the cloud server. The game is presented on a cloud server and the encoded video is transmitted through a video pipeline to the user's device (e.g., TV or set-top box) for display. In addition, video conferencing applications have become an indispensable need for tele-office, online education, and telemedicine. Low latency is critical to a seamless experience. Many users are turning to OTT devices or set-top boxes for larger screen displays rather than traditional conferencing equipment. Furthermore, virtual reality experiences typically require high resolution video and low latency. Cloud-based rendering may provide the required processing power, while local devices may focus on displaying rendered content. For applications such as gaming, video conferencing, and virtual reality and the like, end-to-end latency is more important than fluent video. Conventional set-top boxes and OTT devices designed for streaming media video typically rely on fixed frame rates and buffering at multiple stages of the video pipeline. In other words, the video pipeline typically includes frame buffers at different stages, such as encoder, decoder, video processing, high Definition Multimedia Interface (HDMI) input, and within the TV itself. This buffering ensures smooth playback but also causes delay because each stage buffers multiple frames to ensure that full frame data is available at the input before feeding into the output stage. Conventional pipelines typically need to transmit an entire frame regardless of the number of pixels that have changed. This approach also introduces a significant amount of latency that is not desirable for latency sensitive applications such as cloud gaming, video conferencing, and virtual reality. In other examples, at the HDMI interface, the frame rate is typically fixed and cannot be dynamically adjusted during playback. For example, in the 60 FPS configuration, one full frame of data must be sent every 1/60 second. If the next frame is not ready in time (i.e., an underflow condition), then the previous frame is repeated, resulting in potential visual artifacts. This fixed frame rate requirement may limit the ability to reduce latency in applications requiring real-time responses. Disclosure of Invention In one aspect, the disclosure relates to a device comprising logic for receiving a plurality of video frames from a video source, logic for processing the plurality of video frames to generate a plurality of processed video frames, the plurality of processed video frames comprising one or more processed partial video frames, and logic for providing at least some of the plurality of processed video frames to a video receiver, wherein th