CN-122002090-A - Display device and image editing method
Abstract
Some embodiments of the present application provide a display device and an image editing method, where the method may, when playing a target video, respond to an image editing instruction, intercept a video playing frame currently displayed by a display, so as to obtain an image to be edited. And controlling the display to display an image editing interface, and displaying an image to be edited on the image editing interface, wherein the display layer level of the image editing interface is higher than the display layer level of the video playing picture. And acquiring the image prompt word, and executing image editing processing on the image to be edited according to the image prompt word through the image editing model so as to obtain a target image. And finally, controlling a display to display the target image on an image editing interface. The method can intercept images when playing video, and edit the images based on the model to generate images which meet the expectations of users.
Inventors
- WANG XIAOHONG
Assignees
- 海信视像科技股份有限公司
Dates
- Publication Date
- 20260508
- Application Date
- 20241107
Claims (15)
- 1. A display device, characterized by comprising: a display configured to display a user interface; A controller coupled with the display and configured to: When playing a target video, responding to an image editing instruction, and intercepting a video playing picture currently displayed by the display to obtain an image to be edited; Controlling the display to display an image editing interface, and displaying the image to be edited on the image editing interface, wherein the display layer level of the image editing interface is higher than the display layer level of the video playing picture; Acquiring an image prompt word, wherein the image prompt word is used for representing the requirement of image editing processing; Executing image editing processing on the image to be edited according to the image prompt words through an image editing model to obtain a target image, wherein the image editing model is a pre-trained neural network model for executing image editing; And controlling the display to display the target image on the image editing interface.
- 2. The display device of claim 1, further comprising a communication means configured to establish a communication connection with a control device, the controller further configured to: And when the target video is played, receiving an image editing instruction sent by the control equipment through the communication device.
- 3. The display device of claim 1, further comprising a sound collector for collecting a voice signal, the controller further configured to: Acquiring a voice signal input by a user and acquired by the sound acquisition device; Identifying an intent of the speech signal; And if the intention characterization executes image editing, generating an image editing instruction.
- 4. The display device according to claim 1, wherein the controller is configured to intercept a video play frame currently displayed by the display to obtain an image to be edited, and is configured to: And calling a screenshot interface to intercept a currently played video frame so as to obtain an image to be edited.
- 5. The display device according to claim 1, wherein the controller is configured to intercept a video play frame currently displayed by the display to obtain an image to be edited, and is configured to: Calling a cache access interface to acquire a cached video frame sequence of the target video; Acquiring a first time point, wherein the first time point is a time point when the image editing instruction is received or a time point of a first time threshold before the image editing instruction is received; and extracting the video frame corresponding to the first time point from the video frame sequence to obtain an image to be edited.
- 6. The display device of claim 1, wherein the controller is configured to control the display to display an image editing interface, in particular: creating the image editing interface, wherein the interface size of the image editing interface is smaller than or equal to the interface size of the display; And pausing playing of the target video, and controlling the display to display the image editing interface on the upper layer of the video playing picture.
- 7. The display device of claim 6, wherein the controller is configured to control the display to display an image editing interface and to display the image to be edited on the image editing interface, and is configured to: starting an image editing application; controlling the image editing application to send a pause instruction for indicating to pause playing of the target video to a video player, and controlling the image editing application to send an interface display request carrying interface display data to an interface management module, wherein the interface display data comprises the image to be edited and the interface data of the image editing interface; responding to the interface display request, controlling the display to display an image editing interface through the interface management module according to the interface display data, and displaying the image to be edited on the image editing interface; and responding to the pause instruction, and controlling the video player to pause the playing of the target video.
- 8. The display device of claim 6, wherein the controller, after performing the step of controlling the display to display the target image at the image editing interface, is further configured to: In response to a closing instruction for indicating to close the image editing interface, destroying the image editing interface, and controlling the display to cancel displaying the image editing interface; And resuming the playing of the target video, and controlling the display to continuously display the video playing picture corresponding to the target video.
- 9. The display device of claim 8, wherein the controller executes a close instruction for instructing to close the image editing interface, destroys the image editing interface, controls the display to cancel displaying the image editing interface, resumes playing the target video, controls the display to continue displaying the video playing picture corresponding to the target video, and is specifically configured to: Responding to a closing instruction for indicating to close the image editing interface, and controlling the image editing application to send an interface cancel display request to the interface management module; responding to the interface cancel display request, destroying the image editing interface through the interface management module, and controlling the display to cancel displaying the image editing interface through the interface management module; Controlling the interface management module to send a playing instruction for indicating to play a target video to a video player; And responding to the playing instruction, and controlling the video player to continue playing the target video.
- 10. The display device of claim 1, wherein the image editing interface comprises an original region for displaying the image to be edited and a generation region for displaying the target image.
- 11. The display device of claim 1, wherein the image editing interface comprises an input area and a prompt word area, the input area comprising a plurality of input controls, the prompt word area comprising a text editing control, the controller executing the acquire image prompt word specifically configured to: and responding to a selection instruction aiming at the input control, receiving an image prompt word input by a user according to an input mode corresponding to the input control, and controlling the display to display the image prompt word in the text editing control.
- 12. The display device of claim 1, wherein the image editing interface comprises an application extension, the application extension comprising a plurality of extension controls, the controller performing the step of controlling the display to display the target image at the image editing interface is further configured to: And responding to a selected instruction aiming at the expansion control, and executing image processing operation corresponding to the expansion control on the target image.
- 13. The display device according to claim 1, wherein the controller performs image editing processing on the image to be edited in accordance with the image prompt word by the image editing model, specifically configured to: identifying image description information of the image to be edited, wherein the image description information comprises description information of physical objects in the image to be edited; performing masking operation on the image to be edited according to the image description information to obtain a masking image, wherein the masking image is used for identifying a physical object in the image to be edited; Inputting the image prompt words and the mask image into the image editing model; And acquiring the target image which is output by the image editing model and subjected to image editing processing.
- 14. The display device of claim 1, wherein the controller is further configured to: when a target video is played, responding to a video editing instruction, and acquiring a video prompt word, wherein the video prompt word is used for representing the requirement of video editing processing; acquiring a video frame sequence of the target video; Extracting a video frame sequence to be edited from the video frame sequence, wherein the video frame sequence to be edited comprises video frames after a second time point in the video frame sequence, and the second time point is a time point when the video editing instruction is received or a time point of a second time threshold after the video editing instruction is received; Executing video editing processing on the video frame sequence to be edited according to the video prompt words through a video editing model to obtain a target video frame sequence, wherein the video editing model is a pre-trained neural network model for executing video editing; replacing a video frame sequence to be edited in the video frame sequence with the target video frame sequence; and continuing to play the target video according to the replaced video frame sequence.
- 15. An image editing method, characterized by being applied to a display device, the display device comprising a display and a controller, the method comprising: When playing a target video, responding to an image editing instruction, and intercepting a video playing picture currently displayed by the display to obtain an image to be edited; Controlling the display to display an image editing interface, and displaying the image to be edited on the image editing interface, wherein the display layer level of the image editing interface is higher than the display layer level of the video playing picture; Acquiring an image prompt word, wherein the image prompt word is used for representing the requirement of image editing processing; Executing image editing processing on the image to be edited according to the image prompt words through an image editing model to obtain a target image, wherein the image editing model is a pre-trained neural network model for executing image editing; And controlling the display to display the target image on the image editing interface.
Description
Display device and image editing method Technical Field The present application relates to the field of display devices, and in particular, to a display device and an image editing method. Background The display device refers to a terminal device capable of outputting a specific display screen, and may be a terminal device such as a smart television, a communication terminal, a smart advertisement screen, and a projector. Taking intelligent electricity as an example, the intelligent television is based on the Internet application technology, has an open operating system and a chip, has an open application platform, can realize a bidirectional man-machine interaction function, and is a television product integrating multiple functions of video, entertainment, data and the like, and the intelligent television is used for meeting the diversified and personalized requirements of users. The display device may run a digital image processing application through which various complex image editing operations are performed to generate images that meet user requirements. However, the digital image processing application requires a user to have high editing skills and experience, and limits the convenience of use of the user. The display device may also generate images using Text-to-Image (Text-to-Image) models, which have a high degree of freedom in generating images, but require the user to input complex prompt words to generate an ideal Image. Disclosure of Invention The application provides a display device and an image editing method, which are used for solving the problem of high image editing complexity. In a first aspect, the present application provides a display device comprising a display and a controller coupled to the display. Wherein the display is configured to display a user interface, and the controller is configured to: when playing a target video, responding to an image editing instruction, and intercepting a video playing picture currently displayed by a display to obtain an image to be edited; controlling a display to display an image editing interface, and displaying an image to be edited on the image editing interface, wherein the display layer level of the image editing interface is higher than the display layer level of a video playing picture; acquiring an image prompt word, wherein the image prompt word is used for representing the requirement of image editing processing; Executing image editing processing on the image to be edited according to the image prompt words through an image editing model to obtain a target image, wherein the image editing model is a pre-trained neural network model for executing image editing; and controlling the display to display the target image on the image editing interface. The technical scheme has the advantages that in the process of playing the target video, the video playing picture is taken as the image to be edited, the image to be edited is displayed on an intuitive and high-level image editing interface, the image editing model is combined to edit the image to be edited, the convenience and the efficiency of image editing are improved, and the diversified requirements of users on image editing can be met. In an alternative embodiment, the controller further comprises a communication device configured to establish a communication connection with the control device, the controller further configured to: When playing the target video, the communication device receives an image editing instruction sent by the control equipment. The technical scheme has the advantages that the user can use the control equipment to send the instruction by establishing communication connection with the control equipment, so that the image editing operation is more convenient and flexible, and the user experience is improved. In an alternative embodiment, the device further comprises a sound collector for collecting voice signals, and the controller is further configured to: acquiring a voice signal input by a user and acquired by a sound acquisition device; identifying an intent of the speech signal; If the intention characterization performs image editing, generating an image editing instruction. The technical scheme has the advantages that through introducing the voice control function, a user can trigger image editing operation through voice instructions, and the convenience of operation and user experience are further improved. In an alternative embodiment, the controller performs capturing a video playing frame currently displayed by the display to obtain an image to be edited, and is specifically configured to: And calling a screenshot interface to intercept a currently played video frame so as to obtain an image to be edited. The technical scheme has the advantages that the video frames which are currently played are intercepted by directly calling the screenshot interface, so that the quality of the intercepted images can be ensured, and the instantaneity is high. Me