KR-102964506-B1 - Information processing device, information processing device and program
Abstract
This information processing device comprises a media playback unit that acquires and plays video data including a service object available for processing a request by voice from a user, and a control unit that adds an additional image to the played video to teach the service object to the user, and stores identification information of the video data and information of the start and end times of the additional image as a bookmark for a scene having the additional image arbitrarily selected by the user.
Inventors
- 야마기시 야스아키
- 기야마 유카
Assignees
- 소니그룹주식회사
Dates
- Publication Date
- 20260513
- Application Date
- 20190318
- Priority Date
- 20180326
Claims (16)
- As an information processing device, Acquiring video data and playing the said video data to present a played video including a service object capable of using a service that processes requests by voice from a user; Acquire metadata including identification information of the above image data; According to the above metadata, an additional image is added to the played video to teach the service object to the user; A processing circuit configured to store identification information of the image data and time information of the start and end times of the additional image, as a bookmark selectable by the user for playback of a scene of the played image having the additional image in response to a user instruction. An information processing device equipped with
- In paragraph 1, The above processing circuit is, Receive the selection of the above bookmark from the above user, and In response to the above selection, configured to play the image data to present the scene of the played image having the additional image based on identification information of the image data corresponding to the bookmark and time information of the start time and end time of the additional image, Information processing device.
- In paragraph 2, The above processing circuit is, Metadata including identification information of the above image data and time information of the start time and end time of the above additional image is obtained, Configured to generate the above additional image and add the above additional image based on the above acquired metadata, Information processing device.
- In paragraph 3, The above metadata includes service backend control information including a function name representing a function of the service specified by an utterance from the user, and The above processing circuit is configured to present the function name of the service backend control information included in the metadata corresponding to the bookmark selected by the user. Information processing device.
- In paragraph 4, The above metadata includes information for requesting different functions for each time period under a single function name, and The above processing circuit is configured to transmit the request to a server that switches the function of the service based on the above information, Information processing device.
- In paragraph 1, The above processing circuit is configured to impose a restriction that limits the use of the service for each of the above service objects. Information processing device.
- In paragraph 6, The above restriction is a restriction due to billing, Information processing device.
- In paragraph 6, The above restriction is a restriction regarding whether the metadata of the above-mentioned additional image can be shared on a community service, Information processing device.
- In paragraph 1, The above additional image has unique visual characteristics for each service object so that the service object can be uniquely identified by voice recognition in the service. Information processing device.
- In paragraph 1, The above additional image is presented at a location attached to the service object, Information processing device.
- In paragraph 3, The above processing circuit is, Acquire a Media Presentation Description file containing the AdaptationSet of the above metadata; Interpret this Media Presentation Description file; The above video data and the above metadata are each acquired as Media Segments of MPEG-Dynamic Adaptive Streaming over HTTP; Configured to present the above-mentioned reproduced video and the above-mentioned additional image based on the above-mentioned metadata in synchronization with each other, Information processing device.
- In paragraph 1, The above metadata further includes filtering information indicating whether the addition of the above additional image of the above service object to the above-reproduced image is restricted. Information processing device.
- A step of acquiring image data and playing said image data to present a reproduced image including a service object capable of using a service that processes a request by voice from a user; A step of acquiring metadata including identification information of the above image data; A step of adding an additional image to the reproduced image to teach the service object to the user, according to the metadata, by means of a processing circuit of an information processing device; and A step of preserving identification information of the image data and time information of the start time and end time of the additional image as a bookmark selectable by the user for playback of the scene of the played image having the additional image, in response to a user instruction by the processing circuit of the information processing device. Information processing method including
- In Paragraph 13, The above metadata further includes filtering information indicating whether the addition of the above additional image of the above service object to the above-reproduced image is restricted. Information processing method.
- A non-transient computer-readable storage medium storing a computer program that causes the computer to perform the method according to claim 13 when executed by the computer.
- In paragraph 15, The above metadata further includes filtering information indicating whether the addition of the above additional image of the above service object to the above-reproduced image is restricted. Non-transient computer-readable storage media.
Description
Information processing device, information processing device and program The present invention relates to an information processing device, an information processing device, and a program for receiving and playing video content including video, and in particular to an information processing device, an information processing method, and a program suitable for cases where the video content is associated with a voice-based information service for a user of the information processing device. In recent years, voice AI assistant services have become widespread. This is an information service in which a terminal corresponding to the service receives and recognizes a request made by voice uttered by a user of an information processing device through a microphone or the like, interprets the data, executes a service according to the user's request, and responds to the user with the result of the execution in the form of voice (see, for example, Patent Document 1). Currently, Amazon Echo's Alexa (registered trademark) is known as a cloud-based voice AI assistant service. FIG. 1 is a block diagram showing the overall configuration of an information processing system (100) including an information processing device (4) of a first embodiment according to the present technology. FIG. 2 is a sequence diagram showing the flow of the overall operation (the first) in the information processing system (100) of FIG. 1. FIG. 3 is a sequence diagram showing the overall flow of operation (the second) in the information processing system (100) of FIG. 1. FIG. 4 is a sequence diagram showing the overall flow of operation (the third) in the information processing system (100) of FIG. 1. Figure 5 is a diagram showing an example of an image in which an additional image is superimposed. Figure 6 is a block diagram showing the configuration of POI metadata. Figure 7 is a diagram showing another example of an image in which an additional image is superimposed. Figure 8 is a diagram showing the limitations on the presentation of additional images. FIG. 9 is a sequence diagram showing the flow of the overall operation (the third one) including the presentation restriction of additional images in the information processing system (100) of FIG. 1. Figure 10 is a diagram illustrating trick play playback based on POI metadata. FIG. 11 is a diagram showing an example of an application execution environment (43) that processes POI metadata. FIG. 12 is a diagram showing another example of an application execution environment (43) that processes POI metadata. Figure 13 is a diagram showing an example of a Multi-part MIME format for packaging a web application and POI metadata. Figure 14 is a diagram showing the configuration of the Media Segment in the MP4 file format. Figure 15 is a diagram showing the data structure of the MPD of MPEG-DASH. FIG. 16 is a diagram showing the exchange of information via network communication between an MPEG-DASH server (15) and an information processing device (4). Figure 17 is a diagram showing the flow of presentation control for MPEG-DASH video content. Figure 18 is a diagram showing the configuration of an MPD with an AdaptationSet of POI metadata added. Figure 19 is a diagram showing a more specific example of an MPD with an AdaptationSet of POI metadata added. Figure 20 is a diagram showing the presentation flow of images and additional images based on MPD. FIG. 21 is a diagram showing POI metadata in the case where the presentation position of an additional image is moved along with the movement of a service object. Figure 22 is a diagram illustrating presentation update control over multiple sample times of POI metadata. Figure 23 is a diagram showing a technical example of POI usage restriction information by ODRL. FIG. 24 is a sequence diagram showing the operation of a billing restriction for using a voice assistant service. FIG. 25 is a block diagram showing the configuration according to time shift playback using bookmarks in the information processing system (100) of the present embodiment. Figure 26a is a sequence diagram showing the flow of time-shifted playback using bookmarks. FIG. 26b is a sequence diagram showing the flow of time-shifted playback using bookmarks, continuing from FIG. 26a. Figure 27 is a diagram showing an example of POI metadata associated with a bookmark. FIG. 28 is a diagram showing the change in the value of the ContextID attribute in POI metadata associated with each of the scenes of two time periods to which different voice AI assistant service programs are assigned. FIG. 29 is a diagram showing a technical example of share availability control information by ODRL. Figure 30 is a diagram showing a method for creating a scene capture. Hereinafter, embodiments according to the present technology are described. <Summary of the information processing device of the present embodiment> The information processing device (4) of the present embodiment has an AV decoder (41) that acqu