US-12622617-B2 - Audio and visual input analysis

US12622617B2US 12622617 B2US12622617 B2US 12622617B2US-12622617-B2

Abstract

Methods, devices, and systems associated with audio and visual input analysis are described. A method can include recording audio during production of artwork using a microphone, storing the audio recording in tiered memory, generating an image of the artwork using an image sensor, storing the image of the artwork in the tiered memory, receiving a reference library including a number of audio recordings and a number of images of artwork, comparing the image of the artwork to the number of images of artwork and the audio recording to the number of audio recordings, and providing an output in response to at least one of: the image of the artwork matching one of the number of images of artwork or the audio recording matching one of the number of audio recordings.

Inventors

Yunju LEE
Sue-Fern Ng
Bhumika CHHABRA

Assignees

MICRON TECHNOLOGY, INC.

Dates

Publication Date: 20260512
Application Date: 20211229

Claims (13)

1 . A method, comprising: recording audio during production of artwork using a microphone; storing the audio recording in tiered memory; generating an image of the artwork using an image sensor; storing the image of the artwork in the tiered memory; receiving a reference library including a number of audio recordings and a number of images of artwork; comparing the image of the artwork to the number of images of artwork and the audio recording to the number of audio recordings, wherein a processor is configured to store the audio recording and the image of the artwork in the tiered memory to perform the comparison more efficiently; providing an output in response to at least one of: the image of the artwork matching one of the number of images of artwork or the audio recording matching one of the number of audio recordings; and requesting permission, via a user interface, to transmit at least one of: the image of the artwork or the audio recording in response to at least one of: the image of the artwork being different from the number of images of artwork or the audio recording being different from the number of audio recordings.
2 . The method of claim 1 , further comprising providing instructions to produce the artwork via the user interface.
3 . The method of claim 1 , further comprising providing a number of questions via the user interface.
4 . The method of claim 3 , further comprising receiving answers to the number of questions via the user interface.
5 . The method of claim 1 , further comprising receiving permission, via the user interface, to transmit at least one of: the image of the artwork or the audio recording.
6 . The method of claim 5 , further comprising identifying a doctor, a psychologist, a psychiatrist, a social worker, or a support group based at least in part on at least one of: the image of the artwork or the audio recording in response to at least one of: the image of the artwork being different from the number of images of artwork or the audio recording being different from the number of audio recordings.
7 . The method of claim 6 , further comprising transmitting at least one of: the image of the artwork or the audio recording to at least one of: the doctor, the psychologist, the psychiatrist, the social worker, or the support group.
8 . The method of claim 7 , further comprising receiving correspondence, via the user interface, from at least one of: the doctor, the psychologist, the psychiatrist, the social worker, or the support group.
9 . The method of claim 1 , further comprising providing the output including at least one of: a diagnosis, an exercise, advice, or contact information via the user interface.
10 . An apparatus, comprising: a microphone configured to record audio during production of artwork; an image sensor configured to generate an image of the artwork; a memory resource including tiered memory; and a processing resource configured to: compare the image of the artwork to a number of images of artwork and the audio recording to a number of audio recordings, wherein the processor is configured to store the audio recording and the image of the artwork in the tiered memory to perform the comparison more efficiently; and request permission, via a user interface, to transmit at least one of: the image of the artwork or the audio recording in response to at least one of: the image of the artwork being different from the number of images of artwork or the audio recording being different from the number of audio recordings.
11 . The apparatus of claim 10 , wherein the processing resource is configured to input at least one of: a timestamp or a location into an artificial intelligence (AI) model.
12 . The apparatus of claim 10 , wherein the user interface is configured to receive data from a caretaker.
13 . The apparatus of claim 12 , wherein the processing resource is configured to: input the data from the caretaker into an artificial intelligence (AI) model; perform an AI operation using the AI model; and output a result of the AI operation.

Description

TECHNICAL FIELD The present disclosure relates generally to apparatuses, systems, and methods associated with audio and visual input analysis. BACKGROUND A computing device can be a smartphone, a wearable device, a tablet, a laptop, a desktop computer, or a smart assistant device, for example. The computing device can receive and/or transmit data and can include or be coupled to one or more memory devices. Memory devices are typically provided as internal, semiconductor, integrated circuits in computers or other electronic systems. There are many different types of memory including volatile and non-volatile memory. Volatile memory can require power to maintain its data (e.g., host data, error data, etc.) and includes random access memory (RAM), dynamic random-access memory (DRAM), static random-access memory (SRAM), synchronous dynamic random-access memory (SDRAM), and thyristor random access memory (TRAM), among others. Non-volatile memory can provide persistent data by retaining stored data when not powered and can include NAND flash memory, NOR flash memory, and resistance variable memory such as phase change random access memory (PCRAM), resistive random-access memory (RRAM), and magnetoresistive random access memory (MRAM), such as spin torque transfer random access memory (STT RAM), among others. BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 illustrates an example of a computing device for audio and visual input analysis in accordance with a number of embodiments of the present disclosure. FIG. 2 illustrates an example of a system for audio and visual input analysis in accordance with a number of embodiments of the present disclosure. FIG. 3 illustrates an example of a flow diagram for audio and visual input analysis in accordance with a number of embodiments of the present disclosure. FIG. 4 is a flow diagram of a method for audio and visual input analysis in accordance with a number of embodiments of the present disclosure. DETAILED DESCRIPTION The present disclosure includes apparatuses, systems, and methods related to recording audio during production of artwork using a microphone, storing the audio recording in tiered memory, generating an image of the artwork using an image sensor, storing the image of the artwork in the tiered memory, receiving a reference library including a number of audio recordings and a number of images of artwork, comparing the image of the artwork to the number of images of artwork and the audio recording to the number of audio recordings, and providing an output in response to the image of the artwork matching one of the number of images of artwork and/or the audio recording matching one of the number of audio recordings. Diagnosing mental and/or behavioral issues, disorders, and/or diseases are normally done in person during a pre-scheduled appointment with a healthcare provider. A patient may not be displaying signs or symptoms during a pre-scheduled appointment which could prevent the patient from getting a diagnosis and/or proper medications or tools to cope with their disorder or disease. This can inconvenience or endanger the patient. Embodiments of the present disclosure can enable detection of mental and/or behavioral issues, disorders, and/or diseases and provide assistance in real-time. Often data including signs and symptoms over time are not available to a healthcare provider for diagnosing a patient or gauging progress of the patient. Instead, healthcare providers rely on the patient and caretaker to convey symptoms based on their memory and knowledge, which may be limited, incomplete, and/or over a short period of time. Providing audio and visual inputs over time can convey a more comprehensive and detailed picture of the patient's symptoms and progression for the healthcare provider, which can lead to a more accurate diagnosis and a more precise treatment. As used herein, “a number of” something can refer to one or more of such things. A “plurality” of something intends two or more. The figures herein follow a numbering convention in which the first digit or digits correspond to the drawing figure number and the remaining digits identify an element or component in the drawing. Similar elements or components between different figures may be identified by the use of similar digits. For example, reference numeral 106 may reference element “6” in FIG. 1, and a similar element may be referenced as 206 in FIG. 2. In some instances, a plurality of similar, but functionally and/or structurally distinguishable, elements or components in the same figure or in different figures may be referenced sequentially with the same element number. As will be appreciated, elements shown in the various embodiments herein can be added, exchanged, and/or eliminated so as to provide a number of additional embodiments of the present disclosure. In addition, the proportion and the relative scale of the elements provided in the figures are intended to illustrate various embodiments of the