US-20260127458-A1 - VISUAL ARGUMENTATION REASONING DEVICE AND METHOD
Abstract
The present disclosure relates to a visual argumentation reasoning device, comprising: a visual premise unit (VPU) that receives an image and detects an argumentation premise from the image to decide a visual premise; a commonsense premise unit (CPU) that extracts at least one piece of background knowledge associated with the visual premise to decide a commonsense premise; and a conclusion derivation unit that derives at least one intermediate conclusion based on the commonsense premise and the visual premise and derives a final conclusion through a logical association with the at least one intermediate conclusion.
Inventors
- Youngjae YU
- Jiwan CHUNG
Assignees
- UIF (UNIVERSITY INDUSTRY FOUNDATION), YONSEI UNIVERSITY
Dates
- Publication Date: 2026-05-07
- Application Date: 2025-03-26
- Priority Date: 2024-11-06
Claims (11)
- 1. A visual argumentation reasoning device, comprising: a visual premise unit (VPU) that receives an image and detects an argumentation premise from the image to decide a visual premise; a commonsense premise unit (CPU) that extracts at least one piece of background knowledge associated with the visual premise to decide a commonsense premise; and a conclusion derivation unit that derives at least one intermediate conclusion based on the commonsense premise and the visual premise and derives a final conclusion through a logical association with the at least one intermediate conclusion.
- 2. The device of claim 1, wherein the visual premise unit detects an object from the image and decides an argumentation premise object by determining whether the detected object is able to be used as the argumentation premise.
- 3. The device of claim 2, wherein the visual premise unit evaluates a possibility of the argumentation premise based on at least one of a shape, color, size, and texture features of the detected object.
- 4. The device of claim 3, wherein the visual premise unit evaluates the possibility of the argumentation premise based on an object disposition relationship considering a location of the detected object within the image.
- 5. The device of claim 2, wherein the visual premise unit evaluates the possibility of the argumentation premise based on whether the detected object includes text or symbols.
- 6. The device of claim 2, wherein the visual premise unit decides the visual premise by performing semantic clustering through analyzing a similarity to the argumentation premise object.
- 7. The device of claim 1, wherein the commonsense premise unit generates a textual representation of the visual premise and extracts candidate background knowledge by searching the textual representation in a knowledge base.
- 8. The device of claim 7, wherein the commonsense premise unit evaluates logical validity of the candidate background knowledge for the visual premise to decide the at least one piece of background knowledge.
- 9. The device of claim 8, wherein the commonsense premise unit decides the commonsense premise by calculating correlation of the visual premise to each of the at least one piece of background knowledge.
- 10. The device of claim 8, wherein the conclusion derivation unit decides a logical order of the at least one intermediate conclusion and performs selection and ruling out of the at least one intermediate conclusion in a process of deciding the logical order to integrate the at least one intermediate conclusion.
- 11. A visual argumentation reasoning method performed by a visual argumentation reasoning device, the method comprising: a visual premise stage that receives an image and detects an argumentation premise from the image to decide a visual premise; a commonsense premise stage that extracts at least one piece of background knowledge associated with the visual premise to decide a commonsense premise; and a conclusion derivation stage that derives at least one intermediate conclusion based on the commonsense premise and the visual premise and derives a final conclusion through a logical association with the at least one intermediate conclusion.
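The commonsense premise and conclusion derivation steps recited in claims 7 to 10 can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the claimed implementation: the toy `KNOWLEDGE_BASE`, the word-overlap correlation, the `min_score` cutoff, and every function name here are hypothetical stand-ins for whatever knowledge base and validity model an actual embodiment would use. Visual premises are assumed to be available already as short text strings.

```python
from dataclasses import dataclass

# Hypothetical toy knowledge base (assumption): keyword -> background knowledge.
KNOWLEDGE_BASE = {
    "smoke": ["Smoke usually indicates fire.", "Smoke is harmful to breathe."],
    "factory": ["Factories can emit pollutants."],
    "umbrella": ["Umbrellas are used when it rains."],
}


@dataclass
class Knowledge:
    text: str
    correlation: float  # word-overlap score against the visual premises


def tokens(text):
    """Lowercased word set with surrounding periods and commas stripped."""
    return {w.strip(".,").lower() for w in text.split()}


def commonsense_premises(visual_premises, top_k=2):
    """Search the knowledge base with the premise text and keep the
    top_k candidates ranked by a simple word-overlap correlation."""
    query = set().union(*(tokens(vp) for vp in visual_premises))
    candidates = []
    for keyword, facts in KNOWLEDGE_BASE.items():
        if keyword not in query:
            continue
        for fact in facts:
            fact_words = tokens(fact)
            score = len(fact_words & query) / len(fact_words)
            candidates.append(Knowledge(fact, score))
    candidates.sort(key=lambda k: k.correlation, reverse=True)
    return candidates[:top_k]


def derive_conclusion(visual_premises, commonsense, min_score=0.1):
    """Pair premises into intermediate conclusions, rule out weak ones,
    order the rest by correlation, and chain them into a final conclusion."""
    intermediate = []
    for k in commonsense:
        if k.correlation < min_score:
            continue  # ruled out during ordering
        for vp in visual_premises:
            if tokens(vp) & tokens(k.text):
                intermediate.append((k.correlation, f"'{vp}' and '{k.text}'"))
    intermediate.sort(key=lambda pair: pair[0], reverse=True)
    steps = [text for _, text in intermediate]
    final = " -> ".join(steps) if steps else None
    return {"intermediate": steps, "final": final}


premises = ["smoke rises from a factory"]
knowledge = commonsense_premises(premises)
result = derive_conclusion(premises, knowledge)
print(result["final"])
```

Chaining the ordered intermediate conclusions with a simple join is only a placeholder for the "logical association" of claim 1; a real embodiment would presumably use a learned or rule-based reasoner at that step.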
Description
CROSS-REFERENCE TO RELATED PATENT APPLICATION
This application claims the benefit of Korean Patent Application No. 10-2024-0156314, filed on Nov. 6, 2024, in the Korean Intellectual Property Office, the disclosure of which is incorporated herein in its entirety by reference.
TECHNICAL FIELD
The present disclosure relates to a visual argumentation reasoning technique, and more specifically, to a visual argumentation reasoning device and method capable of deriving at least one intermediate conclusion based on commonsense premises and visual premises and capable of deriving a final conclusion through a logical association with the at least one intermediate conclusion.
BACKGROUND
Commonsense-based reasoning is artificial intelligence technology that helps computers understand commonsense naturally and interact based thereon by finding methods to collect commonsense and teach the same to the computers. The four stages of commonsense-based reasoning technology are as follows:
- In a commonsense extraction stage, commonsense information may be expressed in the form of an ontology (expressing various concepts such as relationships between objects in a form that may be processed by computers) or a graph from various data sources such as text corpora, web documents, videos, and crowdsourcing.
- In a commonsense verification stage, the constructed commonsense information may be verified through question-and-answer with people such as experts.
- In a data construction stage for learning, training data for training a commonsense reasoning model from data sources and benchmark data used for verification may be configured.
- In a commonsense reasoning stage, a deep learning and probability-based reasoning model may be trained, or a situation that changes in commonsense according to a specific event may be defined as a standardized rule, and then commonsense and appropriate answers to questions may be provided.
Research is actively underway to construct standardized commonsense from various sources and benchmark data for its reasoning.
Korean Patent Application Publication No. 10-2010-0031039 (Mar. 19, 2010) discloses that: When job groups and value systems that function in society are filtered through a filter called history, they have the potential to positively contribute to social design. However, the determination of whether such potential is realized is not made by each job group and value system itself. This is possible when a network of various job groups and value systems is secured to suit the problem-solving context. The person who may work on such a network is the one with organizational thinking ability, and the purpose of various aptitude evaluation tests is to select such person. The aspect of the present disclosure is directed to providing a learning system for training organizational thinking ability, which is an essential competency to be possessed by talented people in such a modern society. The aspect of the present disclosure is achieved by a method for efficiently analyzing an argument structure of a character like an organic body by devising a visual patternization and manipulation method of the argument structure. In summary, the present disclosure relates to an argument analysis learning system in which an operation method of using a visualization tool suitable for a problem solver in an implicit manner is devised based on a visualization tool of argument structure patterns, a quasi-rule for each type, and cases to which the visualization tool and the quasi-rule are applied.
RELATED ART DOCUMENT
Patent Document
Korean Patent Application Publication No. 10-2010-0031039, Mar. 19, 2010
SUMMARY
An embodiment of the present disclosure provides a visual argumentation reasoning device and method capable of inputting an image and detecting argumentation premises from the image to decide visual premises.
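Deciding visual premises from detected objects, as elaborated in claims 2 to 5, can be sketched as a scoring and filtering step. The following Python sketch is only illustrative: object detection is assumed to have happened upstream, and the `DetectedObject` fields, the weights, and the `threshold` are hypothetical choices that the disclosure does not specify.

```python
from dataclasses import dataclass


@dataclass
class DetectedObject:
    label: str
    area_ratio: float     # size: fraction of the image covered by the object (0..1)
    center_offset: float  # location: normalized distance from image center (0..1)
    has_text: bool        # whether the object contains text or symbols


def premise_score(obj):
    """Combine size, location, and text cues into one score in [0, 1].
    The weights 0.5 / 0.3 / 0.2 are arbitrary illustration values."""
    size_term = min(obj.area_ratio * 4.0, 1.0)  # saturates at 25% coverage
    location_term = 1.0 - obj.center_offset     # central objects score higher
    text_term = 1.0 if obj.has_text else 0.0
    return 0.5 * size_term + 0.3 * location_term + 0.2 * text_term


def select_premise_objects(objects, threshold=0.5):
    """Keep the objects judged usable as argumentation premises."""
    return [o for o in objects if premise_score(o) >= threshold]


detections = [
    DetectedObject("protest sign", area_ratio=0.25, center_offset=0.1, has_text=True),
    DetectedObject("pebble", area_ratio=0.01, center_offset=0.9, has_text=False),
]
for obj in select_premise_objects(detections):
    print(obj.label, round(premise_score(obj), 2))
```

A weighted sum keeps the sketch readable; shape, color, and texture features, or the semantic clustering of claim 6, would enter as further terms or as a later grouping step.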
An embodiment of the present disclosure provides a visual argumentation reasoning device and method capable of deciding a commonsense premise by extracting at least one piece of background knowledge associated with the visual premises.
An embodiment of the present disclosure provides a visual argumentation reasoning device and method capable of deriving at least one intermediate conclusion based on commonsense premises and visual premises and capable of deriving a final conclusion through a logical association with the at least one intermediate conclusion.
According to embodiments, the visual argumentation reasoning device includes: a visual premise unit (VPU) that receives an image and detects an argumentation premise from the image to decide a visual premise; a commonsense premise unit (CPU) that extracts at least one piece of background knowledge associated with the visual premise to decide a commonsense premise; and a conclusion derivation unit that derives at least one intermediate conclusion based on the commonsense premise and the visual premise and derives a final conclusion through a logical association wi