Search

JP-7855788-B2 - Methods, devices, and systems for storing information in molecules

JP7855788B2JP 7855788 B2JP7855788 B2JP 7855788B2JP-7855788-B2

Inventors

  • ジャン、ルーシュアイ

Assignees

  • デジコドン・テクノロジーズ・カンパニー・リミテッド

Dates

Publication Date
20260508
Application Date
20220920
Priority Date
20220817

Claims (20)

  1. A method for storing information in molecules, Information to be stored is obtained, and herein, the information to be stored has one or more bit groups, the position of each bit group in the information to be stored is represented by a first address, the value of each bit group is represented by a first content, and each bit group has one or more bits. Determining a molecular module corresponding to at least one bit group from the one or more bit groups, wherein the molecular module comprises a first molecular module, the first molecular module is configured to represent both the first address and the first content of the corresponding bit group , and determining the molecular module corresponding to at least one bit group from the one or more bit groups, Converting at least a portion of the information to be stored from the first numeral system to the second numeral system, Dividing at least a portion of the information to be stored after conversion into one or more bit groups, Determine the molecular module corresponding to at least one bit group from the one or more bit groups obtained after the conversion. Equipped with, A method comprising generating a composition based on a determined molecular module such that the composition corresponds to the information to be stored.
  2. Determining the molecular module corresponding to at least one bit group among the one or more bit groups is The method according to claim 1, comprising determining the first molecular module corresponding to each bit group of the one or more bit groups.
  3. The information to be stored has a plurality of bit groups, and the values of the plurality of bit groups include at least two types of first content. Determining the molecular module corresponding to at least one bit group among the one or more bit groups is: The method according to claim 1, comprising determining that a bit group having a first content value of one kind does not correspond to any molecular module, and determining the first molecular module that corresponds to at least one bit group of other bit groups having a first content value of another kind.
  4. Determining the molecular module corresponding to at least one bit group among the one or more bit groups is: To determine whether the second bit group and the first bit group among the one or more bit groups satisfy a predetermined relationship, and in this case, whether the first address of the second bit group is different from the first address of the first bit group. If the second bit group and the first bit group satisfy the predetermined relationship, determine the molecular module corresponding to the second bit group, wherein the molecular module comprises the second molecular module, and the second molecular module is configured to represent the predetermined relationship between the corresponding second bit group and the first bit group. The method according to claim 1, comprising:
  5. The second molecular module described above is A stroke increment module, wherein the stroke increment module corresponding to a first preset number a1 is configured to represent that the second bit group and the first bit group satisfy the following preset relationship, namely, that there are a1 consecutive second bit groups immediately following the first bit group, and the first content of each second bit group is the same as the first content of the first bit group, and a1 is a positive integer. A stroke multiplex module, wherein the stroke multiplex module corresponding to a second preset number a2 is configured to represent that the second bit group and the first bit group satisfy the following preset relationship, namely, that there are (a2-1) consecutive second bit groups immediately following the first bit group, and the first content of each second bit group is the same as the first content of the first bit group, and a2 is a positive integer greater than 1. A stroke inversion module, wherein the stroke inversion module corresponding to a third preset number a3, is configured to represent the following preset relationship between the second bit group and the first bit group: that each bit in the first bit group and the second bit group has only values of 0 or 1, the second bit group has only two bits with different values, there are a3 consecutive second bit groups immediately following the first bit group, and the value of the first bit of the second bit group is different from the value of the last bit of the first bit group, and a3 is a positive integer. A stroke repetition module, wherein the stroke repetition module is configured to represent that the second bit group and the first bit group satisfy the following predetermined relationship, namely, that the first content of the second bit group is the same as the first content of the first bit group. A bit increment module, wherein the bit increment module corresponding to a fourth preset number a4 is configured to represent that the second bit group and the first bit group satisfy the following preset relationship: namely, the second bit group has only one bit, there are a4 consecutive second bit groups immediately following the first bit group, and the value of the second bit group is the same as the value of the last bit of the first bit group, and a4 is a positive integer, or a bit decrement module, wherein the bit decrement module corresponding to a fifth preset number a5 is configured to represent that the second bit group and the first bit group satisfy the following preset relationship: namely, a5 consecutive bits from back to front in the first bit group are determined to be the second bit group, and the second bit group in the first bit group is deleted, and a5 is a positive integer, and a5 is less than or equal to the total number of bits in the first bit group, The method according to claim 4, comprising at least one of the following.
  6. The first number system is binary, the second number system is decimal, and the conversion of at least a portion of the information to be stored from the first number system to the second number system is The method according to claim 1, comprising performing an iterative calculation according to xi+1 = 2xi + bi+1 to determine the at least portion of the information to be stored in decimal, wherein x0 is a preset initial value, bi+1 is the value of the (i+1)-th bit from left to right in the at least portion of the information to be stored in binary, i is an integer greater than or equal to 0, and (i+1) is equal to the total number of bits of the at least portion of the information to be stored in binary, the corresponding xi+ 1 is the at least portion of the information to be stored in decimal.
  7. The method according to claim 1, wherein the relative positions of the determined molecular modules in the composition coincide with the relative positions of the corresponding bit groups in the information to be stored.
  8. To generate the composition based on the determined molecular module such that the composition corresponds to the information to be stored is: Obtaining a unit module corresponding to the determined molecular module, and in this case, each molecular module is synthesized from one or more unit modules. The process includes synthesizing the acquired unit module in order to generate the composition corresponding to the information to be stored, or To generate the composition based on the determined molecular module such that the composition corresponds to the information to be stored is: At the connection end of the determined molecular module, a terminal portion corresponding to a pre-set sequence is formed, The method according to claim 1, comprising mixing the molecular modules formed with the corresponding terminal portions to produce the composition corresponding to the information to be stored.
  9. The molecular module comprises at least one of the following: deoxyribonucleic acid, ribonucleic acid, unnatural nucleotide, modified nucleotide, artificially synthesized nucleotide, peptide, organic polymer, small organic molecule, carbon nanomaterial, inorganic substance, or separated molecular fragment, The method according to claim 1, wherein various molecular modules are distinguished by at least one of the sequence distribution, sequence length, secondary structure, crystalline or amorphous nature, or morphology of the molecular module.
  10. A method for storing information in molecules, Information to be stored is obtained, and herein, the information to be stored has one or more bit groups, the position of each bit group in the information to be stored is represented by a first address, the value of each bit group is represented by a first content, and each bit group has one or more bits. Determining one or more target bit groups according to the distribution of each type of first content in the information to be stored, Determining a molecular module corresponding to each of the one or more target bit groups, wherein the molecular module comprises a third molecular module, and the third molecular module is configured to represent the first address of the corresponding target bit group. A method comprising generating a composition based on a determined molecular module such that the composition corresponds to the information to be stored.
  11. Determining the one or more target bit groups according to the distribution of each type of first content in the information to be stored is: The frequency of occurrence of each type of first content in the information to be stored is counted, and the most frequent content with the highest occurrence frequency is determined. The method according to claim 10 , further comprising determining a bit group having a value other than the most frequent content as the target bit group.
  12. The third molecular module comprises one or more submolecular modules, Determining the molecular module corresponding to each target bit group of the one or more target bit groups is: Dividing the first address of the target bit group into one or more address bit groups, wherein the position of each address bit group in the first address of the target bit group is represented by a second address, the value of each address bit group is represented by the second content, and each address bit group has one or more bits. Determining a submolecule module corresponding to at least one address bit group among the one or more address bit groups, wherein the submolecule module comprises a first submolecule module, the first submolecule module is configured to represent both the second address and the second content of the corresponding address bit group. The method according to claim 10 , further comprising determining the third molecular module based on the combination of submolecular modules determined above.
  13. Determining the submolecule module corresponding to at least one address bit group among the one or more address bit groups is: To determine whether a second address bit group and a first address bit group among the one or more address bit groups satisfy a predetermined relationship, and in this case, whether the second address of the second address bit group is different from the second address of the first address bit group. If the second address bit group and the first address bit group satisfy the predetermined relationship, the submolecule module corresponding to the second address bit group is determined, wherein the submolecule module comprises the second submolecule module, and the second submolecule module is configured to represent the predetermined relationship between the corresponding second address bit group and the first address bit group. The method according to claim 12 , comprising:
  14. The second submolecule module described above is The method according to claim 13, comprising a stroke increment module, wherein the stroke increment module corresponding to a sixth preset number a6 is configured to represent that the second address bit group of the second bit group and the first address bit group of the first bit group satisfy the following preset relationship: namely, the sum of the first address bit group and a6 is the second address bit group, and the first content of the second bit group is the same as the first content of the first bit group.
  15. The method according to claim 10 , wherein the molecular module further comprises a fourth molecular module, the fourth molecular module configured to represent the first content of the corresponding target bit group.
  16. Determining the molecular module corresponding to each target bit group of the one or more target bit groups is: Converting at least a portion of the information to be stored from the first numeral system to the second numeral system, Dividing at least a portion of the information to be stored after conversion into one or more bit groups, The method according to claim 10 , further comprising determining a molecular module corresponding to at least one bit group from the one or more bit groups obtained after conversion.
  17. The first number system is binary, the second number system is decimal, and the conversion of at least a portion of the information to be stored from the first number system to the second number system is The method according to claim 16, comprising performing an iterative calculation according to xi+1 = 2xi + bi+1 to determine the at least portion of the information to be stored in decimal, wherein x0 is a preset initial value, bi+1 is the value of the (i+1)-th bit from left to right in the at least portion of the information to be stored in binary, i is an integer greater than or equal to 0, and (i+1) is equal to the total number of bits of the at least portion of the information to be stored in binary, the corresponding xi+ 1 is the at least portion of the information to be stored in decimal.
  18. The method according to claim 10 , wherein the relative positions of the determined molecular modules in the composition coincide with the relative positions of the corresponding bit groups in the information to be stored.
  19. To generate the composition based on the determined molecular module such that the composition corresponds to the information to be stored is: Obtaining a unit module corresponding to the determined molecular module, and in this case, each molecular module is synthesized from one or more unit modules. The process includes synthesizing the acquired unit module in order to generate the composition corresponding to the information to be stored, or To generate the composition based on the determined molecular module such that the composition corresponds to the information to be stored is: At the connection end of the determined molecular module, a terminal portion corresponding to a pre-set sequence is formed, The method according to claim 10 , comprising mixing the molecular modules formed with the corresponding terminal portions to produce the composition corresponding to the information to be stored.
  20. The molecular module comprises at least one of the following: deoxyribonucleic acid, ribonucleic acid, unnatural nucleotide, modified nucleotide, artificially synthesized nucleotide, peptide, organic polymer, small organic molecule, carbon nanomaterial, inorganic substance, or separated molecular fragment, The method according to claim 10 , wherein various molecular modules are distinguished by at least one of the sequence distribution, sequence length, secondary structure, crystalline or amorphous nature, or morphology of the molecular module.

Description

Cross-references to related applications [0001] This application claims priority to Chinese Patent Application No. 202210988065.6, filed on 17 August 2022, entitled “METHOD, DEVICE AND SYSTEM FOR STORING INFORMATION IN MOLECULE,” the entirety of which is incorporated herein by reference. [0002] This disclosure relates to the field of storage technology, and more particularly to methods, devices, and systems for storing information in molecules. [0003] With the significant advancements in information technology, people's demand for data storage is rapidly increasing. Conventional data storage media include hard disks, flash memory, magnetic tape, and optical discs, which have problems such as low storage density, short storage time, and high energy consumption. To achieve higher storage density and more reliable storage efficiency, it is now possible to store information in molecules. [0004] In related technologies, a suitable solution for storing information in molecules that can be integrated with computer systems, in terms of coding efficiency, write speed, and cost, still does not exist. Therefore, existing molecular storage technologies need to be improved. [0005] Accordingly, embodiments of this disclosure propose the following solutions. [0006]According to one embodiment of the present disclosure, a method for storing information in a molecule, [0007] Acquiring information to be stored, wherein the information to be stored has one or more bit groups, the position of each bit group in the information to be stored is represented by a first address, the value of each bit group is represented by a first content, and each bit group has one or more bits, [0008] Determining a molecular module corresponding to at least one bit group from one or more bit groups, wherein the molecular module includes a first molecular module, and the first molecular module is configured to represent both a first address and a first content of the corresponding bit group. [0009] A method is provided which includes generating a composition based on a determined molecular module such that the composition corresponds to information to be stored. [0010] In some embodiments, retrieving the information to be stored is [0011] Dividing initial information to be stored in order to generate one or more pieces of information to be stored, wherein the number of bits of each piece of information to be stored is less than the number of bits of the initial information to be stored, and the number of bits of each piece of information to be stored are equal to or unequal to each other, [0012] In order to generate information to be stored, a combination of multiple initial pieces of information to be stored is made, wherein the number of bits of the information to be stored is greater than the number of bits of each initial piece of information to be stored. Includes. [0013] In some embodiments, determining a molecular module corresponding to at least one bit group out of one or more bit groups is [0014] This includes determining a first molecular module corresponding to each bit group of one or more bit groups. [0015] In some embodiments, the information to be stored has a plurality of bit groups, and the values of the plurality of bit groups include at least two kinds of first content, [0016] Determining a molecular module that corresponds to at least one bit group out of one or more bit groups is: [0017] The process includes determining that a bit group having a first content value of one kind does not correspond to any molecular module, and determining a first molecular module that corresponds to at least one bit group of other bit groups having a first content value of another kind. [0018] In some embodiments, determining the molecular module corresponding to at least one bit group out of one or more bit groups is: [0019] Determining whether a second bit group and a first bit group among one or more bit groups satisfy a predetermined relationship, and in this case, whether the first address of the second bit group is different from the first address of the first bit group. [0020] If the second bit group and the first bit group satisfy a predetermined relationship, the molecular module corresponding to the second bit group is determined, wherein the molecular module includes the second molecular module and is configured to represent the predetermined relationship between the corresponding second bit group and the first bit group. Includes. [0021] In some embodiments, the second molecular module is [0022] Stroke incremental module, wherein the stroke incremental module corresponding to a first preset number a1 is configured to represent that the second bit group and the first bit group satisfy the following preset relationship, namely, that there are a1 consecutive second bit groups immediately following the first bit group, and the first content of each second bit group is the same as the first content of the first bit group, and a1 is a posi