
CN-121995488-A - OVT data coverage count (fold) extraction method and storage medium


Abstract

The invention provides a method for extracting the coverage count (fold) of OVT data, and a storage medium, in the field of seismic data processing for seismic exploration. The method comprises: step 1, importing OVT-domain seismic data into the distributed file system HDFS; step 2, splitting the seismic data in HDFS; step 3, constructing a first resilient distributed dataset (RDD); step 4, constructing a second RDD; step 5, aggregating the data to construct a third RDD; step 6, counting the number of traces at the same position to obtain the coverage count; and step 7, generating a coverage count attribute map. The invention makes the coverage count of OVT-domain data easier to inspect and effectively solves the problem of quality control of that coverage count.
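The counting idea behind steps 6 and 7 can be sketched in plain Python, with no Spark or HDFS involved. This is only an illustrative sketch: the trace tuples, the function names `fold_per_bin` and `fold_grid`, and the grid layout are assumptions made here, not part of the patent text.

```python
from collections import Counter

# Hypothetical trace headers: (ovt_num, inline_num, crossline_num).
traces = [
    (1, 100, 200),
    (1, 100, 200),  # a second trace in the same bin of OVT 1
    (1, 100, 201),
    (2, 100, 200),
]


def fold_per_bin(headers):
    """Count the traces that share the same (ovt_num, inline_num, crossline_num)."""
    return Counter(headers)


def fold_grid(fold, ovt_num, inlines, crosslines):
    """Lay out the counts of one OVT as a 2D grid (rows: inline, columns: crossline)."""
    return [
        [fold.get((ovt_num, il, xl), 0) for xl in crosslines]
        for il in inlines
    ]


fold = fold_per_bin(traces)
grid = fold_grid(fold, 1, inlines=[100], crosslines=[200, 201, 202])
```

In this sketch a bin with no traces shows up as 0 in the grid, which is the kind of irregular coverage a quality-control map is meant to expose.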

Inventors

  • GUO QINGHUA
  • YANG XIANGSEN
  • XIE JINE

Assignees

  • 中国石油化工股份有限公司
  • 中石化石油物探技术研究院有限公司

Dates

Publication Date
20260508
Application Date
20241104

Claims (10)

  1. A method for extracting the coverage count of OVT data, characterized by comprising the following steps: step 1, importing OVT-domain seismic data into a distributed file system HDFS; step 2, splitting the seismic data in the distributed file system HDFS; step 3, constructing a first resilient distributed dataset RDD; step 4, constructing a second resilient distributed dataset RDD; step 5, aggregating the data and constructing a third resilient distributed dataset RDD; step 6, counting the number of traces at the same position to obtain the coverage count; and step 7, generating a coverage count attribute map.
  2. The method for extracting the coverage count of OVT data according to claim 1, wherein the first resilient distributed dataset RDD in step 3 is composed of (Key, Value) Records, wherein Key represents the OVT number OVT_NUM of the seismic data, and Value represents one item of seismic data.
  3. The method for extracting the coverage count of OVT data according to claim 2, wherein said seismic data comprises a trace header and trace data.
  4. The method for extracting the coverage count of OVT data according to claim 2, wherein said step 3 further comprises setting the number of partitions using rdd.
  5. The method for extracting the coverage count of OVT data according to claim 1, wherein step 4 comprises: splitting each data Record into one or more key-value pairs (OVT_NUM, (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold)) by the flatMap() function; storing the Inline_Num, Crossline_Num, X_coor, Y_coor, and Fold information in a List; and constructing a second resilient distributed dataset RDD with OVT_NUM as the Key and the information stored in the List as the Value, thereby obtaining a set of Records with the fields OVT_NUM, (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold).
  6. The method for extracting the coverage count of OVT data according to claim 5, wherein obtaining the set of Records with the fields OVT_NUM, (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold) comprises the steps of: S41, creating a ListBuffer; S42, adding the information of all traces together through a buff. Compensator; S43, through reduceByKey, combining the traces (OVT_NUM, (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold)) that have the same OVT_NUM to obtain (OVT_NUM, List((Inline_Num, Crossline_Num, X_coor, Y_coor, Fold), (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold), ……)).
  7. The method for extracting the coverage count of OVT data according to claim 1, wherein said step 4 further comprises setting the initial value of the coverage count to 1, the coverage count being the number of traces under the same Inline_Num and Crossline_Num.
  8. The method for extracting the coverage count of OVT data according to claim 1, wherein the data aggregation in step 5 uses the reduceByKey() function to reduce the Records in the first resilient distributed dataset RDD and the second resilient distributed dataset RDD, and Values having the same Key are combined together.
  9. The method for extracting the coverage count of OVT data according to claim 1, wherein step 6 uses the map() function to aggregate, under the same Key in the second resilient distributed dataset RDD, every two traces that have the same Inline_Num and Crossline_Num, and counts the coverage count.
  10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores at least one program executable by a computer, which, when executed by the computer, causes the computer to perform the steps of the OVT data coverage count extraction method according to any one of claims 1-9.
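The key-value flow described in claims 5 to 9 can be illustrated without a Spark cluster. The sketch below emulates the semantics of `flatMap` and `reduceByKey` with ordinary Python functions; the helper names (`flat_map`, `reduce_by_key`, `split_record`, `count_fold`), the record layout, and the sample coordinates are all hypothetical and chosen only for illustration. It is not the patented implementation and does not use the Spark API itself.

```python
from itertools import chain


def flat_map(func, records):
    """Stand-in for RDD.flatMap: apply func to each record and flatten the results."""
    return list(chain.from_iterable(func(r) for r in records))


def reduce_by_key(func, pairs):
    """Stand-in for RDD.reduceByKey: merge the values that share a key with func."""
    merged = {}
    for key, value in pairs:
        merged[key] = func(merged[key], value) if key in merged else value
    return merged


# Hypothetical first dataset: (OVT_NUM, [(Inline_Num, Crossline_Num, X_coor, Y_coor), ...]).
records = [
    (7, [(10, 20, 500.0, 900.0), (10, 20, 500.0, 900.0), (10, 21, 525.0, 900.0)]),
    (8, [(10, 20, 500.0, 900.0)]),
]


def split_record(record):
    """Emit one (OVT_NUM, [trace]) pair per trace, with an initial Fold of 1."""
    ovt_num, headers = record
    return [(ovt_num, [(il, xl, x, y, 1)]) for il, xl, x, y in headers]


# Second dataset: one key-value pair per trace.
pairs = flat_map(split_record, records)

# Third dataset: all traces of the same OVT_NUM gathered into one list.
grouped = reduce_by_key(lambda a, b: a + b, pairs)


def count_fold(entries):
    """Sum the Fold of traces that share the same (Inline_Num, Crossline_Num)."""
    totals = {}
    for il, xl, x, y, fold in entries:
        previous = totals.get((il, xl))
        total = fold if previous is None else previous[2] + fold
        totals[(il, xl)] = (x, y, total)
    return totals


fold_by_ovt = {ovt: count_fold(entries) for ovt, entries in grouped.items()}
```

Starting every trace with a Fold of 1 means the final count per bin is a plain sum, so the merge step stays associative, which is what a reduce-style aggregation needs.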

Description

OVT data coverage count extraction method and storage medium

Technical Field

The invention belongs to the field of seismic data processing for seismic exploration, and particularly relates to an OVT data coverage count extraction method and a storage medium.

Background

As oil and gas exploration deepens, processing personnel demand ever higher precision from seismic data, and processing is moving toward broadband and full-wavefield methods. OVT-domain processing has become the first choice for wide-azimuth seismic data processing, and OVT binning is the first task in OVT-domain data processing. After binning is complete, gathers are extracted to obtain the OVT-domain data. Ideally, one complete OVT dataset is a single-fold section. In practice, factors such as the construction environment, technical capability, and surface acquisition conditions prevent the seismic data from meeting the standard of regular sampling, so the coverage count becomes uneven. Quality control of the coverage count of OVT-domain data is therefore particularly important for subsequent OVT-domain processing.

Disclosure of Invention

The invention aims to address the above deficiencies of the prior art by providing an OVT data coverage count extraction method that gives a better view of the coverage count of OVT-domain data.

The invention is realized by the following technical scheme:

In a first aspect of the present invention, there is provided an OVT data coverage count extraction method, the method comprising:

step 1, importing OVT-domain seismic data into a distributed file system HDFS;
step 2, splitting the seismic data in the distributed file system HDFS;
step 3, constructing a first resilient distributed dataset RDD;
step 4, constructing a second resilient distributed dataset RDD;
step 5, aggregating the data and constructing a third resilient distributed dataset RDD;
step 6, counting the number of traces at the same position to obtain the coverage count;
step 7, generating a coverage count attribute map.

Further, the first resilient distributed dataset RDD in step 3 is composed of (Key, Value) Records, where Key represents the OVT_NUM of the seismic data and Value represents one item of seismic data.

Further, the seismic data consists of a trace header and trace data.

Further, step 3 further includes setting the number of partitions using rdd.

Further, step 4 includes splitting each data Record into one or more key-value pairs (OVT_NUM, (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold)) using the flatMap() function, saving the Inline_Num, Crossline_Num, X_coor, Y_coor, and Fold information in a List, and constructing a second resilient distributed dataset RDD with OVT_NUM as the Key and the information saved in the List as the Value, to obtain a set of Records with the fields OVT_NUM, (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold).

Further, obtaining the set of Records with the fields OVT_NUM, (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold) includes the steps of: S41, creating a ListBuffer; S42, adding the information of all traces together through a buff. Compensator; S43, through reduceByKey, combining the traces (OVT_NUM, (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold)) that have the same OVT_NUM to obtain (OVT_NUM, List((Inline_Num, Crossline_Num, X_coor, Y_coor, Fold), (Inline_Num, Crossline_Num, X_coor, Y_coor, Fold), ……)).

Further, step 4 further includes setting the initial value of the coverage count to 1, where the coverage count is the number of traces under the same Inline_Num and Crossline_Num.

Further, the data aggregation in step 5 uses the reduceByKey() function to reduce the Records in the first resilient distributed dataset RDD and the second resilient distributed dataset RDD, and Values with the same Key are combined together.

Further, step 6 uses the map() function to aggregate, under the same Key in the second resilient distributed dataset RDD, every two traces that have the same Inline_Num and Crossline_Num, and counts the coverage count.

In a second aspect of the present invention, there is provided a computer-readable storage medium storing at least one program executable by a computer, the at least one program, when executed by the computer, causing the computer to perform the steps of the OVT data coverage count extraction method according to any one of claims 1 to 9.

Compared with the prior art, the invention has the following beneficial effects: the coverage count is computed in parallel using Spark, so that the coverage count of the OVT-domain data can be inspected more conveniently, and the quality contr