2021 年 5 月
特征

地球科学工作流程:跟踪多种场景和不确定性

使用行业标准数据格式管理整个勘探与生产生命周期的资产状况,使未来的用户能够返回到关键步骤,探索替代方案或重新审视关键的不确定性,以减少开发工作。
Marco Piantanida / 埃尼勘探与生产 Clara Andreoletti / 埃尼勘探与生产 Matilde Dalla Rosa / 埃尼勘探与生产 Bruno Volpi / 埃尼勘探与生产 Philip Neri / Energistics

资产团队开发地下模型并构建油藏模拟网格来描述碳氢化合物系统的岩石和流体。来自钻孔直接测量和间接检测技术(例如地球物理或测井方法)的数据量呈指数级增长,并且在资产的生命周期中频繁更新。

由于需要创新技术来处理、解释和分析这些数据,因此需要使用许多不同的应用程序平台和附加模块来完成整个工作流程。RESQML™ 是由能源行业联盟开发的一种数据交换标准,旨在促进以快速可靠的方式将完整的项目数据集从一个地下应用程序传输到另一个地下应用程序。RESQML 支持创建一组完全与应用程序无关的文件,除了数据传输作用之外,还可以随时用作项目的综合存档。

作者将探索该标准的有效使用,无论是其促进数据交换的原始作用,还是该标准的创新使用,作为管理碳氢化合物资产生命周期内的情景和不确定性的手段。

RESQML格式

RESQML 标准定义了数据模式和强制元数据,以确保正在传输的数据在完整性、所有数据对象之间的关系以及正确的空间和时间引用方面能够被接收应用程序正确摄取。它还建立将由导出应用程序生成的文件。这些文件包括一个压缩的 Zip 文件,其中包含 XML 格式的所有数据和元数据(即“有关数据的数据”),以及一个基于 HDF5 标准的批量数据文件,其中包含所有大型数据对象,例如向量和数组。该 zip 文件使用开放打包约定 (OPC) 的行业特定版本,称为 Energistics 打包约定 (EPC)。

为了使应用程序符合该标准,软件开发人员需要开发一个用于创建 EPC 和 HDF5 文件的导出模块,以及一个可以读取标准文件并填充新项目或更新现有项目的导入模块。该标准规定使用分配给每个数据对象的通用唯一标识符(UUID)。这确保了如果将数据对象的实例(例如地质层位网格)导出到另一个应用程序并在另一个应用程序内进行修改,则当将其读回原始应用程序时,它不会被视为新的数据对象,而是适当地被视为数据对象的更新版本。一个现有的对象。

该标准不强制要求创建数据交换文件;还可以选择使用 Energistics 传输协议 (ETP) 将数据从一个应用程序直接传输到另一个应用程序。ETP基于WebSocket全双工标准,建立了标准消息,允许两个应用程序评估它们交换特定数据对象的能力,并直接选择和检索数据,而无需使用中间文件,这是一个更高效、更快的过程。

情景和不确定性。现代碳氢化合物资产数据集是组装、评估、处理、解释和分析一组复杂数据的结果。在地球科学家和工程师构建和更新这些数据的这些年里,经常会发生这样的情况:前进的道路有多种可能的解决方案。这可能是由于数据的模糊性、某些对象的值的不确定性或碳氢化合物系统的知识差距造成的,特别是在发现和开发的早期阶段。专业人士的判断和经验,在分析、人工智能 (AI) 和机器学习 (ML) 的协助下,将利用当时可用的信息和知识来选择最有可能的解决方案。

随着时间的推移和新数据的获取,工作流程和现场生命周期中不同步骤所采用的“最情况”场景可能会被证明是有缺陷的。这将要求团队恢复到工作流程中做出该决定的步骤,并使用新数据或更新概念支持的替代场景执行后续步骤。在传统的应用程序使用模式中,这最多是困难的,而且通常是不可能的。由于版本更改,特定于应用程序的档案可能不再可读,或者可能很难确定何时执行场景选项,因为过去的团队成员(可以回忆起项目工作执行的时间)可能不再是业务部门的一部分。

解决方案:在每个主要工作流程步骤存储 RESQML 项目文件。埃尼是一家全球性的意大利独立能源公司,开发了一种流程,可以有效管理运营多个地球科学应用程序 (Geo-Apps) 时遇到的场景和不确定性挑战。如图 1所示,用户在项目的每个主要步骤中创建标准 RESQML 文件集。该插图显示了这些文件被创建并存储为数据,用于不同地理应用程序之间的交换,但同样可以在大型应用程序套件中的重要步骤中创建此类文件。RESQML 文件与应用程序无关的性质确保了这些包是常绿的,并且可以在未来随时由符合标准的应用程序或数据管理系统使用。

图 1. 在工作流程的各个步骤存储打包数据集需要多个地球科学应用程序。
图 1. 在工作流程的各个步骤存储打包数据集需要多个地球科学应用程序。

e-RESQML 项目版本控制存储库系统。基于RESQML标准,数据集的实际存储发生在名为e-RESQML的系统中,该系统由三个主要组件组成,如图2所示:

图 2. e-RESQML 组件及其与 Geo-Apps 的接口示意图。
图 2. e-RESQML 组件及其与 Geo-Apps 的接口示意图。
  • 引用存储在中央服务器上的数据集快照的数据库,包括 RESQML 文件的二进制内容和标识工作流程中的步骤的特定元数据以及有助于日后识别和检索的其他信息。
  • Web 应用程序,允许用户使用 ETP 协议上传/下载 RESQML 文件或直接连接到解释应用程序。
  • 埃尼的数据平台,即公司开发的平台,用于在所有地下应用程序之间管理、托管和共享地下数据,包括测井、地图、层位、断层、地震体等。

RESQML 模型具有以下优点: 1) 模型作为一个整体保存,其中包括在解释步骤结束时从解释应用程序中提取的 3D 网格、地平线/断层表面和井数据的打包内容。该打包内容可以直接由解释过程的以下步骤中使用的应用程序使用,因此允许跨应用程序传输完整模型的有效方法;2) 数据也被分解为基本组件,以便每个组件都可以在埃尼的数据平台内保存为 3D 网格、地平线、地图或测井数据类型的版本化元素。

这个功能非常重要,因为未来的解释工作可能从基本数据组件开始(例如,从最新版本的解释层位和断层面,或从最新的CPI日志),而不是从打包的RESQML内容开始。因此,有必要允许应用程序浏览埃尼数据平台并以最小粒度检索最新更新的数据元素。如果这些元素仅保留在打包的 RESQML 内容中,则无法保证这一点。

因此,在上传 RESQML 模型时,用户可以预览 RESQML 模型的内容并选择要分解并显式保存到 Eni 数据平台中的组件,如图 3 所示。选择数据元素后,例如作为 3D 网格,向用户显示: 1) 链接到网格的所有“子表示”,包括用于定义网格的地平线和断层面;2) 与网格单元相关的属性(例如孔隙度、渗透率、水饱和度),可以选择/取消选择要保存在数据平台内的属性。

图 3. 选择最合适的 RESQML 元素进行分解并保存在 Eni 的数据平台中。
图 3. 选择最合适的 RESQML 元素进行分解并保存在 Eni 的数据平台中。

为了保存一组一致的对象并保留数据元素之间的所有关系,e-RESQML 应用程序使用图形数据库实用程序(图 4)来识别:

图 4. 分析元素之间的关系以确保 RESQML 模型的一致性。
图 4. 分析元素之间的关系以确保 RESQML 模型的一致性。
  • 选定的 3D 网格元素,如图中左侧部分所示。
  • 所有附加属性,如图中黄色圆圈所示。
  • 所有附加的子表示,显示为红色圆圈。
  • 所有附加的“解释”和“特征”,即向 3D 网格对象添加语义的 RESQML 元素。这些在图的左侧部分以绿色和浅蓝色显示。
  • 指向存储二进制内容的 HDF5 文件相应部分的所有链接。此类链接在图的右侧部分以浅绿色显示。

在确定了要分解到 Eni 数据平台中的正确元素集后,该系统成功的关键是描述 RESQML 模型内容的元数据(图 5),包括两组重要的元数据:1)在图5的上半部分,收集了有关生成模型的工作流程和解释步骤的信息;2)在图的下半部分,收集了有关RESQML模型的语义内容的信息。例如,用户可以指定在工作流程的此步骤中创建新的力学特性模型和井动态分析模型,而集成的岩石物理分析和相分类模型已经存在于工作流程的上一步中。工作流程,已更新。此类元数据使用户能够轻松识别正确的起点,从而可以重新启动未来的油藏研究。例如,如果获得新的井数据并且必须重新计算储层相分布,储层专家可以从包含最新“储层相分类和岩石物理特征”的 RESQML 文件开始。

图 5.模型每个版本的元数据输入面板。
图 5.模型每个版本的元数据输入面板。

从技术角度来看,e-RESQML 应用程序经过精心设计,即使使用数十 GB 的 RESQML 模型也能正常工作(图 6)。用户从“应用程序场”运行解释应用程序,其中有专用 NAS区域可用于 RESQML 文件交换。NAS 区域对于运行 e-RESQML 应用程序的应用程序服务器甚至后端数据库服务器也是可见的。这使得系统能够以高效的方式将数据从应用程序服务器转移到数据库服务器,将它们移动到同一存储卷内,而无需进行耗时的数据复制操作。 

图 6. e-RESQML 应用程序的架构和技术选择。
图 6. e-RESQML 应用程序的架构和技术选择。

此外,RESQML 文件会在后台上传到数据库二进制存储中,同时在文件系统上保留文件的缓存本地副本,该副本将保留指定的时间。这保证了用户与系统的非常快速的交互:上传和下载操作被认为是立即的,因为缓存的副本被快速地从一个存储卷移动到另一个存储卷,以使它们再次对应用程序服务器可见。此外,数据库内使用具有高级压缩功能的 BLOB 存储可提供平均 15:1 的压缩比,优化存储空间并使上传/下载操作更快。

如图 6所示, e-RESQML 应用程序被设计为微服务容器化应用程序。它利用图形数据库技术来跟踪 RESQML 数据元素之间的关系、HDF5 文件支持,并公开 ETP 服务器和 ETP 客户端功能以直接与解释应用程序交互。

结论

e-RESQML 系统已证明能够在结构化环境中存储具有充足元数据的碳氢化合物资产的地下和储层数据集的连续版本。该解决方案使未来的用户能够识别每个数据集版本,并选择恢复到工作流程中的关键步骤,以探索替代方案或重新审视关键的不确定性。使用现有的 RESQML 行业标准减少了开发工作,并确保该解决方案面向未来并与 Geo-App 市场中的大量产品兼容。

关于作者
马可·皮安塔尼达
埃尼勘探与生产公司
Marco Piantanida 是意大利米兰 Eni SpA 的技术/科学应用和流程的技术顾问。Piantanida 先生于 1992 年加入埃尼,全面参与了多个技术 IT 项目的开发。他正在参与埃尼的数字化转型计划,除了机器学习应用程序和认知技术外,还专门处理公司的数据平台。
克拉拉·安德烈奥莱蒂
埃尼勘探与生产公司
Clara Andreoletti 于 2002 年加入埃尼,担任研发地球物理学家,致力于开发公司专有的地震成像平台。2011年,她调到勘探部门,此后担任过各种管理职务。她目前领导地下数据治理和管理小组。Andreoletti 女士于 2001 年毕业于米兰理工大学,获得电信工程学位。
马蒂尔德·达拉·罗莎
埃尼勘探与生产公司
Matilde Dalla Rosa 是埃尼勘探部门地球物理数据管理部门的负责人。她是埃尼数字化转型计划中地下数据平台开发的项目经理。她与人合着了多本有关地质建模和地质统计学的技术出版物。Dalla Rosa 女士从米兰理工大学毕业后加入埃尼集团,在地球科学部门工作。
布鲁诺·沃尔皮
埃尼勘探与生产公司
Bruno Volpi 是意大利米兰埃尼公司油藏研究和创新高级顾问。此前,他曾担任油藏表征和岩石物理学部门经理八年,之后担任埃尼油藏研发活动的协调人,与埃尼数据平台项目合作。沃尔皮先生于1988年加入埃尼集团,主要从事油藏及油藏管理领域工作,参与多个油藏研究和技术创新项目。
菲利普·内里
能量学
Philip Neri 是休斯顿 Energistics 的营销总监,负责推动数字数据交换标准的采用。内里先生的职业生涯始于壳牌在欧洲和亚洲的探险家。他还为欧洲和美国的软件提供商(包括斯伦贝谢和艾默生)从事地下软件和数据管理解决方案的设计、开发和商业化工作。他拥有地质学学士学位以及地球物理学和计算机科学硕士学位。
相关文章 来自档案
原文链接/worldoil
May 2021
Features

Geoscience workflow: Tracking multiple scenarios and uncertainties

Managing asset situations throughout the E&P lifecycle, using industry-standard data formats, enables future users to revert back to a critical step, to explore alternate scenarios or revisit critical uncertainties to reduced development efforts.
Marco Piantanida / Eni E&P Clara Andreoletti / Eni E&P Matilde Dalla Rosa / Eni E&P Bruno Volpi / Eni E&P Philip Neri / Energistics

Asset teams develop subsurface models and build reservoir simulation grids to describe the rocks and fluids of a hydrocarbon system. The amount of data, both from direct measurements in boreholes and derived from indirect detection technologies, such as geophysical or logging methods, is growing exponentially and is frequently updated over the life cycle of the asset.

The need for innovative technologies to process, interpret and analyze these data results in the use of a number of different application platforms and add-on modules to work through the complete workflow. RESQML™ is a data exchange standard developed by the Energistics industry consortium to facilitate the transfer of a complete project dataset from one subsurface application to another in a rapid and reliable manner. RESQML supports the creation of a set of files that are fully application-agnostic, which can be used as a comprehensive archive of a project at any time, in addition to their data transfer role.

The authors will explore the efficient use of this standard, both in its original role facilitating data exchanges and also an innovative use of the standard, as a means of managing scenarios and uncertainties over the lifespan of a hydrocarbon asset.

RESQML FORMAT

The RESQML standard defines the data schema and mandatory metadata that ensure that the data being transferred are properly ingested by the receiving application in terms of their completeness, the relationships between all data objects, and their correct spatial and time referencing. It also establishes the files that will be generated by the exporting application. These consist of a compressed Zip file containing all the data and metadata (i.e. the “data about the data”) in XML format, and a bulk data file based on the HDF5 standard that contains all the large data objects, such as vectors and arrays. The zip file uses an industry-specific version of the open packaging conventions (OPC) called Energistics packaging conventions (EPC).

To make an application compliant with the standard, software developers need to develop an export module that will create the EPC and the HDF5 file, and an import module that can read the standard files and populate a new project or update an existing one. The standard stipulates the use of Universal Unique Identifiers (UUIDs) that are assigned to each data object. This ensures that if an instance of a data object, e.g. a geological horizon grid, is exported to and modified within another application, when it is read back into the original application, it is not considered a new data object but appropriately an updated version of an existing object.

The standard does not mandate the creation of a data exchange file; there is also an option to directly transfer data from one application to another, using the Energistics transfer protocol (ETP). ETP, based on the WebSocket full duplex standard, establishes standard messages that allow two applications to assess their ability to exchange specific data objects and directly select and retrieve data without the use of intermediate files, which is a more efficient and faster process.

Scenarios and uncertainties. A modern dataset for a hydrocarbon asset is the result of assembling, assessing, processing, interpreting and analyzing a complex set of data. In the years that it takes the geoscientists and engineers to build and update these data, it happens frequently that the path forward is open to multiple possible solutions. This can be caused by ambiguities in the data, uncertainties in the values for certain objects or knowledge gaps for a hydrocarbon system, especially in the early phase of discovery and development. The judgement and experience of the professionals, assisted by analytics, artificial intelligence (AI) and machine learning (ML), will result in choosing the most likely solution, using the information and knowledge available at that time.

As time goes by and new data are acquired, the “best case” scenario adopted at different steps in the workflow and the life of the field may prove to be flawed. This will require that the team reverts to the step in the workflow at which that decision was made, and execute the subsequent steps with an alternate scenario supported by the new data or updated concepts. In traditional application usage patterns, this would be at best difficult and often impossible. Application-specific archives may no longer be readable, due to version changes, or it may be difficult to pinpoint when the scenario option was exercised, because past team members—who could recall the timing of project work execution—may no longer be part of the business unit.

A solution: Store RESQML project files at each major workflow step. Eni, the global Italian independent energy company, has developed a process to manage efficiently the scenario and uncertainty challenges encountered while operating multiple geoscience applications (Geo-Apps). Illustrated in Fig. 1, the users create a standard RESQML set of files at each major step of the project. The illustration shows these files being created and stored as data for exchanges between different Geo-Apps, but it would be equally possible to create such files at important steps within a large application suite. The application-agnostic nature of RESQML files ensures that these packages are ever-green and can be put to use at any time in the future by a standards-compliant application or data management system.

Fig. 1. Storing packaged datasets at various steps of a workflow requires multiple geoscience applications.
Fig. 1. Storing packaged datasets at various steps of a workflow requires multiple geoscience applications.

The e-RESQML project versioning repository system. Based on the RESQML standard, the actual storage of datasets takes place in a system named e-RESQML, which consists of three major components, Fig. 2:

Fig. 2. Schematic of the components of e-RESQML and their interface to Geo-Apps.
Fig. 2. Schematic of the components of e-RESQML and their interface to Geo-Apps.
  • A database that references dataset snapshots stored on a central server, including the binary content of the RESQML file and specific metadata that identify the step in the workflow and other information that will facilitate identification and retrieval at a later date.
  • A web application, allowing the user to upload/download RESQML files or to directly connect to interpretation applications, using the ETP protocol.
  • Eni’s data platform, i.e. the company platform developed to govern, host and share subsurface data among all the subsurface applications, including well logs, maps, horizons, faults, seismic volumes, etc.

The RESQML model provides the following benefits: 1) Models are saved as a whole, with the packaged content of 3D grids, horizon/fault surfaces and well data that were extracted from the interpretation application at the end of the interpretation step. This packaged content can be consumed directly by applications used in the following steps of the interpretation process, therefore allowing an efficient way of transferring full models across applications; and 2) data are also disaggregated into the basic components, so that each can be saved as a versioned element of a 3D grid, horizon, map or well log data type within Eni’s data platform.

This feature is extremely important, because future interpretation efforts might start from the basic data components (e.g. from the latest version of the interpreted horizons and fault surfaces, or from the latest CPI logs), rather than from the packaged RESQML content. It is therefore necessary to allow applications to browse across the Eni data platform and retrieve the latest updated data element at the minimum level of granularity. This would not be guaranteed, if such elements were kept only in the packaged RESQML content.

Therefore, when uploading a RESQML model, users are provided the possibility of previewing the content of the RESQML model and selecting the components to be disaggregated and explicitly saved into Eni’s data platform, Fig. 3. Upon selection of a data element, such as a 3D grid, the user is shown: 1) All the “sub-representations” that are linked to the grid, including the horizon and fault surfaces that were used for the definition of the grid; and 2) the properties associated with the cells of the grid (e.g. porosity, permeability, water saturation), with the possibility of selecting/deselecting the properties to save within the data platform.

Fig. 3. Choosing the most appropriate RESQML elements to disaggregate and save in Eni’s data platform.
Fig. 3. Choosing the most appropriate RESQML elements to disaggregate and save in Eni’s data platform.

To save a consistent set of objects and preserve all the relationships across the data elements, the e-RESQML application uses a graph database utility (Fig. 4) to identify:

Fig. 4. Analyzing the relationships across elements to ensure consistency of the RESQML model.
Fig. 4. Analyzing the relationships across elements to ensure consistency of the RESQML model.
  • The selected 3D grid element, shown in the center-left section of the figure.
  • All the attached properties, shown as yellow circles in the figure.
  • All the attached sub-representations, shown as red circles.
  • All the attached “interpretation” and “features”, i.e. the RESQML elements that add semantics to the 3D grid object. These are shown in green and light-blue colors in the left part of the figure.
  • All the links to the appropriate section of the HDF5 file, where the binary content is stored. Such links are shown in light green in the right part of the figure.

After the right set of elements to be disaggregated into Eni’s data platform has been identified, the key to this system’s success is the metadata describing the RESQML model content (Fig. 5), including two important sets of metadata: 1) In the upper part of Fig. 5, information about the workflow and the interpretation step, where the model was generated is collected; and 2) in the lower part of the figure, information about the semantic content of the RESQML model is collected. For example, the user can specify that, within this step of the workflow, a new mechanical property model and well performance analysis model were created, while the integrated petrophysical analysis and the facies classification models, which were already present in the previous step of the workflow, have been updated. Such metadata allow the user to easily identify the correct starting point from which future reservoir studies can restart. For example, if new well data became available and the reservoir facies distribution must be recomputed, the reservoir specialist can start from the RESQML file with the most recent “reservoir facies classification and petrophysical characterization.”

Fig. 5. Metadata input panel for each version of the model.
Fig. 5. Metadata input panel for each version of the model.

From the technological point of view, the e-RESQML application has been designed carefully to work, even with tens-of-gigabytes RESQML models, Fig. 6. Users run the interpretation applications from an “application farm,” where dedicated NAS areas are made available for RESQML file exchange. The NAS areas are also made visible to the application server, where the e-RESQML application runs, and even to the back-end database server. This allows the system to shift the data from the application server to the database server in a highly efficient manner, moving them within the same storage volume, without time-consuming copy operations of the data. 

Fig. 6. Architecture of the e-RESQML application and technological choices.
Fig. 6. Architecture of the e-RESQML application and technological choices.

Moreover, the RESQML files are uploaded into the database binary storage in the background, while keeping a cached local copy of the file on the file system, which will be retained for a specified amount of time. This guarantees the user a very quick interaction with the system: upload and download operations are perceived as immediate, because the cached copies are moved quickly from one storage volume to another, to make them visible again to the application servers. Moreover, the usage of BLOB storage with advanced compression within the database provides an average 15-to-1 compression ratio, optimizing the storage space and making the upload/download operations even quicker.

As shown in Fig. 6, the e-RESQML application is designed as a microservice containerized application,. It exploits a graph database technology for the tracking of the relationships across RESQML data elements, HDF5 file support, and exposing both ETP server and ETP client functionalities to directly interact with interpretation applications.

CONCLUSION

The e-RESQML system has demonstrated the ability to store, in a structured environment, successive versions of a subsurface and reservoir dataset for a hydrocarbon asset with ample metadata. The solution enables future users to identify each dataset version and choose to revert to a critical step in a workflow to explore alternate scenarios or revisit critical uncertainties. The use of the existing RESQML industry standard reduced the development effort and ensures that the solution is future-proof and compatible with a large number of products available within the Geo-App market.

About the Authors
Marco Piantanida
Eni E&P
Marco Piantanida is technical advisor for technical/scientific applications and processes at Eni SpA, Milan, Italy. Mr. Piantanida joined Eni in 1992, and has been integrally involved with developing multiple, technical IT projects. He is taking part in the digital transformation program at Eni, specifically dealing with the company’s data platform, in addition to machine learning applications and cognitive technologies.
Clara Andreoletti
Eni E&P
Clara Andreoletti joined Eni in 2002 as an R&D geophysicist, working on development of the company’s proprietary seismic imaging platform. In 2011, she moved to the exploration department and has held various management positions since then. She currently leads the subsurface data governance and management group. Ms. Andreoletti graduated with a degree in telecommunications engineering from Politecnico di Milano in 2001.
Matilde Dalla Rosa
Eni E&P
Matilde Dalla Rosa is the head of the geophysical data management unit within Eni’s exploration department. She is project manager for the development of Eni’s subsurface data platform within the digital transformation program. She has co-authored multiple technical publications on geological modeling and geostatistics. Ms. Dalla Rosa joined Eni, working in the geoscience division, after graduating from Politecnico di Milan.
Bruno Volpi
Eni E&P
Bruno Volpi is senior advisor for reservoir study and innovation at Eni SpA in Milan, Italy. Previously, he was department manager for reservoir characterization and petrophysics for eight years before taking on his current role as focal point for reservoir R&D activities within Eni, cooperating with the Eni data platform project. Mr. Volpi joined Eni in 1988, working mainly in the reservoir and reservoir management area, with involvement in multiple reservoir studies and technology innovation projects.
Philip Neri
Energistics
Philip Neri is marketing director for Energistics in Houston, where he drives the adoption of digital data exchange standards. Mr. Neri started his career as an explorer for Shell in Europe and Asia. He also worked on design, development and commercialization of subsurface software and data management solutions for software providers in Europe and the U.S., including Schlumberger and Emerson. He holds a BS degree in geology and an MS degree in geophysics and computer science.
Related Articles FROM THE ARCHIVE