Organizations create, receive and share significant quantities of files and data during any given project, operation or exploratory activity. Unfortunately, the quality of each file received can vary, with quality issues usually arising when the file originator is no longer under contractual obligation to the organization.
Re-creation of information may seem like the immediate solution but this usually is not a long-term fix and can add to files retention issues, duplication, and cost. All resulting in potential future issues with the ability to create significant impacts.
Organizational Challenge
We are in the middle of a data explosion where the world is estimated to have added 90% of all available data in the last 2 years. We create 2.5 quintillion bytes of data each day at our current pace – resulting in real organizational challenges:
Moving legacy data (e.g., paper, old files) into current technologies
Excessive man-hours spent trying to locate information
Cost implications of unproductive hours being spent interrogating systems and unstructured areas
Data completeness
Data usability
Duplicate information
Data wrangling services
Dependable and trustworthy quality correction software and processes can be tailored to organizational requirements to safeguard the integrity of their files and the accuracy of their data. Data wrangling services partnered with domain SME experience enables file enhancement processes ensure high-quality files that are searchable; allowing systems to fully digest their content.
Data wrangling services also provide the ability to classify information appropriately; aiding findability and completeness checks.
Data wrangling approach
Quality correction and classification utilizes a combination of techniques, including vision analysis, machine learning, deep learning, and deep domain expertise to truly accommodate the complexity of today’s information. These techniques enable key results:
Creation of searchable PDF files
Identification of key items in files/images such as hazard warning
Language translation
Highly accurate classification
Table extraction
Targeted metadata extraction
The Result
Data wrangling addresses quality and classification issues, which maximizes information identification. This maintains the integrity of the files, ensuring their accuracy, consistency and reliability.
Janine Murray
Consulting Practice of Energy, Natural Resources, Utilities and Engineering & Construction
Janine Murray is an IM Consultant with over 15 years of experience in the O&G industry. She has extensive FE/Operations and Major Capital Project (MCP) Information Management experience. She has deep experience with IM brownfield modifications, greenfield enhancements, MCP joint ventures, Closeout, and MCP handover to Operations. Additionally, she is experienced with document cleansing and data extraction techniques for digitizing O&G legacy assets.
She can be reached at: janine.murray@wipro.com
Unlike conventional computers, artificial intelligence is not programmed. On the one hand, like humans, AI learns from information. Yet on the other, it learns far faster than humanly possible. Machine learning (ML) enables AI to make a correlation between a pattern and an outcome, formulate a hypothesis, take action and then integrate that feedback into its next hypothesis. AI continuously learns, with the goal of predicting future outcomes and events with greater accuracy.
An omni channel approach is becoming an unavoidable imperative for petroleum retailers to enhance their flexibility and scalability of operation, provide superior value proposition with respect to competitors and edge ahead on newer avenues for growth.
IT / OT Convergence is a driving industry trend that bridges IT and OT data and platforms in near real-time enabling visibility and business process reinvention across the enterprise