Realising enterprise-scale machine learning with data governance


It may not be long till machine learning becomes a norm in India.

It may take a while to find the JARVIS to your Tony Stark, but real-world machine learning applications are already part of our everyday lives. In India, apps like Aarogya Setu use artificial intelligence/machine learning (AI/ML) and location tracking to help the government and frontline workers monitor the spread of COVID-19 and ensure that social distancing norms are followed. According to PwC, the pandemic has accelerated the global adoption of AI, and organisations across sectors have embraced it more decisively because it has become a business necessity rather than a ‘good-to-have’ solution. The financial services sector alone has witnessed an 82 percent increase in the adoption of AI to navigate the increased business uncertainties and disruptions.
Realising the promise of AI/ML requires organisations to scale its use enterprise-wide and foster an insights-driven culture. Data is the fuel that keeps machine learning (ML) going, so it is important to have a data strategy that works for the business and effectively supports ML applications.
Better data for a better ML experience
A machine learning tool generally becomes more accurate as it is fed more data. As more people and cities around the world become virtually connected, the world will continue to generate vast amounts of information for ML tools to work with and learn from. This also means that organisations need to improve their in-house data management capabilities so that they can accurately feed information to the ML tools being trained. To do so, they will need to centralise data stored across silos and treat it uniformly so that it can be processed across several platforms, such as on-premises, private cloud or public cloud.
Additionally, professionals who are adept with AI/ML applications will be highly sought after by organisations seeking to become insights-driven. Especially with the rise of 5G and the Internet of Things, organisations that need to manage data in motion (i.e., streaming data, data in transit) and data at rest (i.e., data in storage and databases) will require talent with both the knowledge and the practical experience of leading AI/ML projects. Streaming data is particularly important for machine learning systems running at the edge of the network, as those connected devices will decide whether or not to act based on real-time insights and recommendations.
Democratising data is another exercise that will encourage more people across an organisation to make informed decisions. This step will be important if firms want to counter the dearth of qualified professionals. It may not entirely remove the need for a data expert, but it will train people to work effectively with self-service platforms and low-code or no-code models. An overarching data governance framework ought to guide the use of this data to ensure that information shared across teams is accurate and consistent, and that data is not misused by unauthorised people.
One enterprise data cloud to govern them all!
Data quality is fundamental to successfully training ML models, as the insights generated by machine learning systems are only as reliable as the data fed into them. Building on this, effective data governance practices will help firms successfully scale machine learning across the organisation. By ensuring that data meets a certain standard, such as being accurate, timely, and relevant, data governance empowers users across the enterprise to make informed decisions. It also reduces the risk of falling prey to security breaches, and it helps organisations comply with data privacy regulations and Know Your Customer (KYC) obligations.
With data spread across different platforms, organisations might struggle to enforce data governance effectively using traditional or point data management solutions. An enterprise data cloud can help, as it provides an end-to-end, connected data lifecycle solution (from collection to enrichment to reporting to serving to prediction) that can run across multi-cloud and hybrid cloud environments. It also offers an integrated set of security and governance technologies built on metadata to deliver persistent context across all analytic functions. With this context, organisations can open up access to data while being assured that its use is always authorised, tracked, and audited.
It may not be long till machine learning becomes a norm in India. Recognising the potential of AI/ML to transform economies, the Indian government has mandated policy think-tank NITI Aayog to establish a national programme on AI, with a view to guiding research and development in new and emerging technologies. However, poor data quality and a lack of access to timely and relevant data can prevent organisations from unleashing machine learning’s full potential. By leveraging an end-to-end, connected data lifecycle solution such as an enterprise data cloud, organisations can confidently scale machine learning across the enterprise to gain timely and reliable insights that will help them unlock value from their data.
The author, Piyush Agarwal, is SE Leader, India, Cloudera. The views expressed are the author's own.
