Comprehensive Software Reviews to make better IT decisions
Cloudera Shares Customer Lessons on How to Scale Production Machine Learning
To make machine learning (ML) repeatable and scalable, you need to invest in serving infrastructure (the “last mile”), ML operations, and governance, says Cloudera’s Sr. Product Manager Alex Breshears in the MIT-Cloudera webinar “How to Scale Production Machine Learning in the Enterprise.”
In the webinar, Breshears shared key challenges and lessons learned from Cloudera customers who have built large-scale production ML systems.
The webinar also featured Tom Davenport, a distinguished professor and author of several books including Competing on Analytics and The AI Advantage: How to Put the Artificial Intelligence Revolution to Work.
Many organizations experimenting with AI and ML learn very quickly that ML models make up only a small fraction of real-world ML systems – the small black box in the middle of the diagram below, said Breshears, citing a diagram from a paper by Google researchers. Production ML requires a lot more.
Organizations intending to put ML into production and run it at scale need to invest in the following:
- Serving infrastructure: How will the output of an ML model be served to its consumers or integrated with applications they are using? (The “last mile” delivery of ML predictions.)
- Model operations and monitoring at scale: These models need to be packaged, deployed, monitored for performance and drift, and retrained on a periodic basis. If you only have a handful of models, you can do that manually. If you have thousands of them in production like Cisco Systems (the example Davenport gave), you’ll need ModelOps. Cisco has gone from having a few models in production to 60,000 sales propensity models covering 160 million of its customers. The only way to achieve this without hiring an army of data scientists was by creating a “model factory.”
- ML governance: You will also need to think about – and plan for – model security, model governance, model catalogue, etc.
To achieve scale with ML and truly start reaping its benefits by embedding it everywhere, you will need to automate as many components in the ML development and deployment lifecycle as possible. While production ML projects are largely custom, a platform like Cloudera (and other tools – see “Want to Know More?”) can help you achieve that automation.
Want to Know More?
The EU plans to invest €6 billion to build a single European data space, reports EURACTIV. The envisioned space will house personal, business, and “high-quality industrial data” and create the infrastructure for data sharing and use across businesses and nations.
“Facebook quietly acquired another UK AI startup and almost no one noticed,” reported TechCrunch on February 10. We looked into why.
In a landmark ruling, a Dutch court has ordered an immediate halt to the government’s use of an automated system for detection of welfare fraud.
Databricks, a data processing and analytics platform with a strong focus on AI and ML, has partnered with Immuta to deliver automated end-to-end data governance for AI, data science, and ML projects.
CognitiveScale has been named one of the 50 Smartest Companies of the Year 2019 by The Silicon Review. The recognition is for “transforming customer engagement and lifetime value with Artificial Intelligence.”
Facebook agreed to pay $550 million to settle a class action lawsuit with a group of users in Illinois over its use of facial recognition technology (FRT) to tag individuals in photographs, reports the BBC.
AI has been making headlines in healthcare for some time, and the current outbreak of the coronavirus in Wuhan, China, (with cases now in other parts of the world) – or, more specifically, the early warning of the outbreak – is another example.
Google founders Larry Page and Sergey Brin are stepping down as CEO and President of Alphabet, respectively. Google CEO Sundar Pichai will take over as Alphabet’s CEO. Both Page and Brin will remain actively involved as board members, shareholders, and cofounders.
I recently had an opportunity to speak with a KPMG partner in the Canadian risk consulting practice and with the head of data science for Canada about several things, including KPMG Ignite. This is what I learned.