Cloudera Shares Customer Lessons on How to Scale Production Machine Learning
To make machine learning (ML) repeatable and scalable, you need to invest in serving infrastructure (the “last mile”), ML operations, and governance, says Cloudera’s Sr. Product Manager Alex Breshears in the MIT-Cloudera webinar “How to Scale Production Machine Learning in the Enterprise.”
In the webinar, Breshears shared key challenges and lessons learned from Cloudera customers who have built large-scale production ML systems.
The webinar also featured Tom Davenport, a distinguished professor and author of several books including Competing on Analytics and The AI Advantage: How to Put the Artificial Intelligence Revolution to Work.
Many organizations experimenting with AI and ML quickly learn that ML models make up only a small fraction of real-world ML systems – the small black box in the middle of the diagram below, said Breshears, citing a diagram from a paper by Google researchers. Production ML requires much more.
Organizations intending to put ML into production and run it at scale need to invest in the following:
- Serving infrastructure: How will the output of an ML model be served to its consumers or integrated with applications they are using? (The “last mile” delivery of ML predictions.)
- Model operations and monitoring at scale: Models need to be packaged, deployed, monitored for performance and drift, and retrained periodically. With only a handful of models, you can do this manually. With thousands of them in production, as at Cisco Systems (the example Davenport gave), you’ll need ModelOps. Cisco has gone from a few models in production to 60,000 sales propensity models covering 160 million of its customers. The only way to achieve this without hiring an army of data scientists was to create a “model factory.”
- ML governance: You will also need to think about – and plan for – model security, model governance, model catalogue, etc.
To achieve scale with ML and truly start reaping its benefits by embedding it everywhere, you will need to automate as many components in the ML development and deployment lifecycle as possible. While production ML projects are largely custom, a platform like Cloudera (and other tools – see “Want to Know More?”) can help you achieve that automation.
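To make the "monitored for drift and retrained" step in the list above more concrete, here is a minimal Python sketch of one way such a check could be automated across many models. It is not Cloudera's or Cisco's method, which the webinar summary does not describe. The Population Stability Index (PSI) is swapped in as a commonly used drift measure, the 0.2 threshold is a rule-of-thumb assumption, and the names `psi` and `models_to_retrain` are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def psi(expected: Sequence[float], actual: Sequence[float], bins: int = 10) -> float:
    """Population Stability Index between a baseline and a recent sample.

    Assumes both samples are non-empty. Bin edges come from the baseline.
    """
    lo, hi = min(expected), max(expected)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 if the baseline is constant

    def shares(values: Sequence[float]) -> List[float]:
        counts = [0] * bins
        for v in values:
            idx = int((v - lo) / width)
            idx = min(max(idx, 0), bins - 1)  # clamp out-of-range values to edge bins
            counts[idx] += 1
        eps = 1e-6  # floor so the logarithm stays defined for empty bins
        return [max(c / len(values), eps) for c in counts]

    e, a = shares(expected), shares(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))


def models_to_retrain(
    baselines: Dict[str, Sequence[float]],
    recent: Dict[str, Sequence[float]],
    threshold: float = 0.2,  # assumed rule-of-thumb cutoff, tune per use case
) -> List[str]:
    """Return the names of models whose recent scores drifted past the threshold."""
    return sorted(
        name
        for name, base in baselines.items()
        if name in recent and psi(base, recent[name]) > threshold
    )


if __name__ == "__main__":
    # Illustrative data only: model "b" sees a shifted score distribution.
    baseline = [i / 100 for i in range(100)]
    shifted = [0.5 + i / 200 for i in range(100)]
    flagged = models_to_retrain(
        {"a": baseline, "b": baseline},
        {"a": baseline, "b": shifted},
    )
    print(flagged)
```

Run on a schedule over every deployed model, a check like this turns drift monitoring into a queue of retraining candidates rather than a manual review, which is the kind of automation a "model factory" depends on.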
Want to Know More?
Recently I attended the inaugural Emotion AI conference, organized by Seth Grimes, a leading analyst and business consultant in the areas of natural language processing, text analytics, sentiment analysis, and their business applications. So, what is emotion AI, why is it relevant, and what do you need to know about it?
SortSpoke’s novel approach to machine learning answers a longstanding problem in financial services – how to efficiently extract critical data from inbound, unstructured documents at 100% data quality.
Amazon is offering its cashierless store technology to other retailers. The technology known as “Just Walk Out” eliminates checkout lines, offering an “effortless” shopping experience and shifting store associates to “more valuable activities”.
As the COVID-19 pandemic is shutting down whole countries, a few of you may be wondering whether AI can help create a vaccine for the virus responsible. After all, AI is magic, right?
Alphabet is facing backlash from its shareholders over its approach to digital privacy, reports the Financial Times. And not for the first time. This time, however, things will need to change.
The EU plans to invest €6 billion to build a single European data space, reports EURACTIV. The envisioned space will house personal, business, and “high-quality industrial data” and create the infrastructure for data sharing and use across businesses and nations.
“Facebook quietly acquired another UK AI startup and almost no one noticed,” reported TechCrunch on February 10. We looked into why.
In a landmark ruling, a Dutch court has ordered an immediate halt to the government’s use of an automated system for detection of welfare fraud.
Databricks, a data processing and analytics platform with a strong focus on AI and ML, has partnered with Immuta to deliver automated end-to-end data governance for AI, data science, and ML projects.