Is autoML eating data scientists' jobs?
Yes and no.
Yes - if all you do is the 'sexy' part of data science: training different models, tuning parameters, picking the best-model based on performance metrics, you should be worries because those tasks are getting automated.
However, if you have been doing a lot unsexy 'dirty work' like cleaning data, selecting data, model testing, scaling models in production, autoML can make your work more efficient. And your experience in the end-to-end data science workflow makes you more valuable.
The unsexy data science work is hardest to get automated.
Why? Because it involved human decisions.
For example:
• whether you should build a pipeline to collect new some data that has 50% chance of being useful? (data selection decisions)
• is the model good enough to launch now with 85% accuracy, or do we need 90% and wait for 6 months? ( trade-offs)
• is the model working well with other features in the product, or it's hurting the customer experience? (product sense)
Honestly, most data scientists are already doing the dirty work, but not a lot of us enjoy doing it and want to avoid it.
It's time to change the mindset - we are not just modelers, we are problem solvers. The data science work begins with understanding the problem and ends with generating impact.
A few things to start with:
• Challenge assumptions
• Learn domain knowledge
• Know when to make trade-offs
• Talk to product managers and engineers
• Get comfortable with non-DS related challenges
The future of data science is to use automation tools well and get good at the things that can't be automated.
*This is inspired by my conversation with Greg Tanaka, learn more about the future of data science, forecasting, and quantitative trading on 'the data scientist show':
Apple:https://lnkd.in/ga-KVEvY
Spotify:https://lnkd.in/gZ5ESM3J
#datascience #machinelearning