RYAN ZERNACH

Senior AI Systems Engineer

Ryan_Zernach_2025_Senior_AI_Systems_Engineer_Remote_United_States

🤖 DataRobot 10x Data Science Academy

After learning to train machine learning models at Lambda School, I was seriously impressed with DataRobot's AutoML platform. The clips below walk through how I used it in the inaugural academy cohort.

🤖 DataRobot 10x Data Science Academy

Summary

As a member of the inaugural DataRobot 10x Data Science Academy cohort, I learned to solve data science problems faster with DataRobot's automated machine learning (AutoML) platform — from exploratory analysis through deployment and explainability.

Tech stack

DataRobot Platform (AutoML, model blueprints, validation, What-If / deployment dashboards)

Mini projects

Three builds: Stack Overflow developer salaries, vehicle fuel efficiency (MPG), and county-level COVID spread. Each paired domain questions with feature importance, blueprints, lift curves, and dashboards.

What I focused on

Interpreting models (not just accuracy), comparing blueprints, and using DataRobot's visuals — feature effects, correlation maps, and deployed What-If tools — to explain predictions to a stakeholder.

💰 Predicting Stack Overflow Developer Salaries

Feature importances highlighted developer role type as a strong driver of salary. The blueprint showed extensive text features numerically encoded during prep. Validation plots revealed the model tended to over-predict at higher salaries and under-predict at lower ones; lift and predicted-vs-actual views made that easy to see. The feature importance map also surfaced correlation clusters among non-white features — especially oranges and greens. The built-in What-If dashboard supported interactive salary predictions from the trained model.

🤖 DataRobot 10x Data Science Academy
Feature importances: “DevType” among the most influential inputs for salary.
🤖 DataRobot 10x Data Science Academy
Model blueprint: many text features numerically encoded in data prep.
🤖 DataRobot 10x Data Science Academy
Validation: tendency to over-predict higher salaries and under-predict lower salaries.
🤖 DataRobot 10x Data Science Academy
Predicted vs. actual — zoom in to compare distributions.
🤖 DataRobot 10x Data Science Academy
Feature importance map: correlation clusters among colored (non-white) features.
🤖 DataRobot 10x Data Science Academy
Deployed What-If dashboard for salary predictions.

The same deployment path that worked for exploration also supported stakeholder-facing explanations: adjust inputs, compare scenarios, and keep the model’s behavior inspectable.

🚙 Fuel efficiency (MPG) for future vehicles

This workflow predicts miles-per-gallon from mechanical specs. As more manufacturers move to EVs, a miles-per-kWh model would lean on physics — weight and aerodynamics — rather than cylinders, transmissions, or ethanol-heavy fields from classic MPG datasets. Design-stage simulation can answer many of those questions before hardware exists.

🤖 DataRobot 10x Data Science Academy
What-If dashboard for the MPG prediction model.

🦠 Which county might see COVID next?

ROC, coefficients, blueprint, and lift views supported iteration on the county infection model. Multicollinearity does not always hurt accuracy, but it can blur interpretability of individual correlated features — the correlation map below shows twelve distinct correlation clusters. Partial dependence–style feature effect plots and gain / cumulative lift charts rounded out validation.

🤖 DataRobot 10x Data Science Academy
ROC / distribution view for the COVID county model.
🤖 DataRobot 10x Data Science Academy
Stage 1 coefficients.
🤖 DataRobot 10x Data Science Academy
Blueprint for the COVID-focused pipeline.
🤖 DataRobot 10x Data Science Academy
Lift curve (cross-validation).

Multicollinearity among features does not necessarily reduce model accuracy, but it can reduce how clearly you can attribute effects to individual correlated inputs — worth inspecting before presenting coefficients to others.

Feature correlation map: twelve correlation clusters across inputs.

🤖 DataRobot 10x Data Science Academy
Bold indicates the target: have_confirmed_18.
🤖 DataRobot 10x Data Science Academy
Feature effects: R_birth_2018.
🤖 DataRobot 10x Data Science Academy
Feature effects: POP_ESTIMATE_2018.
🤖 DataRobot 10x Data Science Academy
Gain chart (cross-validation).
🤖 DataRobot 10x Data Science Academy
Cumulative lift (cross-validation).