Alec Wong

Professional Experience

Progressive Casualty Insurance – Lead Data Analyst

2019 - Present | 2022 - Present (at Lead level)

Highlighted role achievements

  • Modeled claims adjuster productivity using zero-inflated negative binomial GLM using R package pscl as well as non-linear mixed-effects modeling with R package brms, developing theory for an alternative staffing criterion.
  • Forecasted when claim ID numbers will exceed limits using Bayesian generalized linear regression with R package rstanarm, providing a recommended date for the start of a large-scale effort to increase claim ID size.
  • Developed a data pipeline in python to quantify auto inspector drive times using location information and Google Maps Distance Matrix API, increasing efficiency by approximately 80 FTE.
  • Guided claims process leaders through under-staffing crisis by building a model to rank states’ under-staffing condition and correctly forecasting workload to predict impact of temporarily hired claims adjusters.
  • Broad experience in several SQL frameworks (TSQL, DB2, Snowflake, SQLite, duckdb) creating complex queries joining several tables each with hundreds of millions of records.
  • Recast critical team SQL dataset using delta compression to reduce storage size by >5x and reduce analyst time spent writing lengthy and repetitive queries.

Contributions additional to role

  • Pushed for code best-practices early, introduced git to our team, and checked critical code into Github repositories.
  • Developed curricula for and taught a dual R and Python course to 25 attendees over the course of 9 months.
  • Developed several R and python packages including tools to eliminate boilerplate in SQL connections, extend ggplot2 with a company color palette, and other miscellaneous functions.
  • Spent over 100 hours between 2021 and 2024 providing one-on-one support to analysts all over the company supporting R, Python, and SQL questions.
  • Helped facilitate R Beginner’s Workshops, led bi-weekly R Support Sessions, and produced an in-house newsletter.
  • Assisted in the setup and maintenance of Posit corporate productivity software, and an in-house python cloud platform.
  • Won 2nd place in company-wide modeling challenge, involving over 15 teams.

Cornell University – Graduate Research Assistant

2015 - 2018

  • Gained competency with Bayesian inference using Markov Chain Monte Carlo (MCMC) simulation, geostatistical modeling, maximum likelihood optimization procedures, and generalized linear models.
  • Developed two novel statistical models that use MCMC to estimate animal population size and relationships with spatial habitat covariates, applied to moose in New York.
  • The statistical models’ performance was tested via simulation analysis using high-performance computing techniques under an original dual-layer parallelization scheme of a cluster of computers.
  • Applied motivational and effective leadership leading field research teams of up to 10 personnel into wilderness conditions, and led laboratory discussions on the use of Git and R Markdown to improve organization of research.
  • Communicated results to statistical and ecological audiences nationally and internationally.

Education

Software

  • R (package development, statistical analysis)
  • Python (ETL, automation)
  • Git
  • SQL
  • Front-end development (HTML, CSS, JavaScript)
  • Tableau / Power BI
  • ArcGIS

Code Examples

Advent of Code

  • Ranked second out of 14 participating data scientists and BI developers on the company leaderboard.
  • Primarily interested in solving puzzles with awk to enhance programming skills with linux tools.
  • Selected entries:

Automated internet speed tests, hosted on my website

  • Source code.
  • Tests speeds daily and updates website.
  • Displays data with D3.

This resume!

  • Follows a make workflow to generate the output files. Just run make to compile everything!
  • Uses rmd, scss, pandoc, and wkhtmltopdf to create the document.