Data Scientist

2.5 years
equivalent full-time experience
2 companies
worked for
10+ Courses
on data science topics
 
 
 
 
 
CLA (CLiftonLarsenAllen, LLP)
Graduate Data Science Intern
October 2023 – Present Remote

Notable Projects:

  • Designed and implemented processes for automated extraction of values from tax forms using large language models and computer vision models
  • Performed statistical analyses on client-specific consulting projects in the agricultural industry
  • Created a custom PDF “chunking” algorithm to aid in retrieval-augmented generation for an internal chat bot
  • Used internal pricing data to generate reports and data on engagement pricing for leaders throughout the firm
Details:

Tools and Software:

  • Azure DevOps
  • Azure Document Intelligence Studio
  • Azure OpenAI
  • Camunda
  • Databricks
  • Docker

Programming Languages:

  • Python (Daily)
  • R (Daily)
  • LaTeX (Regularly)
  • Java (Occasionally)

Supervisors:

  • Joel Hennig, CPA
  • Alex White, PhD
  • Spencer Lourens, PhD
 
 
 
 
 
Bayer Crop Science
Data Science Intern - New Business Models
June 2022 – September 2023 Remote

Notable Projects:

  • Cleaned, explored, analyzed, and presented relevant business insights from 4 large public datasets focused on crop insurance policies and claims
  • Performed monthly data experiments and presented the results to business partners to facilitate decision making in new business models
  • Used economic theory to assess value of new business models from the the farmer’s point of view
Details:

Tools and Software:

  • AWS S3
  • AWS Sagemaker
  • Jira
  • Powerpoint

Programming Languages:

  • R (Daily)
  • Python (Occasionally)

Supervisors:

  • James (Daniel) Eubanks
 
 
 
 
 
CLA (CLiftonLarsenAllen, LLP)
Data Science Intern
January 2020 – September 2021 Remote

Notable Projects:

  • Developed R Shiny applications within the Golem framework for 5 internal projects which were hosted in the production environment and experienced heavy use by up to 300 users
  • Contributed functions and documentation to 4 internal R packages for easier operations with Azure Datalake and Azure Database
  • Automated PowerPoint creation with R to populate template slide decks with data and information about any subgroup of the firm
  • Extracted usable datasets from firm databases using Tidyverse data manipulation principles so leaders can assess progress
Details:

Tools and Software:

  • Azure Datalake
  • Golem
  • RStudio Connect
  • Shiny

Programming Languages:

  • R (Daily)
  • HTML/CSS (Frequently)
  • SQL (Regularly)
  • Javascript (Occasionally)

Supervisors:

  • Spencer Lourens, PhD
  • Matt Anderson, CPA

Professional Trainings and Courses

 
 
 
 
 
2023-2024
  • TensorFlow: Neural Networks and Working with Tables (LinkedIn Learning, October 2023)
  • Business Writing (Coursera, August 2023)
  • Introduction to Portfolio Construction and Analysis with Python (Coursera, July 2023)
 
 
 
 
 
2021-2022
  • Data Visualization for Data Analysis and Analytics (LinkedIn Learning, June 2021)
  • Data Visualization: Best Practices (LinkedIn Learning, May 2021)
  • Learning Data Visualization (LinkedIn Learning, April 2021)
  • Data Visualization: Storytelling (LinkedIn Learning, March 2021)
  • Designing an Infographic (LinkedIn Learning, March 2021)
  • Learning Data Science: Tell Stories with Data (LinkedIn Learning, March 2021)
  • Python for Data Science Essential Training Part 1 (LinkedIn Learning, February 2021)
 
 
 
 
 
2019-2020
  • R Essential Training: Wrangling and Visualizing Data (LinkedIn Learning, November 2020)
  • Databricks Spark+AI Summit (Virtual, June 2020)
  • Learning Azure DevOps (LinkedIn Learning, January 2020)