My top data science projects. See more on my Kaggle: https://www.kaggle.com/etsc9287
The objective of this project is to predict whether or not a pitch will be thrown in the strikezone based on a variety of factors from before the pitch is airborne. If MLB batters and their coaches are aware of the diverse factors influencing whether or not a pitch will be thrown in the strikezone, batters will have a better idea of when to swing and when not to swing. This project can also assist pitchers in being aware of their pitch predictability. The process of this project includes the following:
The objective of this project is to compare Kaggle.com’s Python and R users, including users of both and neither languages, in various areas including demographics, learning sources, career, machine learning, and more. With the usage of these analytics, data science enthusiasts, especially beginners, can determine which language(s) to learn or focus on based on their personal situations. Alternatively, this analysis can also be used to determine a data science enthusiast’s best course of action based on the language(s) they already prefer. The process of this project includes the following:
Note: This project was apart of a Kaggle analytics competition in which participants could choose any topic of focus to analyze from the survey. My notebook was the 19th most liked in the competition and is currently my top Kaggle project. My other projects involving Kaggle datasets can be found here.
In this project, I analyze the evolution of spin rates among MLB pitches over time, exploring any discernible patterns that may raise concerns regarding unfair play, particularly in light of the recent foreign substance scandal that has plagued Major League Baseball. I posted my analyis as a Medium article which includes: