Beau Kramer

Resume

Projects

Finance

Eigen Portfolios: This was an implementation of a tactical asset allocation strategy in which principal component analysis helped separate and weight assets into offense and defense portfolios.
Nowcasting Recessions: The goal of this project is to create a machine learning algorithm to detect whether the economy is in a recession. The NBER announces the beginning and end of recessions only after they have passed. This presents an opportunity: if we knew the economy was likely already in a recession months before the official announcement, we could take appropriate action. For example, a manufacturer could scale down production once a recession had likely begun. Alternatively, it could scale up production if it knew that the recession was likely already over. I have two primary goals in developing this model: for it to be 1) performant and 2) interpretable.
Pycerno: This is a Python “translation” of the book “Quantitative Investment Portfolio Analytics In R” by James Picerno. I found this to be a useful exercise because it 1) familiarized me with different Python packages relevant to financial analysis, 2) made it easy for me to port R code I find to Python, and 3) gave me lots of template code for basic financial analysis of new datasets. Some of the code is clunky, but this was in an effort to be faithful to the source code.
CPI Visualization: This was a small project that involved picking a dataset and performing some EDA on it. Due to my background in finance, I chose the consumer price index. The numerous sub-components made it fun to disaggregate.

Machine Learning

Automated Essay Grading and Inference Using Linear and Deep Learning Models

NLP Deep Learning Feature Engineering Data Visualization

For this NLP project, my team selected an automated essay grading challenge. We were interested in automatically generating feedback for students and in contrasting linear models with deep learning ones. We performed extensive feature engineering and trained deep learning models at both the essay and sentence levels.

News Article Topic Classification

NLP Classification Data Processing

I trained classifiers using a bag-of-words model to identify the topic of a news article. After some initial attempts at the problem, I applied some preprocessing to the texts, which improved the ability of the model to generalize.
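As a rough illustration of the idea (not the project's actual code), here is a minimal bag-of-words sketch using only the Python standard library. The stop-word list, the toy documents, the topic labels, and the choice of a multinomial naive Bayes classifier with add-one smoothing are all assumptions made for this example.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "of", "to", "in", "and", "is", "on", "for"}


def tokenize(text):
    """Preprocess: lowercase, keep alphabetic tokens, drop stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def train(docs):
    """Build bag-of-words counts per label from (text, label) pairs."""
    word_counts = defaultdict(Counter)  # label -> word -> count
    label_counts = Counter()            # label -> number of documents
    vocab = set()
    for text, label in docs:
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        label_counts[label] += 1
        vocab.update(tokens)
    return word_counts, label_counts, vocab


def predict(text, model):
    """Return the label with the highest naive Bayes log-probability."""
    word_counts, label_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in label_counts:
        score = math.log(label_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok in vocab:  # words never seen in training are ignored
                # Add-one (Laplace) smoothing over the vocabulary.
                count = word_counts[label][tok] + 1
                score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training data, invented for this sketch.
toy_docs = [
    ("The team won the game in overtime", "sports"),
    ("The striker scored a late goal", "sports"),
    ("The new GPU speeds up model training", "tech"),
    ("The software update fixes a security bug", "tech"),
]
model = train(toy_docs)
```

Calling `predict("A goal in the final game", model)` classifies a new headline. The preprocessing in `tokenize` is where steps such as lowercasing and stop-word removal can help a model of this kind generalize.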

Poisonous Mushroom Clustering

Clustering Data Visualization

Using PCA to reduce dimensionality, I clustered data about mushrooms to try to classify poisonous ones. I used KMeans and Gaussian Mixture Models.
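The general shape of that workflow can be sketched as follows, assuming NumPy is available. This is not the project's code: the data are synthetic numeric blobs rather than the mushroom dataset, the helper names `pca_project` and `kmeans` are hypothetical, and only a basic k-means step is shown (the Gaussian mixture step is omitted).

```python
import numpy as np


def pca_project(X, n_components):
    """Project centered data onto its top principal components via SVD."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T


def kmeans(X, k, n_iter=50, seed=0):
    """Plain Lloyd's algorithm: assign points, then recompute centers."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Distance from every point to every center, shape (n_points, k).
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers


# Synthetic stand-in data: two well-separated groups in 10 dimensions.
rng = np.random.default_rng(42)
X = np.vstack([
    rng.normal(0.0, 1.0, size=(50, 10)),
    rng.normal(6.0, 1.0, size=(50, 10)),
])

X2 = pca_project(X, n_components=2)  # reduce 10 dimensions to 2
labels, centers = kmeans(X2, k=2)    # cluster in the reduced space
```

With labeled data, the cluster assignments in `labels` could then be compared against the known classes to see how well the clusters line up with them.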

Forest Cover Prediction

Classification Data Visualization Ensembling

For this group project, my team selected the forest cover prediction challenge. We had to predict the species of tree that lived in a 30x30m cell in several Colorado forests. We summarized our lessons learned in this presentation.

Statistics

Crime Policy North Carolina

Linear Regression EDA

Classic linear regression was the focus of this project. We were given a dataset about crime in North Carolina in the 1980s with the goal of providing policy recommendations to politicians.

Challenger Explosion

Discrete Response Logistic Regression

This discrete response modeling project addressed whether temperature and/or pressure have a relationship with the failure of the primary O-rings in the space shuttle.

Time Series Forecasting with SARIMA Model

Time Series ARIMA

This was a simple exercise in forecasting an e-commerce time series from the Federal Reserve Bank of St. Louis. After exploring the data and verifying its suitability for the model, we fitted a seasonal ARIMA model to the data.

Fatality Rates with Fixed Effects

Panel Data Fixed Effects

This panel dataset on driving laws proved a challenge to explore visually, but we managed to create some visuals that helped us understand the data. We then used a fixed effects model to capture the effects of different policies on fatality rates.

Cereal Content and Shelf Probabilities

Multinomial Regression Odds Ratios

Multinomial regression on a cereal dataset was the main task in this project. In addition to calculating odds ratios, we built up toward visuals that showed the shelf probability by nutritional content.

Forest Fires

Data Visualization EDA

The focus of this project was on the importance of exploratory data analysis in any project. We explored a dataset about a Portuguese national park that experienced severe wildfires to see if we could begin predicting conditions that make an area vulnerable to extreme fires.

beaukramer.github.io