Below are 5 more selections from the 28 projects that I worked on as part of my Python Engineering class at The Iron Yard, Durham.

ATUS Analysis

IPython Notebook containing analysis of the American Time Use Survey data.

This assignment was largely unguided, with the aim of exploring an unknown data source for ourselves. Getting my initial bearings was a challenge, as there were many files and a huge number of data points (not to mention the coded field names!). I focused on normal activities to ensure the data set was large enough to have relevance.

PROJECT DETAILS


Time: 3.5 days

Python Tools: pandas, NumPy, Matplotlib, IPython Notebook, datetime

IPython Notebook: ATUS ipynb

GitHub: American Time Use Survey Analysis

Programming Language Classifier

IPython Notebook loading and executing custom featurizers.

The challenge of this assignment was two-fold: (1) collect and access a corpus of snippets of programming languages and (2) use Scikit-Learn tools to classify the test snippets as accurately as possible. I manually built extensive featurizers and accompanying tests to be used with the DecisionTreeClassifier because the Naive Bayes classifier gave poor results for my data set.

PROJECT DETAILS


Time: 3.5 days

Python Tools: sklearn, nose, Matplotlib, IPython Notebook

IPython Notebook: Programming Language Classifier ipynb

GitHub: Programming Language Classifier

Road Rage

IPython Notebook presenting traffic simulation.

The goal of this assignment was to use a Monte Carlo simulation to determine the ideal speed limit for a 1 km stretch of road. I put a lot of effort into a version of hard mode that, although working, needed just a bit more troubleshooting.

PROJECT DETAILS


Time: 3.5 days

Python Tools: random, math, NumPy, Matplotlib, IPython Notebook

IPython Notebook: Road Rage ipynb

GitHub: Road Rage

What to Watch

Command-line program to provide meaningful access to data in the MovieLens 100k database.

The purpose of this project was to make effective use of classes, list comprehensions, the csv module to read in data. I built a Django app later in the class that has similar functionality for the MovieLens 1M data set.

PROJECT DETAILS


Time: 3.5 days

Python Tools: csv, math, os, sys

GitHub: What to Watch

Mystery Word / Evil Mystery Word

Command-line “hangman”-style game.

This was the very first weekend project and was a lot of fun to put together. I was able to finish the normal mode on Saturday afternoon, so was able to fully tackle the “Evil” version of the game—just try to beat it!

PROJECT DETAILS


Time: 3.5 days

Python Tools: deepcopy, os, random, unittest

GitHub: Mystery Word / Evil Mystery Word