Why Should I Be Interested in Survival Analysis?

Survival analysis is a field that a lot of analysts and scientists overlook - it's seen as sorta unsexy. This is too bad, it's actually a really cool field, and...

9^{00}

Visualizing PCA's Information Loss

In this screencast, we look at how much information is lost during a PCA transformation by applying PCA to an image: an interesting approach to image compression. Learn: How PCA...

Using Patsy for Categorical Data

Patsy is one of those really cool Python libraries that does one thing - and only one thing - really well. The data we are given often contains categorical variables...

Survival Analysis Bundle Pack

Pick up all three survival analysis screencasts! Why should I be interested in Survival Analysis Estimating the Survival Function Estimating the Hazard Rate

19^{00}

Scraping the Web using Pandas

We'll use an sweet hidden function in Pandas, read_html and some logic to create a webscraper for collecting tennis ranking data. Learn: How to use Pandas to scrape data from...

Sampling from Discrete Distributions

One common problem in implementing statistical algorithms is being able to sample from an arbitrary distribution: suppose you have N objects, each with a different probability of being chosen -...

5^{00}

Intro to PCA

An introduction to what PCA is doing to your data using an easy mental model. We show examples of how to use PCA in Python + an interesting side effect...

Sorting Colours using PCA

IPython Notebook used available here. We show an example of how to "sort" high dimensional objects using PCA, specifically we answer "how can we sort colours?" This screencast presents an...

Free

Estimating the Survival Function

We introduce the survival function, a core component in survival analysis, and the non-parametric estimate for it: the Kaplan-Meier curve. We explore these concepts on FBI's Most Wanted List and...

Estimating the Hazard Function

At the heart of survival analysis is the hazard curve, which can be thought of as the amount of risk of dying at any point in time. It's the fundamental...

Create Markov Chains Using Your Chrome Browsing History

If you're using Chrome, Firefox or Safari, likely you have a SQLite database on your computer that your browser stores all your history: search terms, pageviews, urls visited, and when...

Bayesian Modelling (Car Arrival Problem)

I was sent this question from a reader: I have a road intersection with one entrance and two exits, A and B. My goal is to estimate the number of...

Bayesian Beta-Binomial Model

In this screencast, we explore one of my favourite and useful Bayesian models: the Beta-Binomial model. This model is a building block for more advanced models, and has applications in...

A/B Testing Conversion Rates

Using the Beta-Binomial model, we introduce a common pattern to perform Bayesian A/B testing. There are three strong reasons to prefer Bayesian A/B testing over traditional hypothesis testing: uncertainty visualisation,...

Determining Ages using First Name Data

In this screencast, we use open Census data, combined with some laws of probability, to derive age distributions a population. This is very useful when you have a body of...

Limited Edition Data

Super hot data fresh off the servers

0^{01}

test

10^{00}