## Exploring modeling failure

In my last post, I gave an example of a multilevel model using a binomial generalized linear model (GLM). The varying intercept model helped illustrate pa...

In my last post, I gave an example of a multilevel model using a binomial generalized linear model (GLM). The varying intercept model helped illustrate pa...

I’ve been on a journey learning multilevel models and Bayesian inference through Richard McElreath’s Statistical Rethinking book. The concepts of shrinkage a...

Recently, I built a simple NLP algorithm for a work project, following the template described in this tutorial. As I looked to increase my model’s complexity...

Last month, I wrote about how I aimed to make daily contributions towards my goals, no matter how small. Not surprisingly, it has been challenging. Neverthel...

`np.dot`

and broadcasting
Vectorization and broadcasting are tricks I have used sparingly and absent-mindedly if at all. However, it is a critical skill for algorithmic code to run ef...

Yesterday morning, I continued my habit of reading a chapter from the book Atomic Habits by James Clear. The chapter described the importance of showing up a...

At last, we have come to the end. This is the final post in a series of linear regression posts using PyMC3, from my reading of Statistical Rethinking. Part ...

This is the next post in a series of linear regression posts using PyMC3. This series has been inspired by my reading of Statistical Rethinking. Part 1 was d...

In a previous post, I wrote about my inital experience using PyMC3. The point was to take a dive deep into some of the package’s objects using a linear regre...

I previously wrote about my discovery of Statistical Rethinking. The book’s title could not be more spot-on–it’s helped me look at statistics in a different ...

I’ve struggled to write consistently. Part of the reason for this is because I’m prone to self-censorship. Even as I type this, my mind is talking to itself....

This site is about my data science journey but various posts indicate my love for baseball. It starts with my GitHub handle: benslack19. Let’s break it down....

A few weeks ago, I learned about the wonderful Statistical Rethinking lecture series and book by Richard McElreath. It’s made me think about some of the Baye...

A year ago, I was anxious and excited about a planned life change. I had put in my notice and would start a data science program on January 13, 2020. I had b...

In the last post, we learned about the beta distribution and why it would be a more realistic prior in the context of our problem. We also selected appropria...

I meant to post this some time ago, but I have been busy. But with the baseball example I am using, it is only fitting that I post this now, just after this ...

In my previous post, we saw how Bayes’ theorem was applied to a relatively simple problem with Bertrand’s box paradox. Here I’ll talk about another applicati...

Bayes’ theorem is one of the most useful applications in statistics. But sometimes it is not always easy to recognize when and how to apply it. I was doing s...

A few weeks ago, while making a histogram in a SQL query, I discovered that some solutions out there do not include bins with 0 counts. This bugged me so I f...

I recently posted my project School and community features associated with low income student success. It was important to me to get reactions from educator...

I recently read this passage in the section on multiple linear regression from the fantastic book Introduction to Statistical Learning:

With the current days of COVID and working from home, I initially felt my data science learning and productivity slow down as my routine was disrupted. But i...

Problem statement

PostgreSQL is one of the most popular variants of SQL. It is common to use PostgreSQL with pgadmin but I am not a big fan of their UI. By contrast, interacti...

Introduction I have identified writing well as a skill I will prioritize. This improved skill will benefit both smaller forms I have taken for granted (like...

One of the things about Python that I haven’t fully appreciated are the use of iterators. I’ll go over some iterators that are a part of base Python and then...

Seems like every statistics class starts off with a coin toss. It’s simple enough for me. Some fancy teachers might start right off the bat and get into the ...

…from lessons learned from Andrew Ng’s ML course. Like other assignments of the course, the logistic regression assignment used MATLAB. Here, I translate MAT...

My wife and I have enjoyed living in the Bay Area where we’ve been able to satisfy our love of outdoor activities while being near a cool city. While we’re f...

I recently finished Andrew Ng’s fantastic and well-known Machine Learning course through Coursera. As I progress into my data science journey, I felt that ta...

I coded for a couple of years in R but switched over to Python almost a year ago. I have to say that I miss R’s ggplot2. Like a lot.

This is a summary of a discussion I led through a discussion group. This group is called the “STEM Education & Diversity Discussion Group” and it is orga...