Archives
How To Mine Your GMail with Google Takeout and MongoDB
Posted on February 14, 2014 Leave a Comment

Google has really been on the up-and-up lately with a service called Google Takeout that allows you to export your data from its cloud. For the thoughtful cloud user who is becoming increasingly concerned about privacy, accidental data loss, or data ownership, this is a product that’s sure to please. Likewise, for the data mining enthusiast, quantified-self […]
Twitter Data Mining Round Up
Posted on February 10, 2014 Leave a Comment

Since the release of Mining the Social Web, 2E in late October of last year, I have mostly focused on creating supplemental content that focused on Twitter data. This seemed like a natural starting point given that the first chapter of the book is a gentle introduction to data mining with Twitter’s API coupled with […]
Understanding the Reaction to Amazon Prime Air (Or: Tapping Twitter’s Firehose for Fun and Profit with pandas)
Posted on December 19, 2013 2 Comments

On Cyber Monday eve, Jeff Bezos appeared in a 60 Minutes segment and revealed to the world that he’s been working on an experimental effort called Amazon Prime Air. The general idea behind Amazon Prime Air is that Amazon may one day deliver relatively lightweight items directly to your doorstep in less than 30 minutes […]
What Do Tim O’Reilly, Lady Gaga, and Marissa Mayer All Have In Common?
Posted on November 22, 2013 4 Comments

This post examines the followers of some popular Twitter users as the final installment of a multi-part series about exploring Twitter influence by asking the (Freakonomics-inspired) question, What do Tim O’Reilly, Lady Gaga, and Marissa Mayer all have in common? Although it may initially seem like an obnoxious question to ask, some of the answers may intrigue you […]
Mining Social Web APIs with IPython Notebook [Slides]
Posted on October 30, 2013 1 Comment

Thanks so much to everyone who attended the Mining Social Web APIs with IPython Notebook workshop. It was really inspiring to see so many of you get your hands dirty hacking on data (as opposed to just talking or thinking about it.) It’s a lot of work to design a 3 hour workshop for such […]
Now Serving: Full-Text Sampler in IPython Notebook Format
Posted on October 25, 2013 3 Comments

The 2nd Edition of Mining the Social Web has officially soft-launched (the “hard-launch is at my Strata workshop next week), and as of late last week you could download either a PDF file or view an ebook excerpt of the first chapter that introduces data mining with Twitter’s API. Additionally, as of just a few hours ago, the full-text of the first […]
Reflections on Authoring a Minimum Viable Book
Posted on August 24, 2013 4 Comments

As the production of the second edition of Mining the Social Web nears completion with an estimated publishing date of mid-September 2013, I wanted to share some thoughts and reflections on what it has been like to write and then (mostly) re-write and re-launch a book. Like anything else, there’s always a backstory, and this […]