Archives

Twitter Data Mining Round Up

Since the release of Mining the Social Web, 2E in late October of last year, I have mostly focused on creating supplemental content about Twitter data. This seemed like a natural starting point given that the first chapter of the book is a gentle introduction to data mining with Twitter’s API coupled with […]

Read More

Mining Social Web APIs with IPython Notebook [Data Day Texas Workshop Slides]

Thanks to everyone who attended the Mining Social Web APIs with IPython Notebook workshop at Data Day Texas. I’m really glad that I made the trip down to Austin and could share some of my work with you. The data truly is bigger in Texas, Austin was a fantastic city to visit, and everyone I had […]

Read More

Understanding the Reaction to Amazon Prime Air (Or: Tapping Twitter’s Firehose for Fun and Profit with pandas)

On Cyber Monday eve, Jeff Bezos appeared in a 60 Minutes segment and revealed to the world that he’s been working on an experimental effort called Amazon Prime Air. The general idea behind Amazon Prime Air is that Amazon may one day deliver relatively lightweight items directly to your doorstep in less than 30 minutes […]

Read More

What Do Tim O’Reilly, Lady Gaga, and Marissa Mayer All Have In Common?

This post examines the followers of some popular Twitter users as the final installment of a multi-part series about exploring Twitter influence by asking the (Freakonomics-inspired) question, What do Tim O’Reilly, Lady Gaga, and Marissa Mayer all have in common? Although it may initially seem like an obnoxious question to ask, some of the answers may intrigue you […]

Read More

Super Simple Storage for Social Web Data with MongoDB (Computing Twitter Influence, Part 4)

In the last few posts for this series on computing Twitter influence, we’ve reviewed some of the considerations in calculating a base metric for influence and how to acquire the necessary data to begin analysis. This post finishes up all of the prerequisite machinery before the real data science fun begins by introducing MongoDB as a […]

Read More

Getting Started with Twitter’s API: From Zero to Firehose in ~2.5 Minutes

Mining the Social Web’s goal is to teach you how to transform curiosity into insight, and its virtual machine features two IPython Notebooks that are designed to get you up and running with Twitter’s API as quickly as possible. The following ~2.5 minute screencast shows how to generate OAuth credentials, establish a Twitter API connection, and make API […]

Read More

Twitter Could Be So Much Better Than An Advertising Company

If you’re a business with enough users, you can probably make some money by placing advertisements. Advertising drives commerce, and commerce is fundamental to a healthy economy. It’s a great and wonderful thing that profits are earned, jobs are created, taxes are paid, and a virtuous cycle develops around the commerce that results from advertising. […]

Read More

Mining Social Web APIs with IPython Notebook [Slides]

Thanks so much to everyone who attended the Mining Social Web APIs with IPython Notebook workshop. It was really inspiring to see so many of you get your hands dirty hacking on data (as opposed to just talking or thinking about it). It’s a lot of work to design a 3-hour workshop for such […]

Read More

How To Harvest Millions of Twitter Profiles Without Violating the ToS (Computing Twitter Influence, Part 3)

In the last post in this continuing series on computing Twitter influence, we developed a wrapper function called make_twitter_request that handles the various sorts of HTTP error codes and network failures that you are likely to experience as you aspire to acquire non-trivial amounts of data from Twitter’s API. Although you are somewhat unlikely to […]

Read More
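
The excerpt above mentions a wrapper function, make_twitter_request, that copes with HTTP error codes and network failures. The post's actual code is not reproduced on this page, so what follows is only a minimal sketch of the general retry-with-backoff idea, not the author's implementation. The names call_with_retries and TransientError, along with the max_retries, base_wait, and sleep parameters, are hypothetical and chosen for illustration.

```python
import time


class TransientError(Exception):
    """Hypothetical stand-in for a retryable HTTP or network failure."""

    def __init__(self, status_code):
        super().__init__("transient failure: HTTP {}".format(status_code))
        self.status_code = status_code


def call_with_retries(request_fn, *args, max_retries=3, base_wait=1.0,
                      sleep=time.sleep, **kwargs):
    """Call request_fn, retrying on TransientError with exponential backoff.

    This is an illustrative assumption about how such a wrapper might look;
    it does not talk to Twitter and knows nothing about real rate limits.
    """
    wait = base_wait
    for attempt in range(max_retries + 1):
        try:
            return request_fn(*args, **kwargs)
        except TransientError:
            if attempt == max_retries:
                raise  # out of retries, so let the caller see the failure
            sleep(wait)  # pause before trying again
            wait *= 2    # double the pause for the next attempt
```

Passing the sleep function in as a parameter is a design choice that keeps the sketch easy to exercise without real delays; a caller would wrap whatever function performs the actual API request and raise TransientError (or an equivalent) for the failures it considers retryable.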

Why Is Twitter All the Rage?

Next week, I’ll be presenting a short webcast entitled Why Twitter Is All the Rage: A Data Miner’s Perspective that is loosely adapted from material that appears early in Mining the Social Web (2nd Ed). Given that the webcast is now less than a week away, I wanted to share the content that inspired the topic. This […]

Read More