Skip to content

How to use Twitter follower graph to identify subject matter experts topically - From the perspective of a PhD/Data Scientist #1861

@nathan-oplus

Description

@nathan-oplus

The Problem.
As a Twitter user, I am very interested in what thought leaders have to say, especially about their given area of expertise. I also recognize that Twitter is unique among all social network because it seems all the world experts flock to the platform to argue about current events. I would like a version of the Twitter algorithm that prioritizes comments from these subject matter experts.

But how can you algorithmically define an expert?

Proposed formulation

I propose that the thing that will make the accounts of thought leaders exceptional is that they will be engaged by other experts more often. The big names in Machine Learning for example, (think Andrew Ng, Yann LeCun) only follow a small number of other accounts. Those accounts are most likely some of the best ML accounts to follow. It seems to me this will be true of many many important subject of today.

If you take any news item, lets say the recent submersible implosion, it would be really interesting to see what the materials experts have to say about it, what the submersibles experts have to say etc etc.

If you looked at the follower graph, and you assigned a weight to each edge that was the ratio of followers to follows the account has. For example, @andrewyng follows this account: @jackk. @andrewyng has almost 900k followers and is only following 700 people so this follow is very meaningful. Let's make an equation:

w_e = n_followers/n_follows

w_e is the weight of the edge, n_followers is the number of followers of @andrewyng in this example ~900k, n_follows is the number of follows of @andrewyng.

If you then snip any edge that has w_e < threshold value (some number >1 lets say 5 or 10), you'll get a new follower graph. This graph should represent a map of the meaningful twitter voices.

You might then do some clustering in the graph, maybe make some vector embeddings of the posts by these accounts, and it's not a stretch of the imagination to think you'd be able to come up with topical clusters of the accounts that matter for any subject that people talk about.

How powerful would it be to take any given topic and get a stream of the biggest voices in that area automatically?

Isn't this what everyone is trying to do with their follows anyway??

Why I say this is democratic

The beauty of this method is that it finds experts by the number of people that have followed that account. These are the votes of the people. People listen to Andrew Ng, this is why he has followers. This is not arbitrary, and there was no man behind the curtain that needed to deem Andrew as the voice of truth. Therefore, assigning more weight to his interactions/follows makes sense because the people have given him that weight with their voice and follows. This could effectively turn twitter into a self regulating thought ecosystem able to find authoritative opinions from thought leaders in real time as events unfold.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions