OPEN SOURCE
1. Context
- At the heart of Twitter's business model is the recommendation algorithm. It is a set of rules that enables the platform to deliver contentenableon users' interests and preferences.
- It is with the help of this system that advertisers promote their brands.
- Content flows into a user's timeline through two channels.
- One pipeline channels content posted by people that the user follows
- The other is filled with posts flowing from accounts that could potentially interest the user.
- The latter timeline, called "For You", acts as a predictive tool to sugYou feeds that a user may be interested in.
- It helps the microblogging site find answers to questions like what is the probability a user will engage with another user in the future, what communities on Twitter might a user be interested in and what tweets are trending within them. Answers to such questions help the platform recommend relevant content.
3. Working on a new Algorithm
- Twitter's recommendation algorithm runs on a three-step process.
- First, it fetches tweets from multiple recommendation sources. The platform calls this process candidate sourcing.
- After sourcing these tweets, a machine-learning model ranks them.
- After ranking, tweets are filtered to remove those that a user may have blocked or has already seen.
- At the sourcing stage, the algorithm mixes tweets that eventually flow into a user's "For You" timeline.
- To mix, it picks candidates from people a user follows and from those they do not.
- It calls these two sources "in-network" and "out-network" and together they make up 50/50 of the mix.
- Twitter says that for each request it attempts to extract the top 1, 500 tweets from a pool of hundreds of millions.
- The network part is quite easy to build as information is picked in real time from people a user follows.
- But out-network sourcing is trickier as the platform must pick content from candidates the user does not follow.
- To do this, Twitter uses what it calls social graphs and embedded spaces.
- The former creates a stream of candidates based on what content followers of a user engage with.
- And the latter matches the profile of a user with a cluster that exhibits similar interests and preferences as the user.
- Once this is done, using a 48M parameter neural network that is continuously trained on tweet engagement, the platform starts ranking feeds.
4. The reaction from the public
- After Twitter open-sourced its recommendation algorithm, many people flocked to GitHub to view the code.
- Some see this reveal as "a step in the right direction for the future of humanity".
- Others note that the code does not reveal much about how it is used by the platform.
- They also highlight that important bits of information have been left out.
- For instance, the absence of information on the data it uses to build these pipelines prevents one from having a complete picture of the platform's recommendation system.
- Open-sourcing any algorithm requires its training set to be open-sourced as well and that is impossible for Twitter to do.
- Every effort is open-sourcing the algorithm without the data is completely dishonest.
5. Reasons for Twitter open source code
- Mr Mask is gradually building Twitter as a place for privileged users who can pay for verification tick marks and get additional features, including a higher ranking in the feeds.
- Additionally, he is making these changes at a time when he has fired most of the company's technical staff.
- Social media platforms need experienced developers to keep building new features and deploying them successfully.
- So, perhaps, Musk thinks opening the source code to external developers could potentially solve the human resource bottleneck.
- But it will be a tough road as Mr Musk has damaged Twitter's reputation in the open-source community.
- Opensource communities are built on relationships and trust and now Twitter has neither with these groups.
- They have lost any ability to participate meaningfully in those communities.
For Prelims & Mains
For Prelims: algorithm, Twitter, Open Source, GitHub
For Mains:
1. What is an open-source Algorithm? Discuss how the microblogging site's algorithm functions. (250 Words)
|
Previous year questions
1. Consider the following: (UPSC 2022)
1. Aarogya Setu
2. CoWIN
3. DigiLocker
4. DIKSHA
Which of the above are built on top of open-source digital platforms?
A. 1 and 2 only B. 2, 3 and 4 only C. 1, 3 and 4 only D. 1, 2, 3 and 4
Answer: D
|