Contributor :

Snehil Shah

Mentor :

Denny George
When dealing with online harms on social media in India you come across challenges that are unique to India. Firstly a majority of the data is in the form of images, videos and audios and secondly its in various Indian languages. These challenges make it hard to study the extent of harms on a large scale in an automated manner. Some of the foundational tools and datasets that we are building aims to close this game. Feluda is one such tool that makes analysing multimodal and multilingual content easier. As part of this project we reviewed state of the art ML models, evaluated their capabilities and limitations in working with the kind of data we find on Indian social media and built the capability to analyze a collection of videos and run automations to group similar videos together and assign tags to them. This will feed into downstream annotation and review workflows that will help surface thematic trends in videos.

Contributor :

Snehil Shah

Mentor :

Denny George
When dealing with online harms on social media in India you come across challenges that are unique to India. Firstly a majority of the data is in the form of images, videos and audios and secondly its in various Indian languages. These challenges make it hard to study the extent of harms on a large scale in an automated manner. Some of the foundational tools and datasets that we are building aims to close this game. Feluda is one such tool that makes analysing multimodal and multilingual content easier. As part of this project we reviewed state of the art ML models, evaluated their capabilities and limitations in working with the kind of data we find on Indian social media and built the capability to analyze a collection of videos and run automations to group similar videos together and assign tags to them. This will feed into downstream annotation and review workflows that will help surface thematic trends in videos.

About Contributor

Snehil is a computer science student with a strong interest in technology and software. Apart from that, he also has a dormant passion for audio DSP and synthesis.

About Mentor

Denny is trained in User Experience Design and Software Development. He is currently the co founder and tech lead at Tattle Civic Technologies. At Tattle he is interested in conceptualizing and creating open source tools and open datasets to study and respond to online harms in India.

Key Impact Takeaways:

  1.  This project will allow researchers and fact-checkers to easily analyze and work with videos on Tattle’s Kosh platform.
  2. This project will power Tattle’s Deepfake Analysis Unit.

Contributor Experience