I coordinated a project with my classmates (Chun-Yen Pan, McKayla Sharp, Sarah Stukalin and Xiaoyan Zhang) for a course on “Methods of data collection and production’’(EPPS 6302). The project culminated in a report which can be read here. The Github repository containing the data collected in the project and the code used for the data collection and analysis can be found here: admapper Github repo
This project involved collating a sample frame of URLs of US news sources followed by scraping data (
ads.txt) from these URLs. Then we proceeded to make a graph database on Neo4j for scaling up the dataset in the future and finally performed bipartite network analysis using the ‘Method of Reflections’ technique. A major focus of this class project was directed towards collection, cleaning and creation of the database since the class was on data collection. We also explored the theoretical aspects behind the digital advertising market and explicated it in the report. I am thankful to all my classmates for being patient in working with me on this nuanced subject!
If you have read my other posts on this blog, you will notice that this project is an extension of my interest in regulation of digital platforms which was kindled somewhere between late 2020 and early 2021, thanks to the Tech Policy course at the Takshashila Institution. Earlier, in May 2021, I had coordinated a project with the cohort of Complexity Weekend 2021 (Bernado Furtado, Kishor Acharya among others in the cohort). You can check out the Github repository of that project containing some skeleton code and a presentation that was done in the closing livestream of the Complexity Weekend event here. That project had a different aim and attempted to create an Agent-Based Model (ABM) to explain digital platforms. It succeeded in creating a very basic version of the ABM. But, this is the first time I have taken a jab at the problem using real world data. Alongside this project, I was also learning Economic Complexity (which I have written about here) for the work I was doing at the ArtSciLab and ended up stumbling upon the ‘Method of Reflections’ technique for analyzing bipartite networks which I used in this project. Overall, this project pulled from several past experiences and learnings. I am fairly certain this project will serve as a stepping stone to more work on digital platform regulations in the future!