Skip to content
View mediacontentatlas's full-sized avatar
  • Stanford University

Block or report mediacontentatlas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mediacontentatlas/README.md

Anonymous Submission for CHI'25 👋

As digital media use continues to evolve and influence various aspects of life, developing flexible and scalable tools to study complex media experiences is essential. This study introduces the Media Content Atlas (MCA), a novel pipeline designed to help researchers investigate large-scale screen data beyond traditional screen-use metrics. Leveraging state-of-the-art multimodal large language models (MLLMs), MCA enables moment-by-moment content analysis, content-based clustering, topic modeling, image retrieval, and interactive visualizations. Evaluated on 1.12 million smartphone screenshots continuously captured during screen use from 112 adults over an entire month, MCA facilitates open-ended exploration and hypothesis generation as well as hypothesis-driven investigations at an unprecedented scale. Expert evaluators underscored its usability and potential for research and intervention design, with clustering results rated 96% relevant and descriptions 83% accurate. By bridging methodological possibilities with domain-specific needs, MCA accelerates both inductive and deductive inquiry, presenting new opportunities for media and HCI research.

image

In this repo, you will find the code files for the following parts of MCA:

MCA Pipeline

MCA Evaluation

Notes

  • This repo will be updated regularly for better reproducibility, meanwhile open issues for any questions.
  • Working on creating a synthetic dataset to showcase the whole pipeline.

Popular repositories Loading

  1. mediacontentatlas mediacontentatlas Public

    Code for Media Content Atlas

    Python

  2. mediacontentatlas.github.io mediacontentatlas.github.io Public

    JavaScript