In an increasingly data-driven world, big data is infiltrating all aspects of our lives. As such, data literacy is becoming a central skill for the next generation of STEM students. However, most data science courses focus on pre-collected datasets and neglect challenges associated with data collection and data cleaning. This project is about teaching data science skills through real-world, ecological, multimodal datasets through a semester-long course. For this purpose, we developed a website where multimodal data (eye-tracking, motion, physiological) can easily be collected through traditional webcams. Traditionally collecting this kind of fine-grained data required dedicated hardware and software.