We are looking for a Postdoc to work on multimodal representation learning (advertisement coming up soon)! Get in touch!

 

Research Mission

Build fundamental AI/ML methods for computer vision and language modeling to address societal challenges.

Research Interests

Our research is highly interdisciplinary and collaborative, and interests include large language models, computer vision, text/image/video classification, text recognition (OCR, Handwritten), human action analysis, multispectral imaging and NLP tasks (sentiment analysis, Named entity recognition).

Research Projects

  • Multimodal deep learning and Vision-language(-action) models
  • Multi-spectral imaging for cultural heritage collections
  • Handwritten text recognition for severely degraded manuscripts
  • Human fall detection on untrimmed long form videos

News

  • MSc thesis projects announcement: Topic 1: Palimpsest text separation and recognition; Topic 2: Degradation-aware self-attention. Reach us out!
  • Sept. 2024: We welcome Robin Hollifeldt as our PhD student!
  • May 28, 2024: Ekta Vats got promoted to a Docent in Computerised Image Processing. Docent Lecture: Introduction to Large Language Models in Image Analysis: Theory and Applications. Read more!
  • New funding from the UU Graduate School in Cybersecurity for project: Large language models-powered social robots in cybersecurity applications (CYBERBOT). PI: Ginevra Castellano, Co-PIs: Ekta Vats, Katie Winkle and Boel Nelson. 
  • Ekta Vats’ interview with Beijerstiftelsen: 3 frågor till nya Beijerforskaren Ekta Vats (3 questions for the new Beijer Researcher Ekta Vats)
  • Dec. 2023: Women in Data Science Sweden (WiDS) mentorship program wrap up! Ekta Vats served as a mentor
  • Oct. – Dec. 2023: Till Grutschus from Technical University of Munich joined us on an exchange semester. Project: Human fall detection on untrimmed videos using large foundational video-understanding model
  • Oct. 5, 2023: Beijerforskardagen
  • Oct. 4, 2023: Raphaela M. Heil defended her thesis titled Document Image Processing for Handwritten Text Recognition. Deep Learning-based Transliteration of Astrid Lindgren’s Stenographic Manuscripts.

Hiring!