Research Mission
Build fundamental AI/ML methods for computer vision and language modeling to address societal challenges.
Research Interests
Our research is highly interdisciplinary and collaborative, and interests include vision-language models, large language models, computer vision, text/image/video classification, text recognition (OCR, Handwritten), human action analysis, multispectral imaging and NLP tasks.
Research Projects
- Multimodal deep learning
- Vision-language(-action) models
- Multispectral handwritten text modelling and recognition
- Degradation-aware Handwritten Text Recognition
- Large Language Models and applications in computer vision
- Human fall detection on untrimmed long form videos
News
- Dec. 2024: Robin Hollifeldt’s PhD project on Multimodal Deep Learning has been approved in Affiliated WASP PhD student position call.
- Sept. 2024: We welcome Robin Hollifeldt as our PhD student!
- May 28, 2024: Ekta Vats got promoted to a Docent in Computerised Image Processing. Docent Lecture: Introduction to Large Language Models in Image Analysis: Theory and Applications. Read more!
- New funding from the UU Graduate School in Cybersecurity for project: Large language models-powered social robots in cybersecurity applications (CYBERBOT). PI: Ginevra Castellano, Co-PIs: Ekta Vats, Katie Winkle and Boel Nelson.
- Ekta Vats’ interview with Beijerstiftelsen: 3 frågor till nya Beijerforskaren Ekta Vats (3 questions for the new Beijer Researcher Ekta Vats)
- Dec. 2023: Women in Data Science Sweden (WiDS) mentorship program wrap up! Ekta Vats served as a mentor
- Oct. – Dec. 2023: Till Grutschus from Technical University of Munich joined us on an exchange semester. Project: Human fall detection on untrimmed videos using large foundational video-understanding model
- Oct. 5, 2023: Beijerforskardagen
- Oct. 4, 2023: Raphaela M. Heil defended her thesis titled Document Image Processing for Handwritten Text Recognition. Deep Learning-based Transliteration of Astrid Lindgren’s Stenographic Manuscripts.
Hiring!
- [Closed] Postdoc position in Multimodal Deep Learning (Deadline: Dec. 9).
- [Closed] PhD position: PhD student in Machine Learning and Computer Vision (Deadline: Aug. 12).
- [Closed] PhD position: PhD student in social robotics with focus on large language models and cybersecurity (Deadline: May 23).
- [Closed] Postdoc position: Postdoctoral position in Deep Learning with a focus on Vision-Language Models (Deadline: May 14)
- [Closed] PhD position: PhD student in Machine Learning with a focus on Vision-Language Models (Deadline: March 28)