About the Job
We are seeking a highly motivated and accomplished Research Assistant to join our team and contribute to our ongoing research project, "Understanding and Generating Interleaved Image-Text Persuasion". The successful candidate will collaborate closely with our research team on the development of multimodal AI. This role offers an opportunity to engage in cutting-edge research in multimodal learning and have a substantial influence on the field.
What You’ll Do
- Conduct literature reviews on multimodal persuasion, visual communication, and the evaluation of large vision-language models.
- Support the design and development of datasets and evaluation frameworks for understanding interactions between images and text.
- Assist in evaluating and enhancing multimodal models for analyzing and generating interleaved image-text content.
- Summarize experimental results and contribute to the preparation of research reports, presentations, and academic publications.
Who We’re Looking For
- Bachelor's degree with a background in computer science, data science, artificial intelligence, or a related field.
- Strong interest in multimodal AI and vision-language models.
- Basic experience with Python and machine learning frameworks such as PyTorch is preferred.
- Familiarity with natural language processing, computer vision, or large language models is an advantage.
- Good analytical, communication, and organizational skills.
- Ability to work independently while collaborating effectively with a research team.