SINGAPORE-MIT ALLIANCE FOR RESEARCH AND TECHNOLOGY CENTRE
See if you qualify before applying
Get your match score and top strengths in seconds.
We are seeking one Postdoctoral Associate in Computer Science to join the research program "Mens, Manus and Machina—How AI Empowers People, Institutions and Cities in Singapore (M3S)." Successful applicants for this position will have the opportunity to work on cutting-edge research at the intersection of reinforcement learning (RL), world models, embodied intelligence, and human learning. The research targets a new generation of intelligent agents that learn through sequential interaction, and is organized around two pillars united by a shared reinforcement learning foundation.
Embodied intelligence. This part focuses on robotic agents operating in unstructured 3D environments (homes, offices, industrial sites, shopping malls). The work centres on (a) model-based and hybrid RL algorithms built on multi-modal world models (vision, language, proprioception, action) that support long-horizon planning, principled exploration, and sim-to-real transfer; (b) compositional and hierarchical world models that translate higher-order, qualitative human goals into grounded sequences of manipulation and navigation actions, with suitable adaptation to account for task-specific performance constraints (e.g., accuracy, completion deadline); and (c) learning from human feedback, e.g., imitation, inverse RL, RLHF, and interactive/intervention-based learning, so that robot behaviour remains aligned with human intent under partial observability.
Human learning. This part develops an adaptive learning system in which an AI tutor estimates each student's evolving knowledge state from their interaction history (responses, errors, time-on-task) and personalizes what to deliver next, e.g., lessons, exercises, hints, and review items, to maximize long-term learning gains. RL is the natural framing here: the tutor is a sequential decision-maker, the learner's knowledge state is the (partially observed) environment, and the world model is a learner model that predicts how a student's mastery will evolve under different pedagogical actions. Notably, the tutor’s actions can encompass not just the temporal phasing of knowledge content (the curriculum) but also the 3D presentation of such content (including AI-based generation of 2D/3D vision-language embodiments of such content). The research advances knowledge tracing under partial observability, curriculum and content sequencing, and learning from human feedback (instructor preferences, learner self-reports) to keep recommendations pedagogically aligned.
This program is led by distinguished scholars, including Profs. Sanjay Sarma, Daniela Rus and Jinhua Zhao from MIT; and Prof. Archan Misra from Singapore Management University.
To find out more about this role, please contact Professor Archan Misra (archanm@smu.edu.sg) and Dr Alok Prakash (alok.prakash@smart.mit.edu).
To apply, please visit our website at: https://portal.smart.mit.edu/careers/career-opportunities
Interested applicants are invited to send in their full CV/resume, cover letter and list of three references (to include reference names and contact information). We regret that only shortlisted candidates will be notified.