Jeet Vora

I’m a researcher working at the intersection of Computer Vision, Video Understanding, and Multi-Camera Visual Perception, with applications in Vision-based Sports Analytics and AI for Creative Media. I focus on building intelligent systems that bridge research and real-world applications.

I’m currently working as a Research Consultant with the IVUL Lab at King Abdullah University of Science and Technology (KAUST), collaborating with Prof. Bernard Ghanem, Dr. Silvio Giancola, Dr. Merey Ramazanova, and Jintao Ma on research related to large-scale sports video understanding and AI systems for sports analytics.

Previously, I worked as a Research Engineer at Animaker, where I led AI research and development for large-scale video creation platforms. My work involved designing and deploying production-grade AI systems across products like Steve.ai, Animaker, Picmaker, and Vmaker, spanning problems such as Script-To-Video generation, Talking-Head animation, Video Matting, Multimodal Retrieval, and generative and animated video creation pipelines.

Before this, I completed my Master’s by Research in Computer Science at IIIT Hyderabad, where I was advised by Dr. Vineet Gandhi and was associated with the CVIT Lab. My research focused on Multi-Camera Detection and Tracking, specifically improving Generalization in Deep Multi-View Pedestrian Detection across diverse camera setups and environments. As part of this work, I explored simulation-to-real (Sim2Real) transfer by generating synthetic datasets using GTA-V and the Unity game engine.

Alongside my research and engineering work, I’m passionate about mentorship and teaching. I’ve mentored students and professionals on end-to-end AI/ML projects through collaborations with TalentSprint and InLustro Learning. I’ve also delivered hands-on sessions for institutional clients, including L&T Madh Training Academy and MLR Institute of Technology.

More broadly, I’m interested in translating research ideas into robust, scalable AI systems, and exploring the intersection of Perception, Creativity, and Intelligent Systems.