Jeet Vora

I’m a Research Engineer at Animaker, where I head a focused AI research team and drive projects end to end, from prototyping Deep Learning models to deploying Production-Ready AI features at scale across products like Steve.ai, Animaker, Picmaker, and Vmaker.
My work lies at the intersection of Computer Vision, Natural Language Processing, and Generative AI. I focus on building intelligent systems that automate video creation, including Script-to-Live/GenAI/Animation pipelines, Talking Head Animated Characters, Video Matting, and GenAI-powered video creation using Animaker 2.0 Pre-built scenes.
Before this, I completed my Master’s by Research in Computer Science at IIIT-Hyderabad, where I was advised by Dr. Vineet Gandhi and was associated with the CVIT Lab. My research focused on Multi-Camera Detection and Tracking, specifically on improving Generalization in Deep Multi-View Pedestrian Detection across diverse camera setups and changing camera positions and orientations. It also aimed to bridge the gap between simulation and real-world performance (Sim2Real) using synthetic data generated with GTA-V and the Unity game engine.
Alongside my R&D work, I’m deeply passionate about mentorship and teaching. I’ve mentored students and professionals on their end-to-end AI/ML projects in collaboration with TalentSprint and InLustro Learning. I’ve also delivered hands-on sessions for institutional clients such as L&T Madh Training Academy and MLR Institute of Technology.
I’m driven by research and by translating it into robust, real-world AI systems. I’m also curious about the intersection of Perception, Creativity, and Technology.