CV

General Information

Full Name Jeet Vora
Email jeet.vora2010@gmail.com
Phone +91-9930363408
Location India
GitHub https://github.com/jeetv
LinkedIn https://linkedin.com/in/jeetvora

Education

  • 2019 - 2023
    Master’s by Research in Computer Science
    IIIT Hyderabad
    • Specialization in Artificial Intelligence and Robotics
    • CGPA 8.2/10.0
  • 2015 - 2018
    Bachelor of Engineering in Computer Science
    University of Mumbai
    • CGPA 8.96/10.0
  • 2012 - 2015
    Diploma in Computer Science
    Shri Baghubhai Mafatlal Polytechnic
    • CGPA 8.2/10.0

Experience

  • 2025 - Present
    Research Consultant
    King Abdullah University of Science and Technology (KAUST)
    • Research Collaboration - IVUL Lab, KAUST - Prof. Bernard Ghanem, Dr. Silvio Giancola, Dr. Karen Sanchez, Dr. Merey Ramazanova, and Jintao Ma.
    • Leading research and system design for SoccerNetPro, a large scale football video understanding framework for spatio-temporal AI modeling.
    • Architecting a modular and extensible Python research library supporting Action Classification, Temporal Localization, Retrieval, and Automated Video Description.
    • Developing scalable training, inference, and evaluation pipelines for multi-modal inputs, including raw video, frame sequences, pre-extracted features, and player tracking data.
    • Establishing reproducible experimentation protocols and benchmarking standards for standardized evaluation across large annotated sports datasets.
    • Conducting empirical studies on model generalization, efficiency, and performance trade-offs to advance research in sports video analytics.
  • 2022 - 2025
    Research Engineer
    Animaker India Pvt. Ltd.
    • Led applied research in AI-ML (CV/NLP) across multiple Animaker platforms (Animaker, Steve.ai, Picmaker, Vmaker), delivering AI systems serving 25M+ users.
    • Designed and optimized deep learning models for video/image matting, multimodal retrieval, script-to-Animated video generation, script-to-GenAI video architecture, controllable character builder pipeline, and audio driven talking-head animation.
    • Implemented knowledge distillation and model compression techniques achieving 8× inference speedup, enabling 4 concurrent model workers on a single 24GB GPU.
    • Architected end-to-end ML pipelines including dataset curation, training infrastructure, evaluation protocols, CI/CD integration, and GPU optimized scalable deployment.
    • Drove cross-functional collaboration with Product, DevOps, and Engineering teams to transition research prototypes into reliable, production grade AI services.
  • 2020 - 2023
    Research Student
    Star Sports (via IIIT Hyderabad)
    • Project Supervisor - CVIT Lab, IIITH - Dr. Vineet Gandhi
    • Developed Real-Time Player Tracking at 30fps with 4K/FullHD live streaming.
    • Formulated player merging from multiple cameras in bird’s eye view as a linear assignment problem.
    • Worked on camera calibration and homography for cricket grounds to enable multi-camera player tracking, bird’s-eye view transformations, and real-world trajectory analysis.
    • Set up infrastructure using Blackmagic Design products for 4K live streaming and processing.
    • Deployed and broadcasted technology for live matches at,
    • Asia Cup 2022 (UAE, Dubai).
    • Asia Cup 2023 (Sri Lanka).
    • Tamil Nadu Premier League (TNPL).
  • 2022
    Research Collaborator
    Apple (via IIIT Hyderabad)
    • Project Supervisor - CVIT Lab, IIITH - Dr. Vineet Gandhi
    • Developed a video understanding prototype for industrial assembly-line monitoring using deep learning based Video Classification and Action Recognition with Temporal modeling techniques.
    • Designed and evaluated spatio-temporal models to classify fine-grained assembly operations under real-world production constraints.
  • 2018 - 2019
    Software Engineer
    Vistaar Technologies, Inc.
    • Worked as part of the Reimbursement Solution Team, analyzing and providing technical solutions.
    • Directly involved in development, delivery and onboarding of Vistaar’s Saas based Reimbursement product for clients.
    • Designed and implemented tailored solutions for Diageo, E. & J. Gallo Winery, and Brown-Forman, improving and streamlining processes, maximizing recovery, and enhancing customer success.

Mentorship & Teaching

  • 2024 - 2025
    AI-ML Mentor
    InLustro Learning
    • Providing industry-relevant AI-ML training and mentorship to students, professionals, and faculty members through lecture sessions and hands-on workshops.
    • Mentored 14 AI-ML projects, guiding teams in model development, optimization, and deployment.
    • Collaborated with clients, including L&T Madh Training Academy and MLR Institute of Technology, to deliver tailored AI-ML solutions.
  • 2022
    Teaching Assistant - Statistical Methods in AI
    IIIT Hyderabad
    • Lecturer Dr. Vineet Gandhi
    • Course Statistical Methods in AI (SMAI), Spring 2022.
    • Prepared assignments, conducted lab sessions/tutorials, and guided students through projects.
  • 2021 - 2022
    AI-ML Mentor
    TalentSprint
    • Supervisor Dr. Anoop Namboodiri
    • Mentored AI-ML projects as part of the AI/ML program by IIIT Hyderabad and TalentSprint.
    • Provided guidance on AI-ML projects and conducted lab sessions.
    • Supervised 4 teams working on Image Tagging and Road Object Detection.

Publications

  • 2023
    Bringing Generalization to Deep Multi-View Pedestrian Detection
    WACV 2023 Workshops
    • Proposed a generalizable model for multi-camera detection.
    • Proposed and evaluated a model incorporating generalization for multi-camera detection across varying camera setups and new scenes.
    • Achieved a 20% improvement in MODA and MODP metrics over benchmarks.
    • Created synthetic datasets for multi-camera detection using GTA-V and Unity game engines.
    • Developed post-processing tools for joint calibration and synchronization of multiple cameras.
    • Demonstrated strong generalization from synthetic to real-world scenarios (Sim2Real).

Skills

  • Programming
    • Python
    • C
    • C++
    • SQL
  • Deep Learning
    • PyTorch
    • ONNX
    • TensorRT
    • Hugging Face
    • LangChain
    • OpenAI API
  • Computer Vision
    • OpenCV
    • Open3D
    • PIL
    • Unity
    • GTA-V Scripthook
  • Deployment & Infrastructure
    • AWS Sagemaker
    • EC2
    • Torchserve
    • Flask
    • Docker
  • Big Data & Analytics
    • ElasticSearch
    • Kibana

Research Interests

  • Computer Vision
  • Video Understanding
  • Multi-Camera Visual Perception
  • Creative AI
  • Generalization
  • Vision based Sports Broadcasting and Analytics

Courses

  • Statistical Methods in AI
  • Digital Image Processing
  • Computer Vision
  • Mobile Robotics
  • Optimization Methods
  • Topics in Applied Optimization