I am interested in developing algorithms for 3D visual perception. It includes 3D reconstruction of the objects in the scene and the visual perception in the moving world. While such processing is trivial to the human visual system, most of the sophisticated computer vision algorithms come nowhere close in terms of performance and are unable to do the processing online.
I am working on the project AirCap, where the goal is to develop a 3D shape and motion capture system in outdoor scenarios using multiple UAVs. Each UAV is equipped with an RGB camera and onboard computation capabilities to process the camera input. I am interested in developing shape and pose estimation algorithms which can execute on each UAV’s computation unit with minimal inter-UAV communication. The algorithm should take advantage of multi-view RGB input and should provide feedback to the UAV’s flight controller for the best formation planning of the UAVs. I am also working at the software workshop where I am implementing the derivative calculation of SMPL model using OpenCL to harness the power of parallel GPU computing.
I have completed my Master studies in Neural Information Processing from the International Max Planck Research School of Cognitive and Systems Neuroscience, University of Tuebingen. Before that, I have worked in Samsung R&D Institute Bangalore, India for two years. I have done my Bachelors in Electronics and Communication Engineering from IIT(BHU) Varanasi, India.
International Conference on Computer Vision, October 2019 (conference) Accepted
Capturing human motion in natural scenarios means moving motion capture out of the lab and into the wild. Typical approaches rely on fixed, calibrated, cameras and reflective markers on the body, significantly limiting the motions that can be captured. To make motion capture truly unconstrained, we describe the first fully autonomous outdoor capture system based on flying vehicles. We use multiple micro-aerial-vehicles(MAVs), each equipped with a monocular RGB camera, an IMU, and a GPS receiver module. These detect the person, optimize their position, and localize themselves approximately. We then develop a markerless motion capture method that is suitable for this challenging scenario with a distant subject, viewed from above, with approximately calibrated and moving cameras. We combine multiple state-of-the-art 2D joint detectors with a 3D human body model and a powerful prior on human pose. We jointly optimize for 3D body pose and camera pose to robustly fit the 2D measurements. To our knowledge, this is the first successful demonstration of outdoor, full-body, markerless motion capture from autonomous flying vehicles.
Our goal is to understand the principles of Perception, Action and Learning in autonomous systems that successfully interact with complex environments and to use this understanding to design future systems