Jitesh Joshi
Researcher @ University College London (UCL).

I’m a PhD researcher in Computer Science at University College London (UCL), specializing in robust computer vision and deep learning systems that perform reliably across diverse real-world conditions. My research advances contactless camera-based physiological sensing through novel multidimensional attention mechanisms grounded in non-negative matrix factorization, enabling joint attention computation across spatial, temporal, and channel dimensions.
This work, published at NeurIPS 2024, achieved significant improvements in cross-dataset generalization while maintaining computational efficiency—addressing a critical challenge in deploying AI systems beyond controlled laboratory settings. My research spans computer vision, foundational models, physiological computing, and generative AI, with particular focus on architectures that balance robustness with efficiency.
Prior to my PhD, I spent over a decade in healthcare technology, leading AI, R&D, and systems engineering teams to translate research prototypes into commercial products, including FDA/CE-certified medical devices. This experience bridging research and product development informs my approach to tackling technically fascinating challenges while ensuring meaningful societal impact.
Some of my recent research publications are highlighted below.
news
Jun 2, 2025 | How matrix factorization can serve as attention mechanism? Check out our blog on our NeurIPS, 2024 work, titled “FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing” that establishes strong baseline for cross-dataset generalization in remote physiological sensing. |
---|---|
May 18, 2025 | Check out the web-browser based real-time demo of our recent work on multitask estimation of remote photoplethysmography (rPPG) and remote respiratory (rRSP) signals from facial video. Preprint of this work can be accessed on arXiv. |
May 12, 2025 | Today marks a significant milestone as I submitted my PhD thesis, “Enhancing Out-of-distribution Generalization for Robust Camera-based Remote Physiological Sensing,” concluding four years of research at University College London with defense scheduled for July 2025. |
Apr 15, 2024 | iBVP Dataset: RGB-Thermal rPPG dataset with signal quality labels, MDPI Sensors, 2024 |
Feb 28, 2023 | How to tackle real-world challenges in thermal image segmentation? Check out our BMVC, 2022 work titled, “Self-Adversarial Multi-scale Contrastive Learning for Semantic Segmentation of Thermal Facial Images”, that delves into domain-specific augmentation and novel contrastive learning framework. |
selected publications
- Efficient and Robust Multidimensional Attention in Remote Physiological Sensing through Target Signal Constrained FactorizationarXiv: 2505.07013 [cs.CV], 2025
- FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological SensingIn The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
- iBVP Dataset: RGB-Thermal rPPG Dataset with High Resolution Signal Quality LabelsElectronics, 2024
- Self-adversarial Multi-scale Contrastive Learning for Semantic Segmentation of Thermal Facial ImagesIn 33rd British Machine Vision Conference 2022, BMVC 2022, London, UK, November 21-24, 2022, 2022