MI X

The MIx Group @ University of Birmingham

3 minutes

an img

Machine Intelligence + X,

X =

We are the Machine Intelligence + x group at the School of Computer Science, University of Birmingham. Welcome!
Our group mainly studies machine learning and computer vision, and also interested in other applied machine learning problems including multimodal data, neuroscience, healthcare, physics, chemistry, to name a few. That is where the x lies in.

Key research interests:

Learning representations with limited human supervision, e.g. self-/semi-/weakly-supervised learning
Multimodal data processing and analysis, e.g. vision-language, vision-audio, etc.
Open-world problems, e.g. incremental learning, open-vocabulary visual understanding
Visual semantics understanding, e.g. semantic segmentation, saliency modelling
3D problems, e.g. depth estimation, multi-view geometry, 3D generation
Healthcare, e.g. medical image understanding and analysis, explanable AI for healthcare
AI for science, including neuroscience, physics and chemistry

News #

 
May 2025:   Our BinEgo-360 Workshop is accepted to ICCV 2025, we invite paper presentations and challenge participation!
Mar 2025:   Our paper about incremental learning is accepted to CVPR 2025, congrats to the co-authors!
Feb 2025:   Congratulations to Kangning on successfully defending her viva!
Feb 2025:   Our paper about molecular structure prediction is accepted to The Journal of Physical Chemistry, congrats to Wenjin!
Jan 2025:   Our paper about open-world learning is accepted to ICLR 2025, congrats to Qiming!
Nov 2024:   Very glad to receive the Best Paper Award at the MICAD 2024! Congrats to Kangning and the Team!
Nov 2024:   Great thanks to Meta Project Aria for their in-kind contributions, and honoured to be an Academic Partner!
Oct 2024:   Very glad to receive the Best Paper Award and Best Presentation (Runner-Up) Award at the MICCAI 2024 ASMUS Workshop! Congrats to Kangning and the Team!
Sep 2024:   Two papers (1 Spotlight) accepted to NeurIPS 2024, and one paper accepted to ACCV 2024, congrats to all the co-authors!
Sep 2024:   Welcome the new PhD students Isaac, Hao, Peixi, and Haotian joining the group!
Jul 2024:   The paper "Show from Tell" is now published in Scientific Reports (Nature Portfolio)! Please check it out here: https://rdcu.be/dNcmb and here :)
Jul 2024:   Three papers accepted to ECCV 2024, two papers accepted to MICCAI Workshop and ACM MM 2024, congrats to all the co-authors!
Mar 2024:   Very grateful to be awarded the Amazon Research Award!
Feb 2024:   Two papers (the  360+x (Oral) multi-modal holistic scene understanding dataset, and DyMvHuman dynamic multiview dataset) are accepted to CVPR 2024; 
            and another two papers are accepted to ISBI 2024 (Oral) and T-IP. Congrats to all the co-authors!
Oct 2023:   Two papers (1 Oral 1 Poster) are accepted to WACV 2024, congrats to all the co-authors!
Sep 2023:   Grateful to be awarded the Royal Society Short Industry Fellowship!
Aug 2023:   One paper is accepted to IJCV. Congrats to all the co-authors!
Jul 2023:   Four papers are accepted to ICCV 2023. Congrats to all the co-authors (esp. the MSc students Hao and Chenyuan)!
Apr 2023:   Grateful to receive the International Exchanges Grant from The Royal Society!
Apr 2023:   Two papers are accepted to CVPR 2023 Workshops (Foundation Model and Sight and Sound) about self-supervised multi-modal (video-text-audio) representation learning
Mar 2023:   Two papers are accepted to ICLR 2023 workshops (TML4H and Neural Fields) about medical video quality assessment and neural representations in low-level vision. Congrats to Jong (PhD) and Wentian (MSc)!
Oct 2022:   Very glad to receive the Best Paper Award at the ECCV 2022 Workshop on Medical Computer Vision! Congrats to the PULSENet Team!
Sep 2022:   One paper is accepted to NeurIPS 2022 about continual learning
Aug 2022:   One paper is accepted to ECCV 2022 Workshop (ECCV-MCV) about anatomy-aware contrastive medical representation learning
Feb 2022:   Birthday of the MIx group @ the University of Birmingham

Collections

Sections

Contact and Join Us

Contact E-mail: mix.group.uk@gmail.com Join us We are always looking for people with strong self-motivation, unusual creativity, and passion for hard problems! If you share the same intetests and passion with us, please send your CV together with a short description (2 – 3 sentences) of your research interests to the above email address (with the keywords “[PhD/Postdoc/RA/Visitor/Collaboration application]” in your email subject). Prospective PhD students Please apply via the University application system here, and mention the PI’s name on your application.

2 minutes

Datasets

360+x: A Panoptic Multi-modal Scene Understanding Dataset CVPR, Dataset link: https://x360dataset.github.io/ 360+x dataset introduces a unique panoptic perspective to scene understanding, differentiating itself from existing datasets, by offering multiple viewpoints and modalities, captured from a variety of scenes. For more details please refer to the paper DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling CVPR, Dataset link: https://pku-dymvhumans.github.io/ This is a versatile human-centric dataset for high-fidelity reconstruction and rendering of dynamic human scenarios from dense multi-view videos.

2 minutes

Projects

PCo3D: Physically Plausible Controllable 3D Generative Models Amazon Research Award, PI, with Aleš Leonardis Generative AI has shown remarkable performance across various applications involving content generation, showcasing its potential in both academic research and industrial settings. While its effectiveness in generating images and videos is well-established, there exists a notable gap when it comes to 3D content creation, particularly in the consideration of physical properties during the generation process. Another gap is the controllability of the physics-aware generation.

5 minutes

Publications

*Equal contribution Revisit the Open Nature of Open Vocabulary Semantic Segmentation Qiming Huang, Han Hu, Jianbo Jiao International Conference on Learning Representations (ICLR), 2025 [PDF] [BibTeX] [arXiv] [Short Video Intro] [Project Page and Code] CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation Kai Fang, Anqi Zhang, Guangyu Gao, Jianbo Jiao, Chi Harold Liu, Yunchao Wei IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [PDF] [BibTeX] [arXiv] [Project Page and Code] Transformer-Based Models for Predicting Molecular Structures from Infrared Spectra Using Patch-Based Self-Attention Wenjin Wu, Aleš Leonardis, Jianbo Jiao, Jun Jiang, Linjiang Chen The Journal of Physical Chemistry A, 2025 [Paper] [BibTeX] [arXiv] [Project Page and Code] PRFormer: Matching Proposal and Reference Masks by Semantic and Spatial Similarity for Few-Shot Semantic Segmentation Guangyu Gao, Anqi Zhang, Jianbo Jiao, Chi Harold Liu, Yunchao Wei IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), 2025 [PDF] [BibTeX] [Code] Out-of-Clinical-Distribution Detection with a Softmax-Conditioned Variational Autoencoder Regulariser: Application to Fetal Ultrasound Kangning Zhang, Jianbo Jiao, Alison Noble International Conference on Medical Imaging and Computer-Aided Diagnosis (MICAD), Oral Presentation, &nbsp&nbspBest Paper Award, 2024 [PDF] [BibTeX] [Project Page] Few Exemplar-Based General Medical Image Segmentation via Domain-Aware Selective Adaptation Chen Xu*, Qiming Huang*, Yuqi Hou, Jiangxing Wu, Fan Zhang, Hyung Jin Chang, Jianbo Jiao Asian Conference on Computer Vision (ACCV), 2024 [PDF] [BibTeX] [arXiv] [Project Page] Bridge the Points: Graph-based Few-shot Segment Anything Semantically Anqi Zhang, Guangyu Gao, Jianbo Jiao, Chi Harold Liu, Yunchao Wei Annual Conference on Neural Information Processing Systems (NeurIPS), 2024 [PDF] [BibTeX] [arXiv] [Project Page] Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis Rui Peng, Wangze Xu, Luyang Tang, Liwei Liao, Jianbo Jiao, Ronggang Wang Annual Conference on Neural Information Processing Systems (NeurIPS), 2024 [PDF] [BibTeX] [arXiv] [Project Page] Disentangled Generation and Aggregation for Robust Radiance Fields Shihe Shen*, Huachen Gao*, Wangze Xu, Rui Peng, Luyang Tang, Kaiqiang Xiong, Jianbo Jiao, Ronggang Wang European Conference on Computer Vision (ECCV), 2024 [PDF] [BibTeX] [arXiv] [Project Page] MVPGS: Excavating Multi-view Prior for Gaussian Splatting from Sparse Input Views Wangze Xu, Huachen Gao, Shihe Shen, Rui Peng, Jianbo Jiao, Ronggang Wang European Conference on Computer Vision (ECCV), 2024 [PDF] [BibTeX] [arXiv] [Project Page] Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction Rui Peng, Shihe Shen, Kaiqiang Xiong, Huachen Gao, Jianbo Jiao, Xiaodong Gu, Ronggang Wang European Conference on Computer Vision (ECCV), 2024 [PDF] [BibTeX] [arXiv] [Code] Show from Tell: Audio-Visual Modelling in a Clinical Setting Jianbo Jiao*, Mohammad Alsharid*, Lior Drukker, Aris T.

6 minutes

Team Members

Jianbo Jiao Principle Investigator Jianbo is an Assistant Professor in the School of Computer Science at the University of Birmingham, a former Royal Society Short Industry Fellow, and a visiting researcher at the University of Oxford. Isaac Akintaro PhD Student (2024 -), MI v Isaac is a first-year PhD student in Computer Science at the University of Birmingham. His research focuses on Visual Reasoning, with a background in AI and Machine Learning.

4 minutes