MI X

The MIx Group @ University of Birmingham

3 minutes

an img

Machine Intelligence + X,

X =

We are the Machine Intelligence + x group at the School of Computer Science, University of Birmingham. Welcome!
Our group mainly studies machine learning and computer vision, and also interested in other applied machine learning problems including multimodal data, neuroscience, healthcare, physics, chemistry, to name a few. That is where the x lies in.

Key research interests:
  • Learning representations with limited human supervision, e.g. self-/semi-/weakly-supervised learning
  • Multimodal data processing and analysis, e.g. vision-language, vision-audio, etc.
  • Open-world problems, e.g. incremental learning, open-vocabulary visual understanding
  • Visual semantics understanding, e.g. semantic segmentation, saliency modelling
  • 3D problems, e.g. depth estimation, multi-view geometry, 3D generation
  • Healthcare, e.g. medical image understanding and analysis, explanable AI for healthcare
  • AI for science, including neuroscience, physics and chemistry

News #

Oct 2024: Very glad to receive the Best Paper Award and Best Presentation (Runner-Up) Award at the MICCAI 2024 ASMUS Workshop! Congrats to Kangning and the Team! Sep 2024: Two papers (1 Spotlight) accepted to NeurIPS 2024, and one paper accepted to ACCV 2024, congrats to all the co-authors! Sep 2024: Welcome the new PhD students Isaac, Hao, Peixi, and Haotian joining the group! Jul 2024: The paper "Show from Tell" is now published in Scientific Reports (Nature Portfolio)! Please check it out here: https://rdcu.be/dNcmb and here :) Jul 2024: Three papers accepted to ECCV 2024, two papers accepted to MICCAI Workshop and ACM MM 2024, congrats to all the co-authors! Mar 2024: Very grateful to be awarded the Amazon Research Award! Feb 2024: Two papers (the 360+x (Oral) multi-modal holistic scene understanding dataset, and DyMvHuman dynamic multiview dataset) are accepted to CVPR 2024;
and another two papers are accepted to ISBI 2024 (Oral) and T-IP. Congrats to all the co-authors! Oct 2023: Two papers (1 Oral 1 Poster) are accepted to WACV 2024, congrats to all the co-authors! Sep 2023: Grateful to be awarded the Royal Society Short Industry Fellowship! Aug 2023: One paper is accepted to IJCV. Congrats to all the co-authors! Jul 2023: Four papers are accepted to ICCV 2023. Congrats to all the co-authors (esp. the MSc students Hao and Chenyuan)! Apr 2023: Grateful to receive the International Exchanges Grant from The Royal Society! Apr 2023: Two papers are accepted to CVPR 2023 Workshops (Foundation Model and Sight and Sound) about self-supervised multi-modal (video-text-audio) representation learning Mar 2023: Two papers are accepted to ICLR 2023 workshops (TML4H and Neural Fields) about medical video quality assessment and neural representations in low-level vision. Congrats to Jong (PhD) and Wentian (MSc)! Oct 2022: Very glad to receive the Best Paper Award at the ECCV 2022 Workshop on Medical Computer Vision! Congrats to the PULSENet Team! Sep 2022: One paper is accepted to NeurIPS 2022 about continual learning Aug 2022: One paper is accepted to ECCV 2022 Workshop (ECCV-MCV) about anatomy-aware contrastive medical representation learning Feb 2022: Birthday of the MIx group @ the University of Birmingham






Collections

Sections

Contact and Join Us

Contact E-mail: mix.group.uk@gmail.com Join us We are always looking for people with strong self-motivation, unusual creativity, and passion for hard problems! If you share the same intetests and passion with us, please send your CV together with a short description (2 – 3 sentences) of your research interests to the above email address (with the keywords “[PhD/Postdoc/RA/Visitor/Collaboration application]” in your email subject). Prospective PhD students Please apply via the University application system here, and mention the PI’s name on your application.

2 minutes

Datasets

360+x: A Panoptic Multi-modal Scene Understanding Dataset CVPR, Dataset link: https://x360dataset.github.io/ 360+x dataset introduces a unique panoptic perspective to scene understanding, differentiating itself from existing datasets, by offering multiple viewpoints and modalities, captured from a variety of scenes. For more details please refer to the paper DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling CVPR, Dataset link: https://pku-dymvhumans.github.io/ This is a versatile human-centric dataset for high-fidelity reconstruction and rendering of dynamic human scenarios from dense multi-view videos.

2 minutes

Projects

PCo3D: Physically Plausible Controllable 3D Generative Models Amazon Research Award, PI, with Aleš Leonardis Generative AI has shown remarkable performance across various applications involving content generation, showcasing its potential in both academic research and industrial settings. While its effectiveness in generating images and videos is well-established, there exists a notable gap when it comes to 3D content creation, particularly in the consideration of physical properties during the generation process. Another gap is the controllability of the physics-aware generation.

5 minutes

Publications

*Equal contribution Disentangled Generation and Aggregation for Robust Radiance Fields Shihe Shen*, Huachen Gao*, Wangze Xu, Rui Peng, Luyang Tang, Kaiqiang Xiong, Jianbo Jiao, Ronggang Wang European Conference on Computer Vision (ECCV), 2024 [PDF] [BibTeX] [arXiv] [Project Page] MVPGS: Excavating Multi-view Prior for Gaussian Splatting from Sparse Input Views Wangze Xu, Huachen Gao, Shihe Shen, Rui Peng, Jianbo Jiao, Ronggang Wang European Conference on Computer Vision (ECCV), 2024 [PDF] [BibTeX] [arXiv] [Project Page] Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction Rui Peng, Shihe Shen, Kaiqiang Xiong, Huachen Gao, Jianbo Jiao, Xiaodong Gu, Ronggang Wang European Conference on Computer Vision (ECCV), 2024 [PDF] [BibTeX] [arXiv] [Code] Show from Tell: Audio-Visual Modelling in a Clinical Setting Jianbo Jiao*, Mohammad Alsharid*, Lior Drukker, Aris T.

5 minutes

Team Members

Jianbo Jiao Principle Investigator Jianbo is an Assistant Professor in the School of Computer Science at the University of Birmingham, a Royal Society Short Industry Fellow, and a visiting researcher at the University of Oxford. Isaac Akintaro PhD Student (2024 -), MI v Isaac is a first-year PhD student in Computer Science at the University of Birmingham. His research focuses on Visual Reasoning, with a background in AI and Machine Learning.

4 minutes