Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
Recognize Any Regions
Self-Supervised Visual-Tactile Representation Learning via Multimodal Contrastive Training
Freeze the Backbone: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training. MSc AI thesis at Imperial College London.
Mini-batch selective sampling for knowledge adaptation of VLMs for mammography.