cross-modal-learning

Here are 30 public repositories matching this topic...

KimMeen / Time-LLM

[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

machine-learning deep-learning time-series language-model time-series-analysis time-series-forecast time-series-forecasting multimodal-deep-learning cross-modality multimodal-time-series cross-modal-learning prompt-tuning large-language-models

Updated Oct 15, 2025
Python

MohamedAfham / CrossPoint

Star

Official implementation of "CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding" (CVPR, 2022)

deep-learning point-cloud transfer-learning unsupervised-learning 3d-point-clouds object-classification few-shot-learning self-supervised-learning cross-modal-learning

Updated Apr 27, 2023
Python

whwu95 / Cap4Video

Star

【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

video-understanding cross-modal-learning video-text-retrieval video-language-understanding

Updated Nov 29, 2024
Python

whwu95 / Text4Vis

Star

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

transfer-learning action-recognition video-understanding video-recognition cross-modal-learning

Updated May 30, 2024
Python

whwu95 / BIKE

Star

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

action-recognition video-understanding video-recognition cross-modal-learning video-language-understanding

Updated Sep 9, 2024
Python

Toytiny / CMFlow

Star

[CVPR 2023 Highlight 💡] Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision

deep-learning optical-flow autonomous-driving mobile-robotics motion-segmentation scene-flow cross-modal-learning 4d-radar automotive-radar ego-motion-estimation

Updated Jul 17, 2023
Python

choyingw / Cross-Modal-Perceptionist

Star

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

machine-learning computer-vision deep-learning speech pytorch speech-synthesis biometrics cognitive-science 3d cvpr 3d-models 3dmm speech-to-face cross-modal-learning cvpr2022

Updated Dec 11, 2024
Python

RunpeiDong / ACT

Star

[ICLR 2023] Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?

representation-learning 3d-point-clouds self-supervised-learning cross-modal-learning

Updated Jul 1, 2024
Python

WinfredGe / T2S

Star

[IJCAI 2025] Official implementation of "T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models"

machine-learning deep-learning time-series language-model time-series-analysis multimodal-deep-learning cross-modality multimodal-time-series cross-modal-learning time-series-generation

Updated Oct 15, 2025
Python

mako443 / Text2Pos-CVPR2022

Star

Code, dataset and models for our CVPR 2022 publication "Text2Pos"

nlp computer-vision localization deep-learning pytorch cross-modal cvpr language-processing cross-modal-retrieval cross-modal-learning cvpr2022

Updated Jun 17, 2022
Python

knightyxp / DGL

Star

[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.

cross-modal-retrieval cross-modal-learning video-text-retrieval prompt-tuning parameter-efficient-tuning video-language-understanding

Updated Oct 14, 2024
Python

GaochangWu / FMF-Benchmark

Star

This is a cross-modal benchmark for industrial anomaly detection.

transformer industrial vit anomaly-detection multimodal anomaly-segmentation cross-modal-learning

Updated Aug 12, 2025
Python

StarMoonWang / SeisMoLLM

Star

Official Pytorch Implementation of SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model

pytorch back-azimuth phase-picking stead cross-modal-learning earthquake-magnitude ai4science diting fine-tuning-llm first-motion-polarity earthquake-monitoring

Updated Dec 4, 2025
Python

ospanbatyr / sample-efficient-multimodality

Star

Code for the "Sample-efficient Integration of New Modalities into Large Language Models" paper

multimodal-learning hypernetworks cross-modal-learning foundation-models sample-efficiency data-efficiency

Updated Sep 8, 2025
Python

frank-chris / ImageTextRetrieval

Star

In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Projection Learning model and study their performance. We also propose a modified Deep Cross-Modal Projection Learning model that uses a different image feature extractor. We evaluate the model’s performance on im…

flask tensorflow pytorch cross-modal-retrieval cross-modal-learning image-text-retrieval

Updated Aug 23, 2021
Jupyter Notebook

Markin-Wang / CAMANet

Star

[IJBHI 2024] This is the official implementation of CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation accepted to IEEE Journal of Biomedical and Health Informatics (J-BHI), 2023.

cross-modal-learning medical-report-generation radiology-report-generation

Updated May 14, 2025
Python

verlab / StraightToThePoint_CVPR_2020

Star

Original PyTorch implementation of the code for the paper "Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual Data" at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020

agent reinforcement-learning computer-vision video-summarization video-processing cvpr video-analysis multimodal-learning hyperlapse fast-forward vision-and-language multimodal-deep-learning video-fast-forward text-and-image cross-modal-learning

Updated Mar 26, 2022
Python

IGITUGraz / MemoryDependentComputation

Star

Code for Limbacher, T., Özdenizci, O., & Legenstein, R. (2022). Memory-enriched computation and learning in spiking neural networks through Hebbian plasticity. arXiv preprint arXiv:2205.11276.

python reinforcement-learning recurrent-neural-networks neural-networks question-answering spiking-neural-networks one-shot-learning babi-tasks associations hebbian-learning memory-networks pythorch cross-modal-learning

Updated Mar 28, 2023
Python

codiceSpaghetti / T4SA-2.0

Star

This project creates the T4SA 2.0 dataset, i.e. a big set of data to train visual models for Sentiment Analysis in the Twitter domain using a cross-modal student-teacher approach.

nlp computer-vision dataset-creation twitter-sentiment-analysis cross-modal-learning student-teacher-learning

Updated May 30, 2025
Jupyter Notebook

PrithivirajDamodaran / WhatTheFood

Sponsor

Star

An intentionally simple Image to Food cross-modal search. Created by Prithiviraj Damodaran.

cross-modal multimodal cross-modal-retrieval cross-modal-learning

Updated Nov 8, 2021

Improve this page

Add a description, image, and links to the cross-modal-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cross-modal-learning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cross-modal-learning

Here are 30 public repositories matching this topic...

KimMeen / Time-LLM

MohamedAfham / CrossPoint

whwu95 / Cap4Video

whwu95 / Text4Vis

whwu95 / BIKE

Toytiny / CMFlow

choyingw / Cross-Modal-Perceptionist

RunpeiDong / ACT

WinfredGe / T2S

mako443 / Text2Pos-CVPR2022

knightyxp / DGL

GaochangWu / FMF-Benchmark

StarMoonWang / SeisMoLLM

ospanbatyr / sample-efficient-multimodality

frank-chris / ImageTextRetrieval

Markin-Wang / CAMANet

verlab / StraightToThePoint_CVPR_2020

IGITUGraz / MemoryDependentComputation

codiceSpaghetti / T4SA-2.0

PrithivirajDamodaran / WhatTheFood

Improve this page

Add this topic to your repo