- Shatin, N.T., HKSAR
- https://lixin4ever.github.io/
- @lixin4ever
Pinned Loading
-
DAMO-NLP-SG/VideoLLaMA3
DAMO-NLP-SG/VideoLLaMA3 PublicFrontier Multimodal Foundation Models for Image and Video Understanding
-
DAMO-NLP-SG/VideoLLaMA2
DAMO-NLP-SG/VideoLLaMA2 PublicVideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
-
DAMO-NLP-SG/CLEX
DAMO-NLP-SG/CLEX Public[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
-
DAMO-NLP-SG/VCD
DAMO-NLP-SG/VCD Public[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
-
DAMO-NLP-SG/Inf-CLIP
DAMO-NLP-SG/Inf-CLIP Public[CVPR 2025] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training scheme.
-
DAMO-NLP-SG/Video-LLaMA
DAMO-NLP-SG/Video-LLaMA Public[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
If the problem persists, check the GitHub status page or contact support.