Like Golden Gate Claude, but with a CLIP Vision Transformer ~ feature activation manipulation fun!
-
Updated
Jun 21, 2024 - Python
Like Golden Gate Claude, but with a CLIP Vision Transformer ~ feature activation manipulation fun!
This repo showcase the ENPM673: Perception for Autonomous robots final project. A vision transformer (ViT) architecture SegFormer, has been replicated for implementing semantic segmentation. Furthermore, it was deployed on raspberry pi with pi cam setup for validating the real-time performance.
Vision transformer and CNN implementations for image classification using PyTorch.
Code for the paper "Relating Implicit Bias and Adversarial Attacks through Intrinsic Dimension" [https://arxiv.org/abs/2305.15203] -- Now +CLIP!
Final project for the master's degree in Computer Science course "Advanced Machine Learning" (AML) at the University of Rome "La Sapienza" (A.Y. 2023-2024).
Deep Fake Detection using Vision Transformer and Neural Network
Experimental removal / shuffling of layers in CLIP ViT + Text Transformer
Repository for Master thesis project investigating classification of 3D chest CT scans using Vision Transformer.
DSMIL: Dual-stream multiple instance learning networks for tumor detection in Whole Slide Image
Add a description, image, and links to the visiontransformer topic page so that developers can more easily learn about it.
To associate your repository with the visiontransformer topic, visit your repo's landing page and select "manage topics."