Tensor search for humans.
-
Updated
Mar 15, 2023 - Python
Tensor search for humans.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
CLIPort: What and Where Pathways for Robotic Manipulation
(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.
Tools for movie and video research
Vision-Language Transformer and Query Generation for Referring Segmentation (ICCV 2021 & TPAMI)
This repo is the official implementation of "LViT: Language meets Vision Transformer in Medical Image Segmentation" (Under Major Revision)
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
PyTorch code for BagFormer: Better Cross-Modal Retrieval via bag-wise interaction
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
MixGen: A New Multi-Modal Data Augmentation
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
[ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "
Add a description, image, and links to the vision-language topic page so that developers can more easily learn about it.
To associate your repository with the vision-language topic, visit your repo's landing page and select "manage topics."