Skip to content
#

multi-modal

Here are 61 public repositories matching this topic...

nilsnolde
nilsnolde commented May 25, 2021

Would be nice to tell a user when certain things happen while processing, e.g. clamping values to their allowed maximum, enabling destination_only roads in 2. pass, deprecating request parameters etc.

IMO it should be a top-level warnings array of strings which is empty by default and all modules can add their share during processing: probably implement the most obvious ones first and add o

Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention | Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2021

  • Updated May 3, 2021
  • Python

The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and related datasets. Additionally, it also collects many useful tutorials and tools in these related domains.

  • Updated Aug 18, 2021

Improve this page

Add a description, image, and links to the multi-modal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-modal topic, visit your repo's landing page and select "manage topics."

Learn more