Multi-Modal-Comparators
Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP
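A minimal sketch of the kind of unified wrapper this aims at. The `Perceptor` and `load_perceptor` names below are illustrative only, not this package's actual API; the example backend wraps OpenAI CLIP.

```python
# Illustrative only: `Perceptor` and `load_perceptor` are hypothetical names,
# not this package's actual API.
from dataclasses import dataclass
from typing import Callable

import torch


@dataclass
class Perceptor:
    """Everything needed to use one pretrained image/text 'perceptor' uniformly."""
    name: str
    model: torch.nn.Module   # the underlying network
    preprocess: Callable     # PIL image -> tensor transform
    tokenize: Callable       # list[str] -> token tensor

    def encode_image(self, images: torch.Tensor) -> torch.Tensor:
        return self.model.encode_image(images)

    def encode_text(self, tokens: torch.Tensor) -> torch.Tensor:
        return self.model.encode_text(tokens)


def load_perceptor(name: str = "ViT-B/32", device: str = "cpu") -> Perceptor:
    """Example backend: wrap OpenAI CLIP behind the uniform interface."""
    import clip  # pip install clip-anytorch
    model, preprocess = clip.load(name, device=device)
    return Perceptor(name=name, model=model, preprocess=preprocess, tokenize=clip.tokenize)
```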
## installable

- [ ] https://github.com/salesforce/LAVIS
  - https://github.com/salesforce/BLIP
  - https://github.com/salesforce/ALBEF
- [ ] https://github.com/facebookresearch/multimodal
  - FLAVA
  - LateFusion
  - ALBEF
  - MDETR
  - OMNIVORE
  - video-gpt
- [ ] https://github.com/ai-forever/ru-clip ...
How to:
* get the tokenizer and preprocessor for a given clip
* get the visual and textual encoder separately
https://github.com/archinetai/surgeon-pytorch
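For reference, here is how those pieces are reached with the upstream OpenAI CLIP package (open_clip is similar); note that the text tower is spread across several attributes, which is part of what a unified API would smooth over.

```python
import clip   # OpenAI CLIP, or the pip-installable fork clip-anytorch
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

# load() returns the full model plus the image preprocessor (a torchvision transform)
model, preprocess = clip.load("ViT-B/32", device=device)

# the tokenizer is a module-level function, not an attribute of the model
tokens = clip.tokenize(["a photo of a dog"]).to(device)

# the visual tower is a single submodule...
visual_encoder = model.visual  # nn.Module for images

# ...but the text tower is spread across model.transformer, model.token_embedding,
# model.positional_embedding, model.ln_final and model.text_projection, so
# encode_text() is the practical entry point for it
with torch.no_grad():
    text_features = model.encode_text(tokens)
```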
I know hardcoding it came from me, but while gradient checkpointing saves a lot of VRAM (at some speed cost) and is very useful in some use-cases, it can break things on A100...
I suspect this is the issue with clip-fa too
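A minimal sketch of making checkpointing opt-in rather than hardcoded; the `MaybeCheckpointed` wrapper and `use_checkpoint` flag are illustrative names, not this package's actual option.

```python
import torch
from torch import nn
from torch.utils.checkpoint import checkpoint


class MaybeCheckpointed(nn.Module):
    """Wrap a block so activation checkpointing is an opt-in flag
    instead of being hardcoded on (illustrative names)."""

    def __init__(self, block: nn.Module, use_checkpoint: bool = False):
        super().__init__()
        self.block = block
        self.use_checkpoint = use_checkpoint

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if self.use_checkpoint and self.training and x.requires_grad:
            # recompute activations in the backward pass to save VRAM
            return checkpoint(self.block, x, use_reentrant=False)  # recent PyTorch
        return self.block(x)
```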
Hi, as part of our package for easily evaluating CLIP models (https://github.com/LAION-AI/CLIP_benchmark/issues/1) and my inference lib (https://github.com/rom1504/clip-retrieval), I'm interested in having a package like this; however, here is what's missing...
`pip install clip-anytorch` https://github.com/rom1504/CLIP
Rather than an ambiguous `_model` attribute, let's just attach an attribute that returns the objects that would be returned from the native "load" function of the particular model implementation the...
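A minimal sketch of that idea; the attribute name (`native`) and the loader plumbing are illustrative, not the package's actual implementation.

```python
import clip


class ClipWrapper:
    """Wrap an OpenAI-CLIP-style model while keeping the native load() outputs reachable."""

    def __init__(self, name: str = "ViT-B/32", device: str = "cpu"):
        # keep exactly what the native loader returned, as a tuple
        self._native = clip.load(name, device=device)  # (model, preprocess)
        self.model, self.preprocess = self._native

    @property
    def native(self):
        """The objects the underlying implementation's own load() would return,
        in the same order, instead of an ambiguous _model attribute."""
        return self._native


# usage: recover the native objects exactly as the upstream loader hands them out
wrapper = ClipWrapper()
model, preprocess = wrapper.native
```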