TinyCLIP integration for ActionCLIP #47

Open
FransHk wants to merge 4 commits into sallymmx:master from FransHk:TinyCLIP

@FransHk FransHk commented Dec 6, 2023

This PR integrates two TinyCLIP ViT models into the existing model framework with minimal changes. This is possible because TinyCLIP, like CLIP, provides a pure ViT-based model. TinyCLIP is a distillation of CLIP that runs significantly faster while retaining, and in some cases improving on, CLIP's zero-shot ImageNet-1K (IN1K) accuracy. To accommodate the integration, this PR adds a small state_dict conversion helper method and an optional flag that skips the sha256 check.
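As a rough illustration of what a state_dict conversion helper can look like, the sketch below renames checkpoint keys by stripping a wrapper prefix. It is not the helper from this PR: the function name `convert_state_dict` and the `module.` prefix are assumptions chosen for the example, and the actual key differences between TinyCLIP and CLIP checkpoints may be different.

```python
from collections import OrderedDict
from typing import Any, Mapping


def convert_state_dict(state_dict: Mapping[str, Any],
                       prefix: str = "module.") -> "OrderedDict[str, Any]":
    """Return a copy of `state_dict` with `prefix` stripped from matching keys.

    Hypothetical helper: the prefix is an assumption for illustration only.
    Keys without the prefix are copied unchanged, and key order is preserved.
    """
    converted = OrderedDict()
    for key, value in state_dict.items():
        # Drop the wrapper prefix so the key matches the target model's naming.
        new_key = key[len(prefix):] if key.startswith(prefix) else key
        converted[new_key] = value
    return converted
```

The converted mapping could then be passed to the model's `load_state_dict`. The input mapping is left untouched, so the original checkpoint can still be inspected if loading fails.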

The TinyCLIP paper

The TinyCLIP models (Git)

The graphs below give a rough indication of ActionCLIP's behavior during training on HMDB51 (no pre-training). Train step indicates the number of batches processed per minute of wall-clock time. The TinyCLIP-based ActionCLIP model trains much faster, while its performance is close to that of the vanilla CLIP-based model.
[Image: wandb_comparison]
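For reference, a batches-per-minute figure like the train step metric described above can be derived from a batch count and the elapsed wall-clock time. The function below is a minimal sketch with a hypothetical name. It is not taken from the PR, and the logged metric may be computed differently.

```python
def batches_per_minute(num_batches: int, elapsed_seconds: float) -> float:
    """Convert a batch count over a wall-clock interval into batches per minute."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_batches * 60.0 / elapsed_seconds
```

In a training loop, `elapsed_seconds` would typically be the difference between two `time.perf_counter()` readings taken around the batches being counted.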

@FransHk FransHk marked this pull request as ready for review December 6, 2023 10:36
Relic from earlier integration.
Updated all .yaml files to include novel architectures. Skip checksum only on TinyCLIP
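The checksum skip mentioned in the commit above could look roughly like the sketch below. It assumes the loader compares a downloaded file's sha256 digest against an expected value. The names `file_sha256`, `check_download`, and `skip_checksum` are hypothetical and are not the identifiers used in this PR.

```python
import hashlib


def file_sha256(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the sha256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_download(path: str, expected_sha256: str,
                   skip_checksum: bool = False) -> None:
    """Raise RuntimeError on a digest mismatch unless the check is skipped.

    `skip_checksum` is an assumed opt-in flag, intended only for checkpoints
    for which no expected digest is available.
    """
    if skip_checksum:
        return
    actual = file_sha256(path)
    if actual != expected_sha256:
        raise RuntimeError(
            f"sha256 mismatch for {path}: expected {expected_sha256}, got {actual}"
        )
```

Keeping the flag off by default means existing CLIP checkpoints are still verified, and only models that explicitly opt out bypass the check.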