This is my implementation of the CLIP (Contrastive Language-Image Pretraining) model.
- `clip_deployment/`: Contains a Gradio app that lets you upload an image and returns the top 5 matching captions from a pre-stored caption list.
- `CLIP_training.ipynb`: A Jupyter notebook that implements the entire training process of the CLIP model.
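
The notebook follows the usual CLIP recipe: encode image-caption batches into a shared embedding space and train with a symmetric contrastive loss. As a rough illustration only (the stand-in linear "encoders", dimensions, and temperature value below are assumptions, not the notebook's actual code), one training step looks roughly like this:

```python
# Minimal sketch of a CLIP-style contrastive training step.
# The linear layers are stand-ins for the real image/text encoders.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyCLIP(nn.Module):
    def __init__(self, image_dim=2048, text_dim=768, embed_dim=256):
        super().__init__()
        self.image_proj = nn.Linear(image_dim, embed_dim)      # stand-in image encoder
        self.text_proj = nn.Linear(text_dim, embed_dim)        # stand-in text encoder
        self.logit_scale = nn.Parameter(torch.tensor(2.659))   # learnable log-temperature

    def forward(self, image_feats, text_feats):
        # Project both modalities into the shared space and L2-normalize.
        img = F.normalize(self.image_proj(image_feats), dim=-1)
        txt = F.normalize(self.text_proj(text_feats), dim=-1)
        # Pairwise cosine similarities, scaled by the temperature.
        return self.logit_scale.exp() * img @ txt.t()

def clip_loss(logits):
    # Matching image/caption pairs sit on the diagonal, so the target
    # for row i (and column i) is class i; the loss is symmetric.
    targets = torch.arange(logits.size(0), device=logits.device)
    return (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets)) / 2

# One training step on a random batch of pre-extracted features.
model = ToyCLIP()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
image_feats = torch.randn(8, 2048)   # e.g. vision-backbone features for 8 images
text_feats = torch.randn(8, 768)     # e.g. text-encoder features for the 8 captions
loss = clip_loss(model(image_feats, text_feats))
loss.backward()
optimizer.step()
```
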
- Training: Open and run the `CLIP_training.ipynb` notebook to train the CLIP model on your dataset.
- Deployment: Inside the `clip_deployment` folder, launch the Gradio app to interact with the trained model by uploading images and getting the most relevant captions.
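
At a high level, the app embeds the uploaded image with the trained model, compares it against pre-computed embeddings of the stored captions, and returns the five most similar ones. A hedged sketch of that flow (the `get_image_embedding` helper, the caption store, and the random embeddings below are placeholders, not the repo's actual code) could look like:

```python
# Hypothetical sketch of the caption-retrieval Gradio app; the embedding
# helper and caption store are placeholders for the repo's own code.
import gradio as gr
import torch
import torch.nn.functional as F

captions = ["a dog running on the beach", "a plate of pasta", "a city skyline at night"]
# In the real app these would come from the trained CLIP text encoder.
caption_embeddings = F.normalize(torch.randn(len(captions), 256), dim=-1)

def get_image_embedding(image):
    # Placeholder: the real app would preprocess `image`, run the trained
    # image encoder, and L2-normalize the result.
    return F.normalize(torch.randn(1, 256), dim=-1)

def top_captions(image):
    img_emb = get_image_embedding(image)
    sims = (img_emb @ caption_embeddings.t()).squeeze(0)   # cosine similarities
    k = min(5, len(captions))
    best = sims.topk(k).indices.tolist()
    return "\n".join(captions[i] for i in best)

demo = gr.Interface(fn=top_captions, inputs=gr.Image(type="pil"), outputs="text")

if __name__ == "__main__":
    demo.launch()
```

Launching the app starts a local web UI where you can drop in an image and read back the top matching captions.
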
Feel free to explore and experiment with this project!