Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
5
8
Minh Le Duc
minhleduc
Follow
0 followers
·
13 following
MinLee0210
minh-le-duc-a62863172
AI & ML interests
Edge Device-oriented Models (language, audio, ...)
Recent Activity
reacted
to
Kseniase
's
post
with 😎
9 days ago
12 Types of JEPA JEPA, or Joint Embedding Predictive Architecture, is an approach to building AI models introduced by Yann LeCun. It differs from transformers by predicting the representation of a missing or future part of the input, rather than the next token or pixel. This encourages conceptual understanding, not just low-level pattern matching. So JEPA allows teaching AI to reason abstractly. Here are 12 types of JEPA you should know about: 1. I-JEPA -> https://huggingface.co/papers/2301.08243 A non-generative, self-supervised learning framework designed for processing images. It works by masking parts of the images and then trying to predict those masked parts 2. MC-JEPA -> https://huggingface.co/papers/2307.12698 Simultaneously interprets video data - dynamic elements (motion) and static details (content) - using a shared encoder 3. V-JEPA -> https://huggingface.co/papers/2404.08471 Presents vision models trained by predicting future video features, without pretrained image encoders, text, negative sampling, or reconstruction 4. UI-JEPA -> https://huggingface.co/papers/2409.04081 Masks unlabeled UI sequences to learn abstract embeddings, then adds a fine-tuned LLM decoder for intent prediction. 5. Audio-based JEPA (A-JEPA) -> https://huggingface.co/papers/2311.15830 Masks spectrogram patches with a curriculum, encodes them, and predicts hidden representations. 6. S-JEPA -> https://huggingface.co/papers/2403.11772 Signal-JEPA is used in EEG analysis. It adds a spatial block-masking scheme and three lightweight downstream classifiers 7. TI-JEPA -> https://huggingface.co/papers/2503.06380 Text-Image JEPA uses self-supervised, energy-based pre-training to map text and images into a shared embedding space, improving cross-modal transfer to downstream tasks Find more types below 👇 Also, explore the basics of JEPA in our article: https://www.turingpost.com/p/jepa If you liked it, subscribe to the Turing Post: https://www.turingpost.com/subscribe
reacted
to
Kseniase
's
post
with 🔥
9 days ago
12 Types of JEPA JEPA, or Joint Embedding Predictive Architecture, is an approach to building AI models introduced by Yann LeCun. It differs from transformers by predicting the representation of a missing or future part of the input, rather than the next token or pixel. This encourages conceptual understanding, not just low-level pattern matching. So JEPA allows teaching AI to reason abstractly. Here are 12 types of JEPA you should know about: 1. I-JEPA -> https://huggingface.co/papers/2301.08243 A non-generative, self-supervised learning framework designed for processing images. It works by masking parts of the images and then trying to predict those masked parts 2. MC-JEPA -> https://huggingface.co/papers/2307.12698 Simultaneously interprets video data - dynamic elements (motion) and static details (content) - using a shared encoder 3. V-JEPA -> https://huggingface.co/papers/2404.08471 Presents vision models trained by predicting future video features, without pretrained image encoders, text, negative sampling, or reconstruction 4. UI-JEPA -> https://huggingface.co/papers/2409.04081 Masks unlabeled UI sequences to learn abstract embeddings, then adds a fine-tuned LLM decoder for intent prediction. 5. Audio-based JEPA (A-JEPA) -> https://huggingface.co/papers/2311.15830 Masks spectrogram patches with a curriculum, encodes them, and predicts hidden representations. 6. S-JEPA -> https://huggingface.co/papers/2403.11772 Signal-JEPA is used in EEG analysis. It adds a spatial block-masking scheme and three lightweight downstream classifiers 7. TI-JEPA -> https://huggingface.co/papers/2503.06380 Text-Image JEPA uses self-supervised, energy-based pre-training to map text and images into a shared embedding space, improving cross-modal transfer to downstream tasks Find more types below 👇 Also, explore the basics of JEPA in our article: https://www.turingpost.com/p/jepa If you liked it, subscribe to the Turing Post: https://www.turingpost.com/subscribe
reacted
to
Kseniase
's
post
with 🧠
9 days ago
12 Types of JEPA JEPA, or Joint Embedding Predictive Architecture, is an approach to building AI models introduced by Yann LeCun. It differs from transformers by predicting the representation of a missing or future part of the input, rather than the next token or pixel. This encourages conceptual understanding, not just low-level pattern matching. So JEPA allows teaching AI to reason abstractly. Here are 12 types of JEPA you should know about: 1. I-JEPA -> https://huggingface.co/papers/2301.08243 A non-generative, self-supervised learning framework designed for processing images. It works by masking parts of the images and then trying to predict those masked parts 2. MC-JEPA -> https://huggingface.co/papers/2307.12698 Simultaneously interprets video data - dynamic elements (motion) and static details (content) - using a shared encoder 3. V-JEPA -> https://huggingface.co/papers/2404.08471 Presents vision models trained by predicting future video features, without pretrained image encoders, text, negative sampling, or reconstruction 4. UI-JEPA -> https://huggingface.co/papers/2409.04081 Masks unlabeled UI sequences to learn abstract embeddings, then adds a fine-tuned LLM decoder for intent prediction. 5. Audio-based JEPA (A-JEPA) -> https://huggingface.co/papers/2311.15830 Masks spectrogram patches with a curriculum, encodes them, and predicts hidden representations. 6. S-JEPA -> https://huggingface.co/papers/2403.11772 Signal-JEPA is used in EEG analysis. It adds a spatial block-masking scheme and three lightweight downstream classifiers 7. TI-JEPA -> https://huggingface.co/papers/2503.06380 Text-Image JEPA uses self-supervised, energy-based pre-training to map text and images into a shared embedding space, improving cross-modal transfer to downstream tasks Find more types below 👇 Also, explore the basics of JEPA in our article: https://www.turingpost.com/p/jepa If you liked it, subscribe to the Turing Post: https://www.turingpost.com/subscribe
View all activity
Organizations
minhleduc
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
5 datasets
3 months ago
Achilles822/viedata82
Viewer
•
Updated
Jul 26, 2025
•
4.75k
•
123
•
2
Achilles822/vietnamese-audio-dataset
Viewer
•
Updated
Jul 13, 2025
•
4.71k
•
146
•
3
hynt/ZipVoice-Vietnamese-2500h-Features
Updated
Aug 8, 2025
•
114
•
4
hungkieu/vietnamese-voices
Viewer
•
Updated
Jun 25, 2025
•
144
•
100
•
1
dolly-vn/dolly-audio-1000h-vietnamese
Viewer
•
Updated
Nov 24, 2025
•
664k
•
2.88k
•
50
liked
a model
7 months ago
dangvantuan/vietnamese-embedding
Sentence Similarity
•
0.1B
•
Updated
Oct 23, 2025
•
15.8k
•
•
53
liked
a dataset
7 months ago
8Opt/vietnamese-summarization-dataset-0001
Viewer
•
Updated
Oct 26, 2025
•
19.5k
•
98
•
3
liked
a dataset
11 months ago
papluca/language-identification
Viewer
•
Updated
Jul 15, 2022
•
90k
•
1.67k
•
67