13:57
Understand Local Outlier Factor (LOF) | Master LOF | LOF Coding
356 views
1 month ago
YouTube
EduMentor Deepti
7:11
LOF (Local Outlier Factor) Quickly Explained
12.6K views
Nov 3, 2022
YouTube
R LH
2:15:13
Reinforcement Learning from Human Feedback explained with math derivations and the PyTorch code.
67.1K views
Feb 27, 2024
YouTube
Umar Jamil
28:53
Fine-tuning LLMs on Human Feedback (RLHF + DPO)
23.4K views
Mar 3, 2025
YouTube
Shaw Talebi
9:33
How Reinforcement Learning Algorithms Work - A High Level Overview
3.4K views
Dec 28, 2021
YouTube
Dibya Chakravorty
14:10
#277 Scaling Laws for Dense Retrieval
171 views
7 months ago
YouTube
Data Science Gems
23:02
Local Outlier Factor Decoded | k-Distance, Reachability & Density
1 month ago
YouTube
EduMentor Deepti
51:06
How to finetune LLMs to THINK with Reinforcement Learning (GRPO from scratch!)
26.1K views
10 months ago
YouTube
Neural Breakdown with AVB
39:21
What is the Simplest RL Algorithm That Matches GRPO ? | RAFT + Reinforce-Rej
990 views
2 months ago
YouTube
Deep Learning with Yacine
1:26
How AI is Actually Trained (DPO vs RLHF Explained in 85s)
776 views
4 weeks ago
YouTube
Code With K5KC
1:27:21
RLHF, PPO and DPO for Large language models
3.7K views
Feb 18, 2024
YouTube
Arvind N
1:20:54
LLM Alignment (RLHF, DPO, ORPO) + Hands-on Project
11K views
6 months ago
YouTube
BrainOmega
38:49
Haystack EU 2023 - Philipp Krenn: Reciprocal Rank Fusion (RRF) - How to Stop Worrying about Boosting
2.8K views
Oct 5, 2023
YouTube
OpenSource Connections
3:14:37
RLHF from scratch, step-by-step, in code
2.8K views
11 months ago
YouTube
Ashwani Kumar
36:14
How to Code RLHF on LLama2 w/ LoRA, 4-bit, TRL, DPO
16.9K views
Aug 31, 2023
YouTube
Discover AI
49:31
Recursive Language Models (RLMs) - Let's build the coolest agents ever! (Theory & Code)
20.5K views
2 months ago
YouTube
Neural Breakdown with AVB
1:08:21
State of LLMs 2026: RLVR, GRPO, Inference Scaling — Sebastian Raschka
17.9K views
3 months ago
YouTube
The MAD Podcast with Matt Turck
6:31
RFLP Explained | Restriction Fragment Length Polymorphism Technique for Beginners
83.8K views
May 28, 2023
YouTube
Biology Lectures
26:00
How to Fine-tune LLMs with RLVR (OpenAI’s RFT API)
2K views
3 months ago
YouTube
Shaw Talebi
23:02
Rubrics as Rewards: A Technical Guide to DPO, RaR, RLVR, GPRO and LLM Model Alignment. Unsloth RL.
148 views
2 months ago
YouTube
Byte Goose AI.
5:27
How AI Models Are Tuned to Follow Instructions : RLHF vs DPO
27 views
4 months ago
YouTube
AI Strategy & Trends
17:43
[RL Fine-Tuning] From RLHF to GRPO: The Evolution and Optimization of AI LLM Models Alignment.
365 views
4 months ago
YouTube
Byte Goose AI.
18:36
Deep Dive: RLVR, GRPO & The End of Spurious AI Logic
67 views
3 months ago
YouTube
DeepCombinator
19:39
RLHF Explained (and DPO!)
18K views
Jun 12, 2024
YouTube
Mark Hennings
3:44
RFLP | Restriction Fragment Length Polymorphism
375.6K views
Jun 18, 2020
YouTube
Quick Biochemistry Basics
1:12:55
Density Matrices | Understanding Quantum Information & Computation | Lesson 09
25.1K views
Mar 27, 2024
YouTube
Qiskit
9:09
DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?
84.5K views
Jan 24, 2025
YouTube
AI Papers Academy
18:51
Line of Balance (LOB) Analysis | Numerical Solved Step-by-Step | Crew Optimization & RoO Explained
212 views
3 weeks ago
YouTube
Jay Brahmbhatt
1:18:00
RLHF Explained & Coded (feat. PPO)
310 views
9 months ago
YouTube
AIArchives
1:16:38
LLM Marathon series : PPO vs DPO: Understanding RLHF and Large Language Models
269 views
May 29, 2024
YouTube
Lingo Research Group, IITGN