Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning
21:15
27.3K views · Jun 21, 2024
YouTube · Serrano.Academy
Direct Preference Optimization (DPO) explained: Bradley-Terry model, log probabilities, math
48:46
24.7K views · Apr 14, 2024
YouTube · Umar Jamil
Reinforcement Learning, RLHF, & DPO Explained
19:39
13.3K views · Jun 12, 2024
YouTube · Mark Hennings
Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model Explained
36:25
18.9K views · Aug 10, 2023
YouTube · Gabriel Mongaras
Step-by-Step: Becoming a Data Protection Officer in the Digital Age
35:08
5.1K views · May 11, 2024
YouTube · INFOSEC TRAIN
DPO Pay by Network x Odoo: Levelling up digital payments in Africa
37:40
734 views · 5 months ago
YouTube · Odoo
7 Series DPO Overview
17:36
637 views · 3 months ago
YouTube · Tektronix
DPO Direct Preference Optimization Algorithm (Animated Explanation)
21:15
8K views · Oct 26, 2024
bilibili · 数源创域
Introduction to the Stock DPO Indicator and How to Use It
5:01
1.9K views · Oct 16, 2022
bilibili · 肃总
[DPO Derivative Algorithms Walkthrough, Part 1] r2Q*, Step-DPO, RTO, TDPO, S…
20:25
5.3K views · Nov 11, 2024
bilibili · 一心豆儿