Profile Picture
  • All
  • Search
  • Images
  • Videos
    • Shorts
  • Maps
  • News
  • More
    • Shopping
    • Flights
    • Travel
  • Notebook
Report an inappropriate content
Please select one of the options below.
NB Films
NVIDIA
F-35
Monitor
Samsung
Gaming Screen
BenQ
MK Dons
TVA Live
Banned Books
Acer Nitro 5
NB Films Latest
Dell Earbuds
16GB RAM Computer
Uwtbg
German Banned
Ai Bubbles
MSI
Monitor
How to Transfer Photos From Phone to PC
Gaming
Monitor
PCIe Switch
Sentul
1000 Ants vs Obstacles
Abdelhamid T Game GTA 5
1000 Ants vs 1000 Cockroaches
BenQ News
Bubble Ai
Hamid T Gaming GTA 5
King 5 News Live Stream
Samsung
Monitors
Gaming
Monitors
  • Length
    AllShort (less than 5 minutes)Medium (5-20 minutes)Long (more than 20 minutes)
  • Date
    AllPast 24 hoursPast weekPast monthPast year
  • Resolution
    AllLower than 360p360p or higher480p or higher720p or higher1080p or higher
  • Source
    All
    Dailymotion
    Vimeo
    Metacafe
    Hulu
    VEVO
    Myspace
    MTV
    CBS
    Fox
    CNN
    MSN
  • Price
    AllFreePaid
  • Clear filters
  • SafeSearch:
  • Moderate
    StrictModerate (default)Off
Filter
    NB Films
    NVIDIA
    F-35
    Monitor
    Samsung
    Gaming Screen
    BenQ
    MK Dons
    TVA Live
    Banned Books
    Acer Nitro 5
    NB Films Latest
    Dell Earbuds
    16GB RAM Computer
    Uwtbg
    German Banned
    Ai Bubbles
    MSI
    Monitor
    How to Transfer Photos From Phone to PC
    Gaming
    Monitor
    PCIe Switch
    Sentul
    1000 Ants vs Obstacles
    Abdelhamid T Game GTA 5
    1000 Ants vs 1000 Cockroaches
    BenQ News
    Bubble Ai
    Hamid T Gaming GTA 5
    King 5 News Live Stream
    Samsung
    Monitors
    Gaming
    Monitors
You now convert any LLM into a faster one without retraining from scratch.NVIDIA just did this to their 30B model. Here's the trick:1. Duplicate the model into two copies2. Freeze one copy, it just reads the prompt and remembers context3. Train the other copy to write chunks of text at once instead of one word at a time4. Run them togetherThe frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.Result: 2.4x fa
0:13
You now convert any LLM into a faster one without retraining from scratch.NVIDIA just did this to their 30B model. Here's the trick:1. Duplicate the model into two copies2. Freeze one copy, it just reads the prompt and remembers context3. Train the other copy to write chunks of text at once instead of one word at a time4. Run them togetherThe frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.Result: 2.4x fa
103.4K views1 day ago
x.comLior Alexander
See more
Static thumbnail place holder
More like this
You may also want to search
NVIDIA System Monitor
Setting Up Two Monitors NVIDIA
NVIDIA Activate 3rd Monitor
How to Set Up Multiple Monitor with NVIDIA
NVIDIA Control Panel Monitor Technology
NVIDIA Recording Wrong Monitor
NVIDIA Control Panel Triple Monitors
NVIDIA Multi-Monitor
Flickering Pixels On Monitors for NVIDIA GPU
Connect 2 Monitors to NVIDIA Graphics Card
Set Up Dual Monitors NVIDIA Graphics Card
How to Connect 4 Monitors to NVIDIA GeForce GTX 1660Ti Graphics 6GB
  • Privacy
  • Terms