17 results
repo - https://github.com/GeeeekExplorer/nano-vllm/tree/main * Nano-vLLM is a simple, fast LLM server in ~1200 lines of Python ...
1,541 views
7 months ago
Speaker(s): Rehan Samaratunga My auto-tuning project aims to find the best settings for running large language models using ...
82 views
3 months ago
Run your locally hosted AI coding assistant in VSCode with the Continue extension, Ollama, Deepseek, Qwen or CodeLlama in less ...
71,614 views
11 months ago
Don't miss out! Join us at our next Flagship Conference: KubeCon + CloudNativeCon North America in Salt Lake City from ...
475 views
1 year ago
Everyone talks about NVIDIA when it comes to AI, but what if GPUs aren't the future? In this video, I break down why AI inference is ...
7,652 views
9 months ago
If you've ever worked on a Python project, you know how painful it can be to get all of the dependencies set up properly.
1,325 views
I battled my homelab machine cerebro against cloud machines with identical or better GPUs to see if my local setup is worth it or ...
3,435 views
10 months ago
Speaker(s): KEERTHI UDAYAKUMAR RAG apps save up to 60% of the cost compared to standard LLMs. But in this talk, I will tell ...
19 views
Lightning-Talk Track Speaker: Fog Dong Title: BentoML Senior Engineer, CNCF Ambassador, LFAPAC Evangelist, KubeVela ...
35 views
Unlock the complete Full Stack AI Skill Set you need to build, scale, and monetize intelligent systems — even if you're just starting ...
8 views
Blog - https://opensuperintelligencelab.com/blog/deepseek-sparse-attention/ DeepSeek V3 From Scratch (understand attention ...
2,117 views
Don't miss out! Join us at our next Flagship Conference: KubeCon + CloudNativeCon India in Hyderabad (August 6-7), and ...
121 views
In this video, I show you how to accelerate Transformer inference with Optimum, an open source library by Hugging Face, and ...
3,043 views
3 years ago
Paper - https://github.com/deepseek-ai/DeepSeek-OCR/blob/main/DeepSeek_OCR_paper.pdf Become AI Researcher & Train ...
4,988 views
This episode details a practical exercise focused on fine-tuning a language model to improve its reasoning capabilities using ...
2 views
LLMatic can be used as a drop-in replacement for OpenAI's API. In this video, I briefly introduce the project and demo some of its ...
995 views
2 years ago
In this video, I show you how to accelerate Transformer training and inference with the Hugging Face Optimum Neuron library, ...
2,215 views