Lightweight LLM AI inference with Wasm video

Name: Lightweight LLM AI inference with Wasm video
Uploaded: 2024-05-29T12:00:00Z
Duration: 38 min 56 s
Description: Explore lightweight large language model inference with WebAssembly in Michael Yuan's Navigate tech talk. Learn to run models like LLaMA efficiently across platforms.

Speakers: Michael Yuan

Join Michael Yuan as he explores lightweight large language model (LLM) inference with WebAssembly (WASM). In this tech video demo, Michael demonstrates how to run full-scale LLMs like LLaMA on various platforms, from personal laptops to cloud servers, with the efficiency of WASM. He addresses the challenges of running LLMs in cloud environments, offers practical demos, and discusses future applications.

Lightweight LLM AI inference with Wasm video

Stay up to date