Building The Future Show - Radio / TV / Podcast
Ep. 567 w/ Brian Stevens, CEO at Neural Magic
Together with our community, we engineer sparse LLM, CV, and NLP models that are more efficient and performant in production. Why does this matter? Sparse models are more flexible and can achieve unrivaled latency and throughput performance on your privat