Joined 8/10/2018, 7:11:17 PM has 252 karma
Show HN: Automatically Build Nvidia TRT-LLM Engines
Show HN: 60% higher tokens per second for 70B custom LLMs
Show HN: Baseten Chains – Framework and SDK for Multi-Model AI Products
Open Source Inference Engine Baseten Raises $40M from IVP, Spark and Greylock