Here’s a product update we’ve been working on. Our multi-headed model is accurate in 49 languages, and the full pipeline also handles normalisation (“ten dollars” -> “$10”), abbreviations, and punctuation well. That makes a big difference when you’re feeding the output to, say, an LLM in real time.
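To make the normalisation step concrete, here is a toy sketch of the “ten dollars” -> “$10” rewrite. It is not how our pipeline works internally; the `NUMBER_WORDS` table and the `normalise_currency` function are hypothetical names made up for illustration, and the rule only covers the words one to ten.

```python
import re

# Hypothetical lookup table for illustration only; a real system
# would need far broader coverage (and more than one language).
NUMBER_WORDS = {
    "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
}

# Matches a known number word followed by "dollar" or "dollars".
_CURRENCY = re.compile(
    r"\b(" + "|".join(NUMBER_WORDS) + r") dollars?\b",
    re.IGNORECASE,
)

def normalise_currency(text: str) -> str:
    """Rewrite spelled-out dollar amounts as digits with a $ sign."""
    return _CURRENCY.sub(
        lambda m: f"${NUMBER_WORDS[m.group(1).lower()]}", text
    )

print(normalise_currency("It costs ten dollars."))
```

Text without a matching amount passes through unchanged, so a rule like this could in principle run on every chunk before it reaches the LLM.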
(Should we add oxford_comma as a config option?)