by mach1ne on 3/11/2023, 7:46:50 PM
How did this not get traction? I came across the model on Twitter by chance. This is the first seemingly valid and functional open-source ChatGPT alternative. Can’t wait for the optimizations that will allow it (or similar models) to run on consumer-grade GPUs.
by lxe on 3/14/2023, 8:26:35 AM
Hacker News seems to have missed it. I’m only now discovering this amid the LLaMA hype.
> We are releasing OpenChatKit 0.15 under an Apache-2.0 license, with full access to source code, model weights and training datasets.
This model has the best license of them all, allowing for all sorts of use cases. Nicely done!
by jamiedg on 3/10/2023, 9:14:31 PM
Hi everyone, I work at Together. Today we released OpenChatKit: an open-source base for building chatbots for a variety of applications. More than a model release, this is the beginning of an open-source project. We are releasing a set of tools and processes for ongoing improvement with community contributions.
You can try it now on Hugging Face! https://huggingface.co/spaces/togethercomputer/OpenChatKit
OpenChatKit includes 4 key components:
1. An instruction-tuned large language model, fine-tuned for chat from EleutherAI’s GPT-NeoX-20B with over 43 million instructions on 100% carbon-negative compute, available under the Apache-2.0 license on Hugging Face (see the usage sketch after this list).
2. A set of customization recipes to fine-tune the model for high accuracy on your tasks, documented and available as open source under the Apache-2.0 license on GitHub, along with code to recreate our model results.
3. An extensible retrieval system that lets you augment bot responses with information from a document repository, API, or other live-updating source at inference time, with open-source examples for using Wikipedia or a web search API (sketched after this list).
4. A moderation model, fine-tuned from GPT-JT-6B, designed to filter which questions the bot responds to, also available under the Apache-2.0 license on Hugging Face.
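For item 1, here is a minimal sketch of how one might query the chat model with Hugging Face transformers. The repo id togethercomputer/GPT-NeoXT-Chat-Base-20B and the <human>/<bot> prompt format are assumptions based on this release, not something spelled out in the thread, so check the model card before relying on them:

```python
# Minimal sketch: chat with the fine-tuned GPT-NeoX-20B model via transformers.
# Assumes the repo id "togethercomputer/GPT-NeoXT-Chat-Base-20B" and the
# <human>/<bot> chat format; verify both against the Hugging Face model card.
# A 20B model needs roughly 40 GB in fp16, so device_map="auto" (requires the
# `accelerate` package) spreads the weights across available devices.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "togethercomputer/GPT-NeoXT-Chat-Base-20B"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "<human>: What is a large language model?\n<bot>:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```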
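And for item 3, a sketch of the retrieval pattern: fetch relevant passages at inference time and prepend them to the chat prompt so the bot can ground its answer in fresh information. Here retrieve_passages is a hypothetical stand-in; OpenChatKit’s own examples plug in Wikipedia or a web search API instead:

```python
# Sketch of inference-time retrieval augmentation. retrieve_passages() is a
# hypothetical stand-in for a real document/Wikipedia/web-search retriever.
def retrieve_passages(query: str, k: int = 2) -> list:
    # Stand-in: a real retriever would query a document index or search API.
    return ["OpenChatKit is an open-source toolkit for building chatbots."][:k]

def build_prompt(question: str) -> str:
    context = "\n".join(retrieve_passages(question))
    # Prepend the retrieved text, then use the assumed <human>/<bot> format
    # from the sketch above so the model answers from the given context.
    return f"Context:\n{context}\n\n<human>: {question}\n<bot>:"

print(build_prompt("What is OpenChatKit?"))
```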
We collaborated with the tremendous communities at @laion_ai and Ontocord to create the training dataset used for these models, which is also released as open source. Read the full details in LAION's blog post!
Can't wait to hear your feedback!
Thanks, -Together