Hacker News Clone

Ask HN: Great, concise resource to learn LLM technical fundamentals?

by bulldog13 on 3/15/2024, 12:16:22 PM with 2 comments

Can anyone recommend a great, concise resource to learn LLM technical fundamentals, including transformers, pre-training, fine-tuning, etc., from the ground up, starting with neural networks?

The resources I can find are either about how to use langchain to write LLM apps, which is not what I want, or very deep graduate-level book study, which I don't have the time for.

Something that starts with neural networks (which I am familiar with) and takes you all the way to how LLMs work, but as concise as possible.

My goal is to be able to talk about LLM technical fundamentals cogently, not necessarily code one from scratch.

Thanks!

by boredemployee on 3/16/2024, 6:03:09 AM
I'm in the same boat as you, and I'm learning with this activeloop course [1]. I think it gives a good overview. The problem is that the code is a little bit outdated, which is good in a way because you have to look outside the course instead of just copying and pasting, so you actually know what's going on!
[1] https://learn.activeloop.ai/courses/llms
by eimrine on 3/15/2024, 4:50:17 PM
They are for using only, not for learning about.