Ask HN: Best books on introduction to AI/LLMs?

by laksmanv on 10/9/2024, 6:40:01 PM with 1 comment
I want to learn a bit more about how AI like ChatGPT and LLMs actually work. Do you have any suggestions for good books in this area?

  • by mindcrime on 10/9/2024, 6:48:55 PM

    There are a couple of new books on the topic that are slated to drop any day now, IIRC. Of what's already published, a few I'm familiar with include:

    Transformers for Natural Language Processing and Computer Vision: Explore Generative AI and Large Language Models with Hugging Face, ChatGPT, GPT-4V, and DALL-E 3

    https://www.amazon.com/gp/product/1805128728/

    Transformer, BERT, and GPT: Including ChatGPT and Prompt Engineering

    https://www.amazon.com/gp/product/1683928989/

    Introduction to Transformers for NLP: With the Hugging Face Library and Models to Solve Problems

    https://www.amazon.com/gp/product/1484288432

    Transformers for Machine Learning: A Deep Dive

    https://www.amazon.com/gp/product/0367767341/

    Natural Language Processing with Transformers, Revised Edition

    https://www.amazon.com/gp/product/1098136799

    EDIT:

    A couple of the ones that I thought were still pending have now been released. I haven't read any of these, but they caught my eye and I was planning to get them:

    Large Language Models: A Deep Dive: Bridging Theory and Practice

    https://www.amazon.com/gp/product/3031656466

    Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG

    https://www.amazon.com/gp/product/B0D4FFPFW8

    Building LLM Powered Applications: Create intelligent apps and agents with large language models

    https://www.amazon.com/gp/product/1835462316

    The "not yet released" group still includes:

    Build a Large Language Model (From Scratch) (ships Oct. 29th)

    https://www.amazon.com/gp/product/1633437167

    LLM Engineer's Handbook: Master the art of engineering Large Language Models from concept to production (ships Nov. 11th)

    https://www.amazon.com/gp/product/1836200072

    Hands-On Large Language Models: Language Understanding and Generation (ships ???)

    https://www.amazon.com/gp/product/1098150961