The book introduces the core principles and theoretical foundations of deep learning, bridging the gap between classical machine learning and modern neural networks. It explains the key architectures, optimization techniques, and mathematical frameworks that underpin today's AI systems. By combining rigorous treatment with accessible explanations, it enables researchers and practitioners to understand not just how deep models work, but why. Its impact lies in strengthening the rigor of the field, shaping curricula, and guiding both industry innovation and the next generation of AI breakthroughs.
Deep learning uses multilayered neural networks trained on large data sets to solve complex information-processing tasks, and it has emerged as the most successful paradigm in the field of machine learning. Over the last decade, deep learning has revolutionized many domains, including computer vision, speech recognition, and natural language processing, and it is being used in a growing multitude of applications across healthcare, manufacturing, commerce, finance, scientific discovery, and many other sectors. Recently, massive neural networks known as large language models, comprising on the order of a trillion learnable parameters, have been found to exhibit the first indications of general artificial intelligence and are now driving one of the biggest disruptions in the history of technology.

This expanding impact has been accompanied by an explosion in the number and breadth of research publications in machine learning, and the pace of innovation continues to accelerate. For newcomers to the field, the challenge of getting to grips with the key ideas, let alone catching up to the research frontier, can seem daunting. Against this backdrop, Deep Learning: Foundations and Concepts aims to provide newcomers to machine learning, as well as those already experienced in the field, with a thorough understanding of both the foundational ideas that underpin deep learning and the key concepts of modern deep learning architectures and techniques. This material will equip the reader with a strong basis for future specialization.

Given the breadth and pace of change in the field, we have deliberately avoided trying to create a comprehensive survey of the latest research. Instead, much of the value of the book derives from a distillation of key ideas, and although the field itself can be expected to continue its rapid advance, these foundations and concepts are likely to stand the test of time. For example, at the time of writing, large language models are evolving very rapidly, yet the underlying transformer architecture and attention mechanism have remained largely unchanged for the last five years, while many core principles of machine learning have been known for decades.