Building Transformer Models with PyTorch 2.0: NLP, computer vision, and speech processing with PyTorch and Hugging Face (English Edition)

(3 customer reviews)

$39.95

SKU: 9355517491 Categories: ,

Description

Editorial Reviews

About the Author

Prem Timsina is the Director of Engineering at Mount Sinai Health Systems, where he oversees the development and implementation of Machine Learning Data Products. He has overseen multiple Machine Learning products that have been used as clinical decision support tool at multiple hospitals within New York City.

Additional information

Dimensions7.5 × 0.6 × 9.25 foot
Publisher ‏

‎ BPB Publications (March 8, 2024)

Language ‏

‎ English

Paperback ‏

‎ 266 pages

ISBN-10 ‏

‎ 9355517491

ISBN-13 ‏

‎ 978-9355517494

Item Weight

‎ 1.18 pounds

Dimensions ‏

‎ 7.5 x 0.6 x 9.25 inches

3 reviews for Building Transformer Models with PyTorch 2.0: NLP, computer vision, and speech processing with PyTorch and Hugging Face (English Edition)

  1. fabio santana


    I work as a technical project manager and purchased this book to not only learn the fundamentals of transformer models but also understand how to develop and implement projects in practice using these models. This book fulfills both goals: teaching the fundamentals and providing step-by-step, practical examples.The first chapter does a great job in detailing the architecture of transformer models. Since I do not have expertise in this domain, some terminology went over my head but the book provides enough information for you to perform additional research and dive deeper into any related topics as needed. The information is structured in a logical sequence and broken down into small pieces, making it easy to follow how data is processed through each layer of the transformer architecture. The book also provides a link to a google drive folder with colored versions of all illustrations, which was helpful when trying to visualize the different steps (embedding, positional encoding, attention mechanism, etc.).The next chapters contain several hands-on practical examples of how to implement these models, starting with the basics of creating custom tokenizers (which led me to watch videos on the history and inner workings of Byte Pair Encoders (BPEs)) to full blown computer vision, speech processing, and multimodal projects. Since every code example is readily available as jpnyb files (that will take you straight to Google Colab), it is extremely easy for anyone to follow along and create these projects themselves.If you are interested in learning more about transformer models but is a bit overwhelmed with all the content available, this is the perfect book to guide you through in a logical sequence and allow you to learn “by doing” with easy to follow code examples.

  2. sunita


    This book covers the comprehensive topics of ML (NLP, Computer Vision, Audio, Structured Data, RL ..) and how transformer can be applied on those topics. I find this very helpful to understand transformer and it’s application. Specially, I like the approach where the author describes the architecture and goes into application via practical example.

  3. DhavalP


    It’s pretty good book, compare to others. Main thing is that language, content and coding samples which can easily get your experience more informative and learning.

Add a review

Your email address will not be published. Required fields are marked *