Introduction to Machine Learning with Python: A Guide for Data Scientists
- Length: 392 pages
- Edition: 1
- Language: English
- Publisher: O'Reilly Media
- Publication Date: 2016-10-20
- ISBN-10: 1449369413
- ISBN-13: 9781449369415
- Sales Rank: #19185 (See Top 100 Books)
Machine learning has become an integral part of many commercial applications and research projects, but this field is not exclusive to large companies with extensive research teams. If you use Python, even as a beginner, this book will teach you practical ways to build your own machine learning solutions. With all the data available today, machine learning applications are limited only by your imagination.
You’ll learn the steps necessary to create a successful machine-learning application with Python and the scikit-learn library. Authors Andreas Müller and Sarah Guido focus on the practical aspects of using machine learning algorithms, rather than the math behind them. Familiarity with the NumPy and matplotlib libraries will help you get even more from this book.
With this book, you’ll learn:
- Fundamental concepts and applications of machine learning
- Advantages and shortcomings of widely used machine learning algorithms
- How to represent data processed by machine learning, including which data aspects to focus on
- Advanced methods for model evaluation and parameter tuning
- The concept of pipelines for chaining models and encapsulating your workflow
- Methods for working with text data, including text-specific processing techniques
- Suggestions for improving your machine learning and data science skills
Table of Contents
Chapter 1. Introduction
Chapter 2. Supervised Learning
Chapter 3. Unsupervised Learning and Preprocessing
Chapter 4. Representing Data and Engineering Features
Chapter 5. Model Evaluation and Improvement
Chapter 6. Algorithm Chains and Pipelines
Chapter 7. Working with Text Data
Chapter 8. Wrapping Up