Data Cleaning for Effective Data Science

Paperback
Published : Sunday 9 January 2000
ISBN : 9780136753353
Price : €44.45


Description

In Data Cleaning for Effective Data Science, leading data science trainer David Mertz provides the most systematic guide to cleaning data for any project, using any library or toolset. Mertz introduces many powerful techniques for analyzing, manipulating, and pre-processing data sources. He offers best practices for working with leading data formats such as JSON, CSV, SQL RDBMSes, HDF5, NoSQL databases, files in image formats, binary serialized data structures, and more. Mertz also focuses on crucial issues within the data itself, including missing data, outliers, biasing trends, class imbalance, value imputation, over/under-sampling, normalization and/or randomization, and anomalies. 


This guide is organized around downloadable datasets, each illuminating specific issues with data integrity or quality. Each chapter explores the best ways to diagnose, analyze, and remediate these issues, offering hands-on practice using tools such as Python, Pandas, sklearn.preprocessing, scipy.stats, R, and Tidyverse. While the examples are demonstrated with widely used tools, Mertz's concepts are applicable with any toolset. Each chapter also links to additional datasets with more problems, exercises, and solutions. Ancillary resources include Instructor Notes and PowerPoint lecture slides, which will both be downloadable from Pearson.com/us.



You may also like ...

Better Python Code

Paperback
05 Dec 2023
Programming and scripting languages: general

€46.79

Extended stock – Dispatch 5-7 days

Regular Expression Puzzles and AI Codin...

Hardback
24 Apr 2023
Computing and Information Technology

€36.26

Extended stock – Dispatch 5-7 days

David Bowie Ziggy Stardust Jigsaw Puzzl...

Paperback
01 Mar 2024
Stationery and miscellaneous items

€23.39

Extended stock – Dispatch 5-7 days
