Big Data Analysis with Python

Author : Ivan Marin
Total Pages : 276
Release : 2019-04-08
ISBN-10 : 1789955289
ISBN-13 : 9781789955286

Book Synopsis: Big Data Analysis with Python by Ivan Marin

Big Data Analysis with Python, written by Ivan Marin, was released on 2019-04-08 and has 276 pages. It is available in PDF, EPUB, and Kindle formats.

Book excerpt: Get to grips with processing large volumes of data and presenting it as engaging, interactive insights using Spark and Python.

Key Features:
Get a hands-on, fast-paced introduction to the Python data science stack
Explore ways to create useful metrics and statistics from large datasets
Create detailed analysis reports with real-world data

Book Description:
Processing big data in real time is challenging because of scalability, information inconsistency, and fault tolerance. Big Data Analysis with Python teaches you how to use tools that can control this data avalanche for you. With this book, you'll learn practical techniques to aggregate data into useful dimensions for subsequent analysis, extract statistical measurements, and transform datasets into features for other systems.

The book begins with an introduction to data manipulation in Python using pandas. You'll then get familiar with statistical analysis and plotting techniques. Through multiple hands-on activities, you'll learn to analyze data that is distributed across several computers by using Dask. As you progress, you'll study how to aggregate data for plots when the entire dataset does not fit in memory. You'll also explore Hadoop (HDFS and YARN), which will help you tackle larger datasets. The book also covers Spark and explains how it interacts with other tools. By the end of this book, you'll be able to bootstrap your own Python environment, process large files, and manipulate data to generate statistics, metrics, and graphs.

What you will learn:
Use Python to read and transform data into different formats
Generate basic statistics and metrics using data on disk
Work with computing tasks distributed over a cluster
Convert data from various sources into storage or querying formats
Prepare data for statistical analysis, visualization, and machine learning
Present data in the form of effective visuals

Who this book is for:
Big Data Analysis with Python is designed for Python developers, data analysts, and data scientists who want to get hands-on with methods to control data and transform it into impactful insights. Basic knowledge of statistical measurements and relational databases will help you understand the concepts explained in this book.


Books Related to Big Data Analysis with Python

Big Data Analysis with Python
Language: en
Pages: 276
Authors: Ivan Marin
Categories: Computers
Type: BOOK - Published: 2019-04-08

Get to grips with processing large volumes of data and presenting it as engaging, interactive insights using Spark and Python. Key Features Get a hands-on, fast
Big Data Analytics with Hadoop 3
Language: en
Pages: 471
Authors: Sridhar Alla
Categories: Computers
Type: BOOK - Published: 2018-05-31 - Publisher: Packt Publishing Ltd

Explore big data concepts, platforms, analytics, and their applications using the power of Hadoop 3 Key Features Learn Hadoop 3 to build effective big data anal
Frank Kane's Taming Big Data with Apache Spark and Python
Language: en
Pages: 289
Authors: Frank Kane
Categories: Computers
Type: BOOK - Published: 2017-06-30 - Publisher: Packt Publishing Ltd

Frank Kane's hands-on Spark training course, based on his bestselling Taming Big Data with Apache Spark and Python video, now available in a book. Understand an
Python for Data Analysis
Language: en
Pages: 553
Authors: Wes McKinney
Categories: Computers
Type: BOOK - Published: 2017-09-25 - Publisher: "O'Reilly Media, Inc."

Get complete instructions for manipulating, processing, cleaning, and crunching datasets in Python. Updated for Python 3.6, the second edition of this hands-on
Data Analysis with Python and PySpark
Language: en
Pages: 454
Authors: Jonathan Rioux
Categories: Computers
Type: BOOK - Published: 2022-03-22 - Publisher: Simon and Schuster

Think big about your data! PySpark brings the powerful Spark big data processing engine to the Python ecosystem, letting you seamlessly scale up your data tasks