Bookbot

Data Science with Python and Dask

Valutazione del libro

Maggiori informazioni sul libro

Dask is a native parallel analytics tool that integrates seamlessly with existing libraries like Pandas, NumPy, and Scikit-Learn, allowing you to work with large datasets using familiar tools. This guide shows how to leverage Dask for data projects without altering your workflow. The print book purchase includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications, with registration instructions provided inside. Efficient data pipelines are crucial for successful data science projects. Dask offers a flexible library for parallel computing in Python, enabling intuitive workflows for ingesting and analyzing large, distributed datasets. It features dynamic task scheduling and parallel collections that enhance the capabilities of NumPy, Pandas, and Scikit-Learn, allowing users to scale their code from a single laptop to a cluster of hundreds of machines effortlessly. The book teaches you to build scalable projects capable of handling massive datasets. You'll explore the Dask framework, analyze the NYC Parking Ticket database, and use DataFrames for streamlined processes. You'll also create machine learning models with Dask-ML, develop interactive visualizations, and build clusters using AWS and Docker. This resource is intended for data scientists and developers familiar with Python and the PyData stack. The author, Jesse Daniel, is an experienced Python developer and educator, leading a team of data scienti

Acquisto del libro

Data Science with Python and Dask, Jesse C. Daniel

Lingua
Pubblicato
2019
product-detail.submit-box.info.binding
(In brossura)
Ti avviseremo via email non appena lo rintracceremo.

Metodi di pagamento

3,6
Molto buono
13 Valutazioni

Qui potrebbe esserci la tua recensione.

Titolo
Data Science with Python and Dask
Lingua
Inglese
Pubblicato
2019
Formato
In brossura
Pagine
296
ISBN10
1617295604
ISBN13
9781617295607
Serie
Valutazione
3,6 su 5
Descrizione
Dask is a native parallel analytics tool that integrates seamlessly with existing libraries like Pandas, NumPy, and Scikit-Learn, allowing you to work with large datasets using familiar tools. This guide shows how to leverage Dask for data projects without altering your workflow. The print book purchase includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications, with registration instructions provided inside. Efficient data pipelines are crucial for successful data science projects. Dask offers a flexible library for parallel computing in Python, enabling intuitive workflows for ingesting and analyzing large, distributed datasets. It features dynamic task scheduling and parallel collections that enhance the capabilities of NumPy, Pandas, and Scikit-Learn, allowing users to scale their code from a single laptop to a cluster of hundreds of machines effortlessly. The book teaches you to build scalable projects capable of handling massive datasets. You'll explore the Dask framework, analyze the NYC Parking Ticket database, and use DataFrames for streamlined processes. You'll also create machine learning models with Dask-ML, develop interactive visualizations, and build clusters using AWS and Docker. This resource is intended for data scientists and developers familiar with Python and the PyData stack. The author, Jesse Daniel, is an experienced Python developer and educator, leading a team of data scienti