Mastering Big Data Analytics with PySparkMP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz, 2 Ch
June 26, 2020 | ISBN: 9781838640583 | English
Duration: 41 Lessons (8h 7m) | Size: 1.54 GB
Effectively apply Advanced Analytics to large datasets using the power of PySpark
LearnGain a solid knowledge of vital Data Analytics concepts via practical use cases
Create elegant data visualizations using Jupyter
Run, process, and analyze large chunks of datasets using PySpark
Utilize Spark SQL to easily load big data into DataFrames
Create fast and scalable Machine Learning applications using MLlib with Spark
Perform exploratory Data Analysis in a scalable way
Achieve scalable, high-throughput and fault-tolerant processing of data streams using Spark Streaming
AboutPySpark helps you perform data analysis at-scale; it enables you to build more scalable analyses and pipelines. This course starts by introducing you to PySpark's potential for performing effective analyses of large datasets. You'll learn how to interact with Spark from Python and connect Jupyter to Spark to provide rich data visualizations. After that, you'll delve into various Spark components and its architecture.
You'll learn to work with Apache Spark and perform ML tasks more smoothly than before. Gathering and querying data using Spark SQL, to overcome challenges involved in reading it. You'll use the DataFrame API to operate with Spark MLlib and learn about the Pipeline API. Finally, we provide tips and tricks for deploying your code and performance tuning.
By the end of this course, you will not only be able to perform efficient data analytics but will have also learned to use PySpark to easily analyze large datasets at-scale in your organization.
All related code files are placed on a GitHub repository at:
https://github.com/PacktPublishing/Mastering-Big-Data-Analytics-with-PySpark]https://github.com/PacktPublishing/Mastering-Big-Data-Analytics-with-PySpark
FeaturesSolve your big data problems by building powerful Machine Learning models with Spark and implementing them using Python
Get up-and-running with Spark's essential libraries and tools (such as PySpark, Spark Streaming, Spark SQL, and Spark MLlib) and learn to apply them in practical, real-world big data applications
Leverage Spark 2.x-one of the most popular big data technologies-to discover how powerful Spark Machine Learning is how easily you can apply it!
Screenshots
Download link:
Só visivel para registados e com resposta ao tópico.Only visible to registered and with a reply to the topic.Links are Interchangeable - No Password - Single Extraction