MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz, 2 Ch
Skill Level: Advanced | Genre: eLearning | Language: English + srt | Duration: 1h 2m | Size: 156.8 MB
Data engineering is the foundation for building analytics and data science applications in the new Big Data world.
Data engineering requires combining multiple big data technologies to construct data pipelines and networks to stream, process, and store data. This course focuses on building full-fledged solutions that combine Apache Spark with other Big Data tools to create end-to-end data pipelines. Instructor Kumaran Ponnambalam begins by defining data engineering, its functions, and its concepts. Next, Kumaran goes over how Spark capabilities such as parallel processing, execution plans, state management options, and machine learning work with extract, transform, load (ETL). He introduces you to batch processing use cases and processes, as well as real-time processing pipelines. After walking you through several useful best practices, Kumaran concludes with an end-to-end exercise project.