You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Discover the fundamentals of Apache Spark, including its architecture, transformations and actions. Learn why the design choices of RDD were made and how this enhances the Hadoop and MapReduce construct.
We will look at SQL and NoSQL databases and decide the ideal database to be used based on the format of data. Then you will implement storing and querying data in different databases such as MySQL, MongoDB, and Cassandra.
You will look at several case studies, examining everything from AI being used to manipulate elections, to AI displaying racial and sexist prejudices. Implement a simple sentiment classifier to differentiate between positive and negative words and sentences. You'll observe how this works in many cases, and display the problematic biases and huma…
You will look at the different formats for storing, transmitting and organizing very large collections of data. You will learn how to choose the right data formats as well as look at compression.