Data Mining
What is Data Mining?

Introduction We are currently in a data-driven world, rich in information. Knowing there is superfluous and readily available information can feel comforting. However, the more there is available knowledge, the more challenging it becomes for you to get helpful insights. In that case, we will explore data mining aspects. We will discuss data mining, its...

Data Analytics
What is Data Analytics?

“The Global Datasphere will reach 175 zettabytes by 2025” – IDC Data has become the lifeblood of many organizations across the globe. Established enterprises have sophisticated systems to capture and store data. However, the reality for many organizations can be summarized in a single line. Data, data, everywhere, not a byte that is useful… Inside...

Modern Data Experience
Modern Day Data Experience – How to Get Started

Recently I was talking to a customer of ours about their digital transformation goals and as is the norm, the discussion came down to data, analytics, data science and you name it – we covered every possible technology in that area. “We have built a data lake. We use this tool for ETL and that...

How Apache Spark Programs are Executed
Learn How Apache Spark Programs are Executed

We discussed about the fundamentals of an Apache Spark system including its architecture in the previous blog. We shared the basic differences between Resilient Distributed Datasets and Dataframes. We’ve covered the what and the why, now we are going to discuss about the how. This blog will outline the different steps involved in an Apache...

Understanding the Fundamental Architecture of Apache Spark
Diving Deeper – Understanding the Fundamental Architecture of the Apache Spark

The previous blog, we saw about what is Apache Spark & how it is a preferred data analysis engine due to the speed it possesses. This blog is where we discuss about the intricacies of Apache spark’s working principle and we will introduce the concept of Spark RDD’s and data frames. Breaking the Class –...

Introduction to the World of Apache Spark
Introduction To The World Of Apache Spark In Big Data Analytics

InfoWorld defines Apache Spark as a data processing framework that can quickly perform processing tasks on very large data sets, and can also distribute data processing tasks across multiple computers, either on its own or in tandem with other distributed computing tools (Source: InfoWorld).  Apache Spark – The Go-To Data Analytics Engine Apache Spark is...

© 2022 Purpleslate Private Limited | Made with 🤍 at Chennai, India