Data Validation Engineer
Hadoop Projects HD1 : Analyzing Marketing Data for a Portuguese banking institution Frameworks & Tool Spark & Hive SQL, Spark Dataframe ,Hadoop cluster with Data Block replication factor as three, Scoop, MapReduce, Hive, Cloudera Search ,Hue, Scala, Python Project Abstract Portuguese banking institution—ran a marketing campaign to convince potential customers to invest in bank term deposit and further meaningful data analysis on campaign were required In order to find customer probability to subscribe for product Role and Responsibilities Hadoop & SAS Developer: ▪ Creation of Spark RDD and define schema with delimiter ▪ Performing multiple actions and transformations using Spark and Hive SQL ▪ Manually Import data file in SQL Server and import in HDFS, further using Spark Dataframe receive the required results ▪ Analyze data reside in HDFS using Hive ▪ Write MR code for analyzing data ▪ Verify dataset and resultset using Cloudera Search and Hue