Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Data Streaming @ Integrated Personnel

Home > IT Operations / EDP / MIS

 Data Streaming

Job Description

: Spark/Scala/PySpark developer who knows how to fully exploit the potential of our Spark
cluster. Should have the ability to clean, transform, and analyze vast amounts of raw data from
various systems using Spark to provide ready-to-use data.     Responsibilities: Create Scala/Spark/Pyspark jobs for data transformation and aggregation Produce unit tests for Spark transformations and helper methods Write Scaladoc-style documentation with all code Design data processing pipelines     Skills: Pyspark Scala (with a focus on the functional programming paradigm) Apache Spark 2. x, 3. x
  • Apache Spark RDD API
  • Apache Spark SQL DataFrame API
  • Apache Spark Streaming API
Spark query tuning and performance optimization SQL database integration (Postgres, and/or MySQL) Experience working with HDFS, AWS ( S3, Redshift, EMR, IAM, Polices, Routing) CI-CD Pipeline, Jenkins, Gitlab /Bitbucket Deep understanding of distributed systems (e.g. partitioning, replication, consistency, and consensus)

Employement Category:

Employement Type: Full time
Industry: IT - Software
Role Category: IT Operations / EDP / MIS
Functional Area: Not Applicable
Role/Responsibilies: Data Streaming

Contact Details:

Company: Integrated Personnel
Location(s): Multi-City, India

+ View Contactajax loader


Keyskills:   scala spark pyspark

 Job seems aged, it may have been expired!
 Fraud Alert to job seekers!

₹ 3.0 - 5 Lakh/Yr

Similar positions

Integrated Personnel

Integrated Personnel Services Limited (IPS Group) is a team of experienced professionals providing end to end Human resource management solutions to the top-notch corporates in various industries.