Senior Data Engineer PySpark, Informatica, @ Hitachi

Home > Software Development

Senior Data Engineer PySpark, Informatica,

Hitachi
6 - 8 years
Noida, Gurugram
2 months ago
Email to a friend
Report this job

Job Description

Description:

Join GlobalLogic, to be a valid part of the team working on a huge software project for the world-class company providing M2M / IoT 4G/5G modules e g to the automotive, healthcare and logistics industries

Through our engagement, we contribute to our customer in developing the end-user modulesfirmware, implementing new features, maintaining compatibility with the newest telecommunication and industry standards, as well as performing analysis and estimations of the customer requirements

Requirements:

Our Big Data capability team needs hands-on developers who can produce beautiful functional code to solve complex analytics problems

If you are an exceptional developer with an aptitude to learn and implement using new technologies, and who loves to push the boundaries to solve complex business problems innovatively, then we would like to talk with you

You would be responsible for evaluating, developing, maintaining and testing big data solutions for advanced analytics projects

The role would involve big data pre-processing reporting workflows including collecting, parsing, managing, analyzing and visualizing large sets of data to turn information into business insights

The role would also involve testing various machine learning models on Big Data, and deploying learned models for ongoing scoring and prediction

An appreciation of the mechanics of complex machine learning algorithms would be a strong advantage

Qualification Experience 8-10 years of demonstrable experience designing technological solutions to complex data problems, developing testing modular, reusable, efficient and scalable code to implement those solutions

Ideally, this would include work on the following technologies

Expert-level proficiency in Scala PySpark knowledge is a strong advantage

Exp in at least one of Java, Scala or Python Preferred

Strong understanding and experience in distributed computing frameworks, particularly Apache Hadoop YARN, MR, HDFS and associated technologies one or more of Hive, Sqoop, Avro, Flume, Oozie, Zookeeper, Impala, etc Hands-on experience with Apache Spark and its components (Streaming, SQL, MLLib) is a strong advantage

Operating knowledge of cloud computing platforms (AWS orAzure or GCP)

Experience working within a Linux computing environment, and use of command line tools including knowledge of Shell/Python scripting for automating common tasks

Ability to work in a team in an agile setting, familiarity with JIRA and clear understanding of how Git works or any version control tools

In addition, the ideal candidate would have great problem-solving skills, and the ability confidence to hack their way out of tight corners

Experience:

Must Have (handson) Experience Scala or Python or PySpark expertise

Distributed computing framewor ks (Hadoop Ecosystem AND Spark components)

Cloud computing platforms AWS

Linux environment, SQL and Shell scripting

Nice to have : DevOps knowledge

Job Responsibilities:

Our Big Data capability team needs hands-on developers who can produce beautiful functional code to solve complex analytics problems

You would be responsible for evaluating, developing, maintaining and testing big data solutions for advanced analytics projects

The role would involve big data pre-processing reporting workflows including collecting, parsing, managing, analyzing and visualizing large sets of data to turn information into business insights

The role would also involve testing various machine learning models on Big Data, and deploying learned models for ongoing scoring and prediction

An appreciation of the mechanics of complex machine learning algorithms would be a strong advantage

Ideally, this would include work on the following technologies

Expert-level proficiency in Scala PySpark knowledge is a strong advantage

Exp in at least one of Java, Scala or Python Preferred

Operating knowledge of cloud computing platforms (AWS orAzure or GCP)

Experience working within a Linux computing environment, and use of command line tools including knowledge of Shell/Python scripting for automating common tasks

Ability to work in a team in an agile setting, familiarity with JIRA and clear understanding of how Git works or any version control tools

In addition, the ideal candidate would have great problem-solving skills, and the ability confidence to hack their way out of tight corners

Experience:

Must Have (handson) Experience Scala or Python or PySpark expertise

Distributed computing framewor ks (Hadoop Ecosystem AND Spark components)

Cloud computing platforms AWS

Linux environment, SQL and Shell scripting

Nice to have : DevOps knowledge

What We Offer

Exciting Projects: We focus on industries like High-Tech, communication, media, healthcare, retail and telecom

Our customer list is full of fantastic global brands and leaders who love what we build for them

Collaborative Environment: You Can expand your skills by collaborating with a diverse team of highly talented people in an open, laidback environment or even abroad in one of our global centers or client facilities!

Work-Life Balance: GlobalLogic prioritizes work-life balance, which is why we offer flexible work schedules, opportunities to work from home, and paid time off and holidays

Professional Development: Our dedicated Learning Development team regularly organizes Communication skills training(GL Vantage, Toast Master),Stress Management program, professional certifications, and technical and soft skill trainings

Excellent Benefits: We provide our employees with competitive salaries, family medical insurance, Group Term Life Insurance, Group Personal Accident Insurance , NPS(National Pension Scheme ), Periodic health awareness program, extended maternity leave, annual performance bonuses, and referral bonuses

Job Classification

Industry: Courier / Logistics
Functional Area / Department: Engineering - Software & QA,
Role Category: Software Development
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: Hitachi
Location(s): Noida, Gurugram

+ View Contact

Login

Candidates can login here to view contacts and apply.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

500 characters remaining

Attach Resume Max 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Candidates are expected to provide most recent and accurate profile information, inappropriate content is strictly prohibited!

Keyskills: hive oozie flume sqoop apache zookeeper avro

Fraud Alert to job seekers!

₹ Not Disclosed

Job application

We will notify the employer with your details. You can also attach a resume or a cover letter.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach ResumeMax 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Similar positions

Lead Software Engineer

Capgemini

5 - 8 years

Bengaluru

0 seconds

₹ Not Disclosed

Senior Member of Technical Staff

Oracle

8 - 13 years

Kolkata

6 hours ago

₹ Not Disclosed

Data Engineer - Data Platforms-Google

IBM

2 - 5 years

Hyderabad

7 hours ago

₹ Not Disclosed

Senior Platform Engineer - Java, Vue.js / React.js, DevOps

SAP

9 - 11 years

Bengaluru

7 hours ago

₹ Not Disclosed

Hitachi

Hitachi Vantara, a wholly owned subsidiary of Hitachi, Ltd., helps data-driven leaders use the value in their data to innovate intelligently and reach outcomes that matter for business and society - what we call a double bottom line. Only Hitachi Vantara combines 100+ years of experience in operatio...

Senior Data Engineer PySpark, Informatica, @ Hitachi

Home > Software Development