Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Senior Data Engineer PySpark, Informatica, @ Hitachi

Home > Software Development

 Senior Data Engineer PySpark, Informatica,

Job Description

Description:
  • Join GlobalLogic, to be a valid part of the team working on a huge software project for the world-class company providing M2M / IoT 4G/5G modules e g to the automotive, healthcare and logistics industries
  • Through our engagement, we contribute to our customer in developing the end-user modulesfirmware, implementing new features, maintaining compatibility with the newest telecommunication and industry standards, as well as performing analysis and estimations of the customer requirements
  • Requirements:
  • Our Big Data capability team needs hands-on developers who can produce beautiful functional code to solve complex analytics problems
  • If you are an exceptional developer with an aptitude to learn and implement using new technologies, and who loves to push the boundaries to solve complex business problems innovatively, then we would like to talk with you
  • You would be responsible for evaluating, developing, maintaining and testing big data solutions for advanced analytics projects
  • The role would involve big data pre-processing reporting workflows including collecting, parsing, managing, analyzing and visualizing large sets of data to turn information into business insights
  • The role would also involve testing various machine learning models on Big Data, and deploying learned models for ongoing scoring and prediction
  • An appreciation of the mechanics of complex machine learning algorithms would be a strong advantage
  • Qualification Experience 8-10 years of demonstrable experience designing technological solutions to complex data problems, developing testing modular, reusable, efficient and scalable code to implement those solutions
  • Ideally, this would include work on the following technologies
  • Expert-level proficiency in Scala PySpark knowledge is a strong advantage
  • Exp in at least one of Java, Scala or Python Preferred
  • Strong understanding and experience in distributed computing frameworks, particularly Apache Hadoop YARN, MR, HDFS and associated technologies one or more of Hive, Sqoop, Avro, Flume, Oozie, Zookeeper, Impala, etc Hands-on experience with Apache Spark and its components (Streaming, SQL, MLLib) is a strong advantage
  • Operating knowledge of cloud computing platforms (AWS orAzure or GCP)
  • Experience working within a Linux computing environment, and use of command line tools including knowledge of Shell/Python scripting for automating common tasks
  • Ability to work in a team in an agile setting, familiarity with JIRA and clear understanding of how Git works or any version control tools
  • In addition, the ideal candidate would have great problem-solving skills, and the ability confidence to hack their way out of tight corners
  • Experience:
  • Must Have (handson) Experience Scala or Python or PySpark expertise
  • Distributed computing framewor ks (Hadoop Ecosystem AND Spark components)
  • Cloud computing platforms AWS
  • Linux environment, SQL and Shell scripting
  • Nice to have : DevOps knowledge
  • Job Responsibilities:
  • Our Big Data capability team needs hands-on developers who can produce beautiful functional code to solve complex analytics problems
  • If you are an exceptional developer with an aptitude to learn and implement using new technologies, and who loves to push the boundaries to solve complex business problems innovatively, then we would like to talk with you
  • You would be responsible for evaluating, developing, maintaining and testing big data solutions for advanced analytics projects
  • The role would involve big data pre-processing reporting workflows including collecting, parsing, managing, analyzing and visualizing large sets of data to turn information into business insights
  • The role would also involve testing various machine learning models on Big Data, and deploying learned models for ongoing scoring and prediction
  • An appreciation of the mechanics of complex machine learning algorithms would be a strong advantage
  • Qualification Experience 8-10 years of demonstrable experience designing technological solutions to complex data problems, developing testing modular, reusable, efficient and scalable code to implement those solutions
  • Ideally, this would include work on the following technologies
  • Expert-level proficiency in Scala PySpark knowledge is a strong advantage
  • Exp in at least one of Java, Scala or Python Preferred
  • Strong understanding and experience in distributed computing frameworks, particularly Apache Hadoop YARN, MR, HDFS and associated technologies one or more of Hive, Sqoop, Avro, Flume, Oozie, Zookeeper, Impala, etc Hands-on experience with Apache Spark and its components (Streaming, SQL, MLLib) is a strong advantage
  • Operating knowledge of cloud computing platforms (AWS orAzure or GCP)
  • Experience working within a Linux computing environment, and use of command line tools including knowledge of Shell/Python scripting for automating common tasks
  • Ability to work in a team in an agile setting, familiarity with JIRA and clear understanding of how Git works or any version control tools
  • In addition, the ideal candidate would have great problem-solving skills, and the ability confidence to hack their way out of tight corners
  • Experience:
  • Must Have (handson) Experience Scala or Python or PySpark expertise
  • Distributed computing framewor ks (Hadoop Ecosystem AND Spark components)
  • Cloud computing platforms AWS
  • Linux environment, SQL and Shell scripting
  • Nice to have : DevOps knowledge
  • What We Offer
  • Exciting Projects: We focus on industries like High-Tech, communication, media, healthcare, retail and telecom
  • Our customer list is full of fantastic global brands and leaders who love what we build for them
  • Collaborative Environment: You Can expand your skills by collaborating with a diverse team of highly talented people in an open, laidback environment or even abroad in one of our global centers or client facilities!
  • Work-Life Balance: GlobalLogic prioritizes work-life balance, which is why we offer flexible work schedules, opportunities to work from home, and paid time off and holidays
  • Professional Development: Our dedicated Learning Development team regularly organizes Communication skills training(GL Vantage, Toast Master),Stress Management program, professional certifications, and technical and soft skill trainings
  • Excellent Benefits: We provide our employees with competitive salaries, family medical insurance, Group Term Life Insurance, Group Personal Accident Insurance , NPS(National Pension Scheme ), Periodic health awareness program, extended maternity leave, annual performance bonuses, and referral bonuses
  • Job Classification

    Industry: Courier / Logistics
    Functional Area / Department: Engineering - Software & QA,
    Role Category: Software Development
    Role: Data Engineer
    Employement Type: Full time

    Contact Details:

    Company: Hitachi
    Location(s): Noida, Gurugram

    + View Contactajax loader


    Keyskills:   hive oozie flume sqoop apache zookeeper avro

     Fraud Alert to job seekers!

    ₹ Not Disclosed

    Similar positions

    Lead Software Engineer

    • Capgemini
    • 5 - 8 years
    • Bengaluru
    • 0 seconds
    ₹ Not Disclosed

    Senior Member of Technical Staff

    • Oracle
    • 8 - 13 years
    • Kolkata
    • 6 hours ago
    ₹ Not Disclosed

    Data Engineer - Data Platforms-Google

    • IBM
    • 2 - 5 years
    • Hyderabad
    • 7 hours ago
    ₹ Not Disclosed

    Senior Platform Engineer - Java, Vue.js / React.js, DevOps

    • SAP
    • 9 - 11 years
    • Bengaluru
    • 7 hours ago
    ₹ Not Disclosed

    Hitachi

    Hitachi Vantara, a wholly owned subsidiary of Hitachi, Ltd., helps data-driven leaders use the value in their data to innovate intelligently and reach outcomes that matter for business and society - what we call a double bottom line. Only Hitachi Vantara combines 100+ years of experience in operatio...