Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Java + Big Data Tech Lead @ Hitachi

Home > Programming & Design

 Java + Big Data Tech Lead

Job Description

 
  • Basics of Distributed computing
  • MapReduce
  • Distributed computing vs RDBMS/ scale up vs scale out
  • Hands on experience in any one of the programming languages (Java, Python, Scala)
  • Understanding of Linux and Bash scripting
  • Knowledge of SQL
  • Basics of Hadoop framework, problem patterns that can be solved like filtering, aggregation, joins etc
  • Understanding of Spark concepts like RDD, Dataframes, Clousures etc., has implemented at least one project using Spark and Scala
  • Should have worked on at least 1-2 bigdata projects (Could be ingestion, ETL processing) on the Cloudera Platform
  • Understanding of Hive/Pig, concepts like partitioning, bucketing, metastore, schema on read vs schema on write, SerDe
  • Solid programming fundaments /design concepts.
  • In depth understanding of different batch and stream processing technologies and NoSQL storage
  • Demonstrated work experience as an Sr.Developer/ Jr. Architect role in Bigdata/Cloud and opensource technology stack.
  • Should be able to articulate, suggest right use of technology stack for different use cases with reasoning.
  • Understanding of Lambda, Kappa architecture
  • Should have participated or able to suggest right hardware choices, platform components, distributions etc.
  • Programming concepts
  • Object oriented vs Functional programming concepts
  • Design patterns (Singleton, Immutable, Factory)
  • MapReduce Programming like Combiner, Partitioiner, InputFormat/OutputFormat, Serialization
  • Distributed Computing
    1. Scale up vs Scale out
  • Scale up vs Scale out
  • Scala hands on, SparkSQL, dataframes etc.
  • Understanding of different storage formats Avro, RCFile, ORC, Parquet
  • Has worked/working on any one of the cloud platform AWS, Azure, GCP
  • Has worked/working on any one of the bigdata platforms like Hortonworks, Cloudera, Datastacks, Databricks
  • Aware of latest technology trends in streaming, real-time, batch processing frameworks (Storm, Apache Beam, Flink, Spark, Kafka Connect etc)
  • Certified in any of the bigdata distribution (Hortonworks / Cloduera / Databricks / Datastacks)

Job Classification

Industry: Consumer Electronics & Appliances
Functional Area: IT Software - Application Programming, Maintenance,
Role Category: Programming & Design
Role: Programming & Design
Employement Type: Full time

Education

Under Graduation: Any Graduate
Post Graduation: Post Graduation Not Required
Doctorate: Doctorate Not Required

Contact Details:

Company: Hitachi
Location(s): Bengaluru

+ View Contactajax loader


Keyskills:   Hive Design Patterns Scala Hadoop Big Data Spark Python

 Job seems aged, it may have been expired!
 Fraud Alert to job seekers!

₹ Not Disclosed

Hitachi

Hitachi Vantara, a wholly owned subsidiary of Hitachi, Ltd., helps data-driven leaders use the value in their data to innovate intelligently and reach outcomes that matter for business and society - what we call a double bottom line. Only Hitachi Vantara combines 100+ years of experience in operatio...