Day-to-day responsibilities:
As a Data Engineer, you will be responsible for the following activities:
Work closely with the development teams to develop and maintain scalable data platform architecture and scalable data pipelines across our working environment
Utilize the latest technologies to ensure ease of data integration and reusability across teams
Display exceptional skills in both data science and big data engineering
Skills and Experience
Demonstrates thorough abilities and/or a proven record of success as a team leader including:
Ideally 6 to 9 years of relevant experience
Bachelor's degree in Computer Science, Engineering, or another technical discipline (BE, BTech, MCA)
Develops and maintains scalable data pipelines and builds out new API integrations to support continuing increases in data volume and complexity
Collaborates with analytics and business teams to improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization
Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it
Designs data integrations and data quality framework
Proven expertise in and extensive knowledge of the Python programming language
Designing and implementing microservice-based applications using Python
Experience with PostgreSQL preferred
Experience with tools such as Node.js and React highly preferred
Computer vision experience is a plus
Experience with big data machine learning toolkits such as SparkML, messaging systems (Kafka), and NoSQL databases (Cassandra, HBase, MongoDB)
Cloud computing platforms such as AWS, GCP and Azure
Building data lakes and performing the data analysis required to troubleshoot and help resolve data-related issues
Handling large-scale structured and unstructured data from internal and third-party sources
Architecting highly scalable distributed data pipelines using open-source tools and big data technologies such as Hadoop, Pig, Hive, Presto, Spark, Drill, Sqoop, and ETL frameworks
Utilizing Linux shell scripting and containerization technologies (Docker, Kubernetes)
Other Skills
Documenting systems, refining requirements, independently identifying solutions, and communicating them to the team
Demonstrating a desire to keep learning, maintain one's skill set, stay up to date, and expand one's knowledge across the full stack
Demonstrating a desire to improve the status quo, especially by automating and improving software development and operations processes to achieve much higher delivery velocity and operations quality
Contributing to thought leadership through participation in the development of technology processes
Applying continuous independent judgement while collaborating with others, and influencing others within the project and domain teams
Building and leveraging relationships, and demonstrating specialist-level verbal and written communication skills
Preferred Certifications (at least two certifications are preferred):
CCP Data Engineer Exam (DE575)
Certified Professional in Python Programming Level 1 or 2
Key skills: Cloud computing, Data analysis, Linux, Manager Technology, Programming, Data quality, Process quality, Big data, Open source, Python