Friday, 29 January 2021

Data Engineer - Python, PySpark, Teradata

Location: Remote
Experience: 12+ Years
Skills: Python, PySpark, Teradata, Azure

About the Data Engineer role:
We are looking for a savvy Data Engineer to join our growing team of analytics experts. The hire will be responsible for expanding and optimizing our data and data-pipeline architecture, as well as optimizing data flow and collection for cross-functional teams. The Data Engineer will support our software developers, database architects, data analysts, and data scientists on data initiatives, and will ensure our data delivery architecture remains consistent across ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems, and products.

Responsibilities
• Communicate progress across organizations and levels, from individual contributor to senior executive. Identify and clarify the critical issues that need action, and drive appropriate decisions and actions. Communicate results clearly and in actionable form.
• Lead development and ongoing maintenance and enhancement of applications running on Azure Cloud and business intelligence tools.
• Produce detailed technical designs, conduct analysis, and develop applications and proofs of concept
• Develop microservices, application code, and configuration to deliver applications
• Provide technical leadership for the development and BI teams to deliver on various initiatives
• Lead problem-resolution tasks and document approaches for support mechanisms
• Ensure all solutions meet Enterprise Guidelines and industry standards/best practices
• Advise IT and business stakeholders on alternative solutions
• Ensure optimal system performance across BI and Analytics platforms
• Lead the effort to monitor system activity, tune performance, and architect solutions to meet future demand
• Offer technical guidance to team members and lead design/requirements sessions
• Benchmark systems, analyze system bottlenecks, and propose solutions to eliminate them
• Articulate the pros and cons of various technologies and platforms, and document use cases, solutions, and recommendations
• Troubleshoot complex system issues and handle multiple tasks simultaneously
 
Experience 
• Bachelor's or Master's degree in Computer Science, Mathematics, or Statistics
• 4+ years of development experience building Spark applications with Python and PySpark
• 3+ years of hands-on experience developing optimized, complex SQL queries and writing PL/SQL code across large volumes of data in relational and multi-dimensional data sources such as Teradata, Hive, Impala, and Oracle
• Experience developing and deploying applications on Azure
• Experience working with disparate datasets in multiple formats such as JSON, Avro, text files, Kafka queues, and log data, and with storage such as Blob Storage/ADLS Gen2 (see the sketch after this list)
• 2+ years of strong ETL experience with tools such as Informatica, Ab Initio, Talend, DataStage, or Syncsort
• Knowledge of software design and programming principles
• Experience working in a Scrum/Agile framework and using DevOps practices to deploy and manage code
• Good communication and teamwork skills
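
For context on the PySpark and ADLS Gen2 items above, here is a minimal sketch of the kind of pipeline work this role involves. It is an illustration only: the storage account ("myaccount"), containers ("raw", "curated"), paths, and the "event_type" column are hypothetical placeholders, not details from this posting.

    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("adls-json-sketch").getOrCreate()

    # Hypothetical credentials: the account name and key are placeholders.
    spark.conf.set(
        "fs.azure.account.key.myaccount.dfs.core.windows.net",
        "<storage-account-key>",
    )

    # Read JSON event data from ADLS Gen2 via the ABFS driver (abfss://).
    events = spark.read.json("abfss://raw@myaccount.dfs.core.windows.net/events/")

    # Aggregate: count events per type (assumes an "event_type" column).
    counts = events.groupBy("event_type").agg(F.count("*").alias("n"))

    # Write the result back to a curated zone as Parquet.
    counts.write.mode("overwrite").parquet(
        "abfss://curated@myaccount.dfs.core.windows.net/event_counts/"
    )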

Murthy Chavali
murthy@vedainfo.com
310-589-4458
