Data Engineering Associate

Overview of Dept/Division:

The Data & Analytics team is seeking students in computer science or engineering programs to serve as Data Engineering Associates in our intern program. Our work combines algorithmic database coding with close collaboration with colleagues in transportation operations and support functions to create tools that provide deeper insights into our performance. The Data Engineering group designs, builds, tests, and delivers end-to-end, automated data pipelines over complex on-premises and off-premises platforms. We extract data from numerous systems and transform structured, semi-structured, and unstructured data into consistent, reliable, and usable datasets, which in turn are used to create reports, dashboards, and other decision-making tools that address the agency's most pressing and persistent challenges.

 

Project Responsibilities:

The key functions of the Data Engineering Associate are as follows:
- Carry out tasks to operationalize data pipelines, data warehouses, data marts, multi-dimensional cubes, and data lakes to collect, structure, and integrate data sources for analysis and consumption
- Write automated test scripts to monitor and report on data quality, validity, accuracy, and usability
- Conduct root cause analyses in response to issues and implement cost-effective resolutions for data anomalies
- Document data content and flows for use by teammates and other users
- Other tasks as assigned to problem-solve and achieve the goals of the team

 

Qualifications:

All candidates must have:
- Strong coding skills in Python, SQL, or R, and in database design
- Experience in documenting data processes and performing data quality checks
- Understanding of analytical methods (e.g., probability and statistics, algorithm design), advanced abilities in Excel, and experience with business intelligence tools (e.g., Power BI)
- Familiarity with key performance indicators (KPIs) and the algorithms used to calculate them

It is preferable for candidates to have:
- Strong written communication skills
- Familiarity with transit/transportation systems, particularly the MTA system