Database Administrator - Data Warehouse (DW), 3+ years
REMOTE
Desired Start Date: 4/23/2024
Duration: 3 Months
Work hours: 40
General Information
Job Description: *PLEASE NOTE THIS POSITION WILL ALLOW CONSULTANT TO WORK REMOTELY. (VERY HIGH EXPECTATION THAT CONSULTANT WILL BE PROFESSIONAL, RELIABLE, REACHABLE AND ABLE TO WORK PRODUCTIVELY WHILE REMOTE). WILL COME ONSITE ONLY AS NEEDED BY THEIR MANAGER/TEAM (AT THEIR OWN EXPENSE).
Background
MTA Data & Analytics is building a data lake and associated pipeline infrastructure to replace a custom, multi-pipeline system. The new system will support expanded and improved (1) datasets and tools for agency operations management, (2) performance evaluation systems for agency management and oversight reporting, and (3) open data platforms for public access to datasets that support oversight group analysis and application development. The data that is or will be included in the system is generated and/or used by virtually all parts of the agency. Types of data range from ridership to on time performance to employee workhours and overtime use to administrative functions.
Some data sets add a million or more records each day and current data infrastructure does not have the capacity to support user needs and future growth in the range and volume of the datasets. New technologies and techniques will expand functionality and enhance the timeliness and responsiveness of tools.
Aim
Our plan requires skills in data engineering, coding in Python and other languages, and report/dashboard development in PowerBI and other data visualization tools. Specific tasks include:
Designing data structures and writing code to collect, combine and transform datasets to meet business needs.
Developing data lake architecture to automate data extraction and transformation of raw data to more complex and calculation-based tables.
Documenting work in a thorough manner consistent with team standards so that it can be easily understood by teammates and future users.
Designing and carrying out testing processes and quality controls on output data for validity, accuracy and usability by the desired audience.
Generating data visualization outputs
Requirements
Skills and experience programming in Python and SQL – 3+ years
Skills and experience in using data lake tools and demonstrated ability to learn new tools quickly
Skills and experience using PowerBI
Ability to clearly document all work (commented code, readme files, diagrams, etc.) so that work is easily transferred back to internal employees
Excellent attention to detail and QC skills to ensure errors are found and corrected before outputs are made available
Good verbal and written communication abilities for internal collaboration
Adhoc SQL 1 - 2 Years
Power BI 1 - 2 Years
Python Scripting 4 - 6 Years
Seniority level
Mid-Senior level
Employment type
Contract
Job function
Information Technology
Industries
Software Development
Referrals increase your chances of interviewing at Steneral Consulting by 2x