Job Description

Our data science consulting services are growing rapidly, and we are looking to expand our team of both full-time and part-time contract data engineers. As a Data Engineer at District Data Labs, you would be working either remotely or on-site at a client's location designing, building, and implementing data management systems to support data science and machine learning efforts. You would have the flexibility to determine which assignments you would want to accept. Some assignments may require travel - either regionally, nationally, or internationally.


As a District Data Labs Data Engineer, you would:

  • Investigate, change, and modernize existing data systems, and build new ones, if necessary.

  • Design, build, and maintain data pipelines that transform data into usable formats.

  • Play a key role in the selection of backend database technologies (SQL, NoSQL, etc),

  • Assist with data discovery, collection, ingestion, and wrangling.

  • Write extract, transform, and load (ETL) procedures to automate data collection and reporting processes.

  • Design, develop, and support data warehouses, dashboards, and reporting tools.

  • Identify and resolve issues to ensure the quality and consistency of data.

  • Communicate data processes and insights through visualization and presentations.

  • Collaborate with data scientists, other data engineers, and project stakeholders to ensure data infrastructure meets requirements. 

  • Communicate and coordinate effectively with clients, project managers, data scientists, and other team members.

Skills and Qualifications

The ideal candidate for this position would possess:

  • At least 4 years of industry experience in a data engineering or similar role.

  • Bachelor's Degree or higher in a technical/quantitative discipline such as:

    • Computer Science

    • Engineering

    • Information Systems

    • Statistics

    • Mathematics

    • Finance

    • Hard or Soft Sciences

  • Demonstrated proficiency in several of the following:

    • Python programming

    • Working with different operating systems (Unix/Linux, MacOS, Windows).

    • Processing Big Data using Python/PySpark, R, Hive, SQL, shell scripting, etc.

    • Deploying resources across local, remote and large-scale distributed computing platforms.

    • Creating and extracting data from APIs.

    • Familiarity with different cloud computing environments (Amazon AWS, Google Cloud, etc.).

    • Experience with data pipeline and workflow management tools.

    • Familiarity with BI and data visualization in open source environments like Dash/Plotly or with commercial tools like Qlik, Tableau, etc.

  • Proven ability to think strategically and solve problems creatively.

  • Outstanding verbal and written communication skills.

  • Ability to assume ownership of tasks, manage workload, and meet deadlines.

About District DAta Labs

District Data Labs is a data science consulting and corporate training firm. Through our corporate training and consulting services, we help companies make their operations more data-driven and enhance the analytical capabilities of their workforce. In addition to our commercial activities, we also operate a data science research lab and open source collaborative where people from diverse backgrounds come together to work on interesting projects, push themselves beyond their current capabilities, and help each other become more successful data scientists.

Apply for This Position

Please fill out the application form below and we will get back to you soon. 

Name *