Job Details

ID #40723761
State Texas
City Houston
Job type Permanent
Salary USD TBD
Source Modis
Showed 2022-05-12
Date 2022-05-11
Deadline 2022-07-10
Category Architect/engineer/CAD

Data Pipeline Engineer

Houston, Texas 77030, USA

Vacancy expired!

A Data Pipeline Engineer job is available through Modis. This is a hybrid position and requires the candidate to be local to the Houston area. Key requirements for this position include delivering data pipeline solutions, experience working with large data sets using SQL, Python, or R, and working with Jenkins X for containerization and orchestration.

This position is not available to candidates who require sponsorship.

This position is not available for C2C arrangements. Please be respectful of this notation.

Job Summary/Description

Project goal: Assist in the initial design and build of the foundational data pipeline infrastructure for the Texas All Payor Claims Database (APCD) project.

Job Duties/Responsibilities/Functions (including but not limited to)

Project Background: The Center for Health Care Data (CHCD), pursuant to H.B. 2090 passed by the Texas Legislature in 2021, is in the process of designing and building the foundational components of the Texas All Payor Claims Database. The project involves the ongoing collection of administrative claims and other health-related data from certain payors operating in the state of Texas. More details about the Texas APCD and its goals can be found at https://sph.uth.edu/research/centers/chcd/apcd/.

Specific deliverables:
  • Provide expert advice on orchestration technology selection, given the broader context of the Texas All Payor Claims Database (APCD).
  • Install/configure prototype environments as needed to support technology selection.
  • Install/configure development and production environments after technology selection.
  • Develop guidelines for data pipeline operations, including code build/deploy and monitoring/troubleshooting/recovery.

    EDUCATION:
  • Bachelor's Degree in Statistics, Mathematics, Data Science, Computer Science, Engineering, or a related field preferred; or an equivalent combination of experience and education.

    EXPERIENCE:
  • 3 years of experience installing, configuring, and deploying data pipeline solutions using tools like Airflow, Prefect, and Dagster.
  • 5 years of experience working with large data sets using SQL, Python, or R, along with Python packages like Pandas, Dask, and PySpark.
  • 3 years of experience with code hosting platforms for version control and collaboration such as GitHub.
  • 3 years of experience with CI/CD tools, including Jenkins X.
  • 3 years of experience with container tools such as Docker, Docker Compose, Kubernetes, and associated tools.
  • Strong analytic and design skills
  • Strong verbal and written communication skills
  • Ability to collaborate in cross-functional teams

    If you are interested in this Data Pipeline Engineer job, please click APPLY. For other opportunities available at Modis, go to www.modis.com. If you have questions about the position or would like more information, please contact David Baio by email.

    Equal Opportunity Employer Minorities/Women/Veterans/Disabled

    To read our Candidate Privacy Information Statement, which explains how we will use your information, please visit https://www.modis.com/en-candidate-privacy/

    The Company will consider qualified applicants with arrest and conviction records.
