Data Engineer

    • Job Tracking ID: 512891-608605
    • Job Location: Dallas, TX
    • Date Updated: May 21, 2018
    • Starting Date: ASAP
    • Other Job Locations: Dallas TX
    • Job Level: Junior, Mid-level
    • Job Type: Full-Time

Job Description:

Position Overview

Are you a data technologist who enjoys the challenge of solving complex problems that matter? Did you push the envelope and learn AWS on your own time, all while knowing deep down that there was a better way to handle ETL? Does the thought of data streams get you out of bed in the morning? If so, then read on!

What’s in it for you?

Taking traditional ETL and turning it on its head. Working on the latest and coolest stuff (PostgreSQL, Apache Spark, Redshift, AWS S3), all while making a difference in people’s health. Working in a place where you are not just another cog in the machine, where your talent and intellect aren’t just put to use re-executing someone else’s old ideas, but where you’re challenged to propose bold new ways forward that create the future we’ll be living in tomorrow. You’ll be surrounded by others like yourself who can carry their weight and are just as capable as you are. And when all is said and done, you may even have a little fun along the way.

What’s expected of you?

Designing robust and forward-thinking data flows that will empower the business to succeed for many years will be your priority. You should be able to slice and dice data in your sleep, all while making it useful to multiple facets of the company. Your approach to solutions should be scalable, performant, and maintainable while continuing to push the envelope on new technologies. We want you to bring a diverse skill set to the table and know how to apply it to solve some tough business and technical problems. We expect you to take ownership of both your daily and project work, and to take pride in the awesome work that you’ll be doing.

Responsibilities

  • Design and develop code/processes to load and transform source data from various formats (like relational, unstructured, semi-structured) into a form suitable for ingestion.
  • Provide statistical information/analysis around these processes such as data volumes, hit/miss match percentages, and other related metrics.
  • Work with business users during data requirements phase for client integration.
  • Assist in migrating older ETL processes to newer technologies.

Experience and Skills:

Qualifications

  • Bachelor's degree in Computer Science, Information Systems, Mathematics, Business Administration, or a related field.
  • 4+ years of experience with business intelligence concepts working with ETL products (e.g., Informatica, Talend, SSIS).
  • 2+ years of experience with Apache Spark.
  • Familiarity with Scala.
  • Expert working knowledge and experience with extract, transform, load (ETL)/data integration and development of data strategies.
  • Firm understanding of Amazon Simple Storage Service (S3).
  • Strong background in developing complex SQL queries.
  • Expertise in performance-tuning in both SQL and BI tools.
  • Familiarity with the workings of networks, servers, AWS infrastructure, and cloud services.
  • High level of intellectual and technological curiosity. Learning new technologies, patterns, and methods should be exciting.
  • Excellent problem-solving and analytical skills with a high level of accuracy.
  • Must be able to develop efficient and effective program and system solutions to highly complex business problems.

Optional/Preferred Qualifications

  • Health industry experience, including coding systems, a big plus.
  • Familiarity with HIPAA regulations.
  • Experience working with streaming data ingestion services such as Amazon Kinesis, Firehose, Kafka, or Flume.
  • Understanding of data warehousing concepts and design, including 3NF and star schemas.
  • Familiarity with Amazon Redshift.
  • Experience with BI reporting tools such as Tableau, Alteryx, SSRS.

Benefits

HealthMine offers competitive base salaries, full benefits (medical, dental, and vision), paid short-term and long-term disability benefits, a generous matching 401(k) plan, flexible work hours, and EQUITY in the company.