Data Engineer – Remote

  • Location: Philadelphia, Pennsylvania
  • Type: Direct Hire
  • Job #158035
This is a full-time direct hire position. Sorry no sponsorship offered.

What you will do
• Work with internal stakeholders to load data into client's data warehouse
• Troubleshoot and resolve issues relating to data integrity
• Help establish procedures and best practices for transforming and storing dataLead requirements gathering around data pipeline automation improvements
• Work with some of the most exciting open-source tools like Spark, Hadoop, Docker, Airflow, Zeppelin
• Leverage distributed computing and serverless architecture such as AWS EMR & AWS Lambda, to develop pipelines for transforming data
• Enjoy the peace that comes with working in a mature software development environment
• Marvel at the speed with which your creation makes it into production
• Research and implement new technologies with a team of developers to execute strategies and implement solutions
• Produce peer reviewed quality software
• Solve complex problems related to the real-time discovery of large data

About you
• Experienced in writing scalable applications on distributed architectures
• Data driven, testing and measuring as much as you can
• Eager to both review peer code and have your code reviewed
• Comfortable on the command line and consider it an essential tool
• Confident in SQL, you know it, write smart queries, it’s no big deal

Required skills and experience
• 5+ years of work experience
• 3+ years of experience with Python and Scala
• 3+ years of experience with PySpark and Spark-SQL (writing, testing, debugging spark routines)
• 1+ years of experience with AWS EMR, AWS S3 service.
• Comfortable using AWS CLI and boto3
• Comfortable working in remote environments Comfortable using *nix command line (shell scripting, AWK, SED)
• Experience with MySQL and Postgres

Bonus experience
• Experience with Apache Airflow
• Experience with Apache Zeppelin
• Experience with healthcare data

 

Attach a resume file. Accepted file types are DOC, DOCX, PDF, HTML, and TXT.

We are uploading your application. It may take a few moments to read your resume. Please wait!

Back to Top