Lead Data Engineer Job at WorkHQ, Los Angeles, CA

WlR5SWtic3BoZHpzZlR2Z0FudEtCU1hSZ3c9PQ==
  • WorkHQ
  • Los Angeles, CA

Job Description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a US-only, Remote role (Mainland).

Role Overview

Lead data infrastructure architect managing billions of data points across 250M+ professional profiles.

Hire data engineers to aid you in that journey.

Core Responsibilities

  • Design scalable data pipelines processing massive record volumes

  • Architect ETL processes using PySpark on Amazon EMR (Open to shifting to other solutions like Data Bricks / Snowflake)

  • Distribute enriched data through medallion architecture across Postgres, Athena, OpenSearch

  • Integrate new data sources into the main pipeline

  • Implement advanced data matching using Splink

Technical Requirements

  • 5-8 years professional data engineering experience

  • Good proficiency in:

    • PySpark and distributed computing

    • AWS data services (EMR, Glue, Athena)

    • Docker

    • Pandas and DataFrame manipulation

    • Complex data format handling (JSONL, Parquet)

  • Strong background in:

    • Big data processing architectures

    • Data warehouse design

    • Performance optimization

  • Advanced Python, SQL skills

Nice to Have

  • Probabilistic record linking expertise

  • OpenSearch/elasticsearch technologies

  • Machine learning data pipeline design

  • Recruitment tech ecosystem knowledge

Technical Stack

  • Big Data: PySpark, EMR

  • Databases: Postgres, OpenSearch

  • Cloud: AWS

  • Containerization: Docker

  • Data Formats: JSONL, Parquet

  • Analytics: Metabase, Athena, Glue

  • Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements - if you lack a few technical skills, but motivated to learn and lead the platform, please apply for consideration.

If you are coming from Director/Head of/VP levels that is relevant to this job, you can apply as well.

You will need to apply directly on our platform.

Thank you for your time.

Job Tags

Permanent employment, Remote work, Shift work,

Similar Jobs

Two95 International Inc.

Ideation Lead - Data Mining Job at Two95 International Inc.

 ...Requirements ~5+ years experience in the health care payer industry (Medicare, Medicaid, and/or Commercial)~2+ years experience in Data Mining ideation and research of new concept development ~ Maintains working knowledge of CMS transmittals, RAC and OIG reports ~... 

Mc3 Partners

Cyber Security Engineer Job at Mc3 Partners

 ...Job Description: Mc3 Partners is seeking a Cyber Security Engineer to support a federal program focused on securing enterprise systems, networks, and cloud environments. This is a contingent role , pending contract award. Candidates with active TS/SCI clearances... 

Allied Momentum Trucking

Class A Truck Driver Job Job at Allied Momentum Trucking

Class A Truck Driver JobWe are a small family owned trucking business based in Colorado. We are looking for someone to drive our VOLVO I-SHIFT w/ 53 FT TRAILER. We carry all goods for many companies all OTR. We do not carry Hazmat loads, and we are not tanker endorsed... 

Sparkbit 360

Communications Agent - Entry Level Job at Sparkbit 360

 ...deserves to be seen. As a full-service marketing and public relations agency, were dedicated...  ...in-person interactions not through social media or digital channels. Job Overview...  ...Agent to join our Charlotte team. This entry-level position focuses on direct client and customer... 

Oceaneering International

Textile Chemical Engineer - Space Systems Job at Oceaneering International

 ...Company Profile Oceaneering Space Systems (OSS) develops, integrates, and applies new and innovative technologies to meet the challenges of working in space and other harsh environments. We are ideally positioned to meet the growing needs of NASA and the expanding commercial...