Sayari

Science for decision making.

Principal Data Engineer

Data EngineerData EngineerFull TimeRemoteTeam 1-10H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

57 days ago

Salary

$200K - $220K / year

Bachelor Degree8 yrs expEnglishAirflowApacheCassandraCloudElastic SearchSpark

Job Description

• Design and implement complex Spark data logic, focusing on performance optimization, data volume tuning, and robust execution. • Own the architectural design of graph build pipelines, ensuring they are scalable, automated, and highly resilient. • Plan and oversee the strategic re-architecture of data pipelines to meet evolving business needs and scale. • Optimize infrastructure-as-code and schema designs to reduce cloud costs and improve pipeline latency. • Act as a technical consultant for the team, fostering a collaborative and engineer-led approach to design decisions. • Support the development of the engineering team through code reviews, design docs, and architectural best practices. • Ensure the accuracy of mission-critical data outputs.

Job Requirements

  • 8+ years of experience in the big data space, with a proven track record of implementing large-scale features and leading process redesigns.
  • Expert-level mastery of Apache Spark for large-scale data processing.
  • Strong experience with orchestration tools (Airflow) and cloud computing environments.
  • Hands-on experience architecting and managing data flows into databases such as Elasticsearch, Memgraph, and Cassandra.
  • Demonstrated ability in system architecture, including Infrastructure as Code (IaC) and schema design.
  • A "builder" mindset with experience evolving and improving existing architectures to meet new scale requirements.

Benefits

  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for revenue roles and quarterly bonuses for non-revenue positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities

Related Categories

Related Job Pages

More Data Engineer Jobs

Data Engineer

Spear AI

Artificial Intelligence & Machine Learning for National Security

Data Engineer57 days ago
Full TimeRemoteTeam 11-50Since 2020

We’re seeking a skilled Data Engineer to build the next-generation data management and artificial intelligence platform for maritime domain awareness. What you’ll do: Implement real-time data pipelines with MQTT and Redpanda for stream processing. Implement offline data pipelines...

PythonRustPostgreSQLApache IcebergApache ParquetAmazon S3MQTTRedpandaApache KafkaDagsterApache AirflowProtocol Bufferstime-series data processingbinary message parsingOLTPOLAPdistributed systemsstreaming architecturesbatch processing
United States
Data Engineer57 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

Senior Data Engineer delivering data science services and streaming data pipelines

AirflowAWSCloudKafkaPythonSQLTerraform
United States
$215K - $300K / year

Senior Data Engineer

Tekmetric

Simplify Your Life. Supercharge Your Shop.

Data Engineer57 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Senior Data Engineer responsible for designing data infrastructure for Tekmetric

AirflowApacheETLJavaPythonScalaSparkSQLTableau
United States

Senior Data Engineer – Analytics

Machinify, Inc.

Bending the healthcare cost curve with AI.

Data Engineer57 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Data Engineer transforming raw external data into powerful datasets at Machinify.

AirflowAWSCloudKafkaPythonSparkSQL
California