Sayari
Science for decision making.
Principal Data Engineer
Location
United States
Posted
57 days ago
Salary
$200K - $220K / year
Bachelor Degree, 8 yrs exp, English, Airflow, Apache, Cassandra, Cloud, Elastic Search, Spark
Job Description
• Design and implement complex Spark data logic, focusing on performance optimization, data volume tuning, and robust execution.
• Own the architectural design of graph build pipelines, ensuring they are scalable, automated, and highly resilient.
• Plan and oversee the strategic re-architecture of data pipelines to meet evolving business needs and scale.
• Optimize infrastructure-as-code and schema designs to reduce cloud costs and improve pipeline latency.
• Act as a technical consultant for the team, fostering a collaborative and engineer-led approach to design decisions.
• Support the development of the engineering team through code reviews, design docs, and architectural best practices.
• Ensure the accuracy of mission-critical data outputs.
Job Requirements
- 8+ years of experience in the big data space, with a proven track record of implementing large-scale features and leading process redesigns.
- Expert-level mastery of Apache Spark for large-scale data processing.
- Strong experience with orchestration tools (Airflow) and cloud computing environments.
- Hands-on experience architecting and managing data flows into databases such as Elasticsearch, Memgraph, and Cassandra.
- Demonstrated ability in system architecture, including Infrastructure as Code (IaC) and schema design.
- A "builder" mindset with experience evolving and improving existing architectures to meet new scale requirements.
Benefits
- 100% fully paid medical, vision, and dental for employees and their dependents
- Generous time off; we observe all US federal holidays and close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
- Outstanding compensation package; competitive commissions for revenue roles and quarterly bonuses for non-revenue positions
- A strong commitment to diversity, equity, and inclusion
- Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage), and parental leave
- A collaborative and positive culture - your team will be as smart and driven as you
- Limitless growth and learning opportunities
More Data Engineer Jobs
Data Engineer
57 days ago
Full Time, Remote, Team 11-50, Since 2020
We’re seeking a skilled Data Engineer to build the next-generation data management and artificial intelligence platform for maritime domain awareness. What you’ll do: Implement real-time data pipelines with MQTT and Redpanda for stream processing. Implement offline data pipelines...
Python, Rust, PostgreSQL, Apache Iceberg, Apache Parquet, Amazon S3, MQTT, Redpanda, Apache Kafka, Dagster, Apache Airflow, Protocol Buffers, time-series data processing, binary message parsing, OLTP, OLAP, distributed systems, streaming architectures, batch processing
United States
Data Engineer
57 days ago
Full Time, Remote, Team 201-500, H1B No Sponsor
Senior Data Engineer delivering data science services and streaming data pipelines
Airflow, AWS, Cloud, Kafka, Python, SQL, Terraform
Data Engineer
57 days ago
Full Time, Remote, Team 51-200, H1B No Sponsor
Senior Data Engineer responsible for designing data infrastructure for Tekmetric
Airflow, Apache, ETL, Java, Python, Scala, Spark, SQL, Tableau
United States
Data Engineer
57 days ago
Full Time, Remote, Team 51-200, H1B No Sponsor
Data Engineer transforming raw external data into powerful datasets at Machinify.
Airflow, AWS, Cloud, Kafka, Python, Spark, SQL
California