Data Engineer @Auerbach Grayson
Software Development
Salary unspecified
Remote Location
πŸ‡ΊπŸ‡Έ USA Only
Job Type full-time
Posted 2d ago

[Hiring] Data Engineer @Auerbach Grayson

2d ago - Auerbach Grayson is hiring a remote Data Engineer. πŸ’Έ Salary: unspecified πŸ“Location: USA

Role Description

We're looking for a Data Engineer with a strong foundation in data pipelines and a meaningful edge in AI-native data infrastructure, specifically RAG pipelines, vector search, embedding workflows, and semantic retrieval systems. You'll work on two interconnected problem sets:

  • Consolidating eight legacy systems into a unified, reliable data platform: ETL pipelines, a data warehouse, and cross-system client identity resolution.
  • Transforming three decades of institutional research into an intelligent, searchable, interactable knowledge layer that clients can query in ways that weren't possible two years ago.

This is a small, senior team. You'll work directly with the CTO, have real architectural ownership, and build systems that are in production.

Qualifications

  • Strong foundation in data pipelines.
  • Experience with AI-native data infrastructure.
  • Familiarity with RAG pipelines, vector search, embedding workflows, and semantic retrieval systems.

Requirements

  • Lead the data engineering work for our research portal migration β€” extracting, transforming, and loading data from legacy systems into modern cloud infrastructure.
  • Build and maintain ETL/ELT pipelines across multiple integration points: CRM, research distribution platforms, trading systems, and third-party APIs.
  • Design and implement our β€œGolden Record” initiative β€” cross-system client identity resolution across eight legacy databases with no unified identifiers.
  • Implement event-driven data flows using AWS EventBridge as the central routing layer, treating each source system as a swappable adapter.
  • Design and build production-grade RAG (Retrieval-Augmented Generation) pipelines over AGCO's research archive β€” ingestion, chunking strategy, embedding generation, vector storage, and retrieval.
  • Implement hybrid search approaches that combine semantic (vector) search with keyword and metadata filtering, appropriate for structured financial research queries.
  • Build and maintain embedding pipelines that keep the vector store current as new research is published, with full observability and freshness guarantees.
  • Evaluate and implement emerging retrieval strategies as the space evolves: Re-ranking with cross-encoders; Hypothetical Document Embeddings (HyDE); Query expansion and decomposition; Graph-based retrieval (e.g., GraphRAG) for analyst relationship mapping; Structured metadata retrieval for faceted financial queries; Wire retrieval layers into LLM interfaces for research summarization, analyst Q&A, and recommendation-change tracking across the archive.
  • Apply DataOps practices across all pipelines: version control, CI/CD, environment parity across dev/staging/production, and infrastructure as code.
  • Monitor pipeline health, embedding freshness, retrieval quality, and LLM call latency β€” build alerting that catches problems before users do.
  • Work within our AWS environment (App Runner, EventBridge, CDK) and contribute to IaC best practices.
  • Partner with the CTO, product team, and application developers to translate business requirements into sound data and retrieval architecture decisions.
  • Document data flows, schema designs, chunking strategies, and retrieval logic so systems are maintainable and not a black box.
  • Contribute to evaluation frameworks for retrieval quality β€” precision, recall, answer faithfulness β€” so we know when the system is actually working.

Company Description

Before You Apply
️
πŸ‡ΊπŸ‡Έ Be aware of the location restriction for this remote position: USA Only
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Data Engineer @Auerbach Grayson
Software Development
Salary unspecified
Remote Location
πŸ‡ΊπŸ‡Έ USA Only
Job Type full-time
Posted 2d ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
️
πŸ‡ΊπŸ‡Έ Be aware of the location restriction for this remote position: USA Only
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 152,720+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later