Back to Remote jobs  >   AI / ML
Principal AI/ML Architect @Caylent
AI / ML
Salary cad 180,000 - 2..
Remote Location
Job Type full-time
Posted 2d ago

[Hiring] Principal AI/ML Architect @Caylent

2d ago - Caylent is hiring a remote Principal AI/ML Architect. πŸ’Έ Salary: cad 180,000 - 202,000 per year πŸ“Location: Worldwide

Role Description

This is a senior technical client leadership role that blends deep hands-on ML expertise with strategic advisory and consulting skills. You will be the most experienced ML voice across a diverse and expanding book of customer engagements β€” from early-stage companies bringing ambitious ML ideas to market, to established enterprises modernizing how they build and operate AI systems on AWS.

You will shape strategy, influence architecture, and leave every team you touch better than you found it. You bring the scientific depth to design and evaluate models rigorously, the engineering depth to architect production ML systems at scale, and the consulting instincts to translate both into business value for customers.

If you have led the hard conversations, shaped the architecture decisions that mattered, and built the things others benchmark against β€” and you are looking to do that across a growing portfolio of varied and interesting customers β€” this is the role for you.

What You'll Do

  • Lead end-to-end ML assessments across infrastructure, data pipelines, model lifecycle, and organizational readiness β€” producing recommendations that drive executive decision-making and earn Caylent the next engagement.
  • Partner with sales and solutions teams through the proposal and scoping phase, contributing the technical depth needed to shape well-grounded statements of work.
  • Serve as the senior technical authority on client engagements β€” possibly across multiple projects simultaneously β€” providing architectural guidance, ensuring technical quality from your project team members, and getting hands-on when the engagement demands it, without owning day-to-day implementation responsibilities.
  • Own or orchestrate high-quality POCs that give customers confidence before committing to a larger initiative.
  • Advise customers on ML operations standards and architecture β€” covering MLOps pipeline design, model lifecycle management, LLMOps patterns, and production monitoring frameworks β€” translating operational complexity into decisions and guardrails their teams can own and sustain.
  • Shape how Caylent wins its most technically complex opportunities β€” contributing the architectural thinking and credibility that turns prospects into customers.
  • Strengthen the ML practice from the inside β€” through peer guidance, technical interviews, and contributions to accelerators, reference architectures, and thought leadership content.

Qualifications

  • 10+ years in machine learning or AI, with a proven track record of leading client-facing engagements in a consulting or advisory capacity.
  • Deep, current knowledge of the AWS ML and GenAI ecosystem, with the ability to make and defend architectural decisions across the full ML lifecycle β€” from data and feature engineering through training, deployment, and monitoring.
  • Deep expertise in at least two or three ML domains β€” whether traditional ML, computer vision, NLP, time series, or others β€” combined with the judgment to assess, architect, and advise across the broader ML landscape.
  • Proven ability to architect and govern production ML systems end-to-end, translating MLOps, LLMOps, and broader AI operations complexity into standards and decisions that engineering teams can execute and executives can act on.
  • Deep expertise across foundation model adaptation β€” fine-tuning (LoRA, QLoRA, PEFT), alignment (RLHF, DPO), inference optimization (quantization, vLLM), and distributed training (DeepSpeed, FSDP) β€” combined with RAG and agentic system design, including multi-agent architectures, event-driven workflows, MCP integration, and human-in-the-loop patterns on AWS.
  • Proven ability to operate independently in complex customer environments β€” navigating ambiguity, aligning stakeholders, and translating ML tradeoffs into business risk and value for both technical and executive audiences.

Requirements

  • AWS Certified Machine Learning – Specialty and/or AWS Certified Solutions Architect – Professional.
  • Experience shaping practice-level standards, reference architectures, and reusable ML accelerators across multiple engagements.
  • Exposure to varied industries and problem types in a consulting or client-facing context.
  • Deep fluency in responsible AI practices β€” model evaluation, bias detection, fairness frameworks, and AI governance β€” applied in enterprise deployments.
  • Hands-on experience designing and deploying SRE agents and AI-driven operations workflows in production β€” spanning automated incident detection, triage, and remediation β€” with the ability to integrate across observability platforms and translate AI operations outcomes into measurable business value.

Technical Stack

  • ML Domains: Classical ML, Computer Vision, NLP, Generative AI & LLMs, AI Agents & Autonomous Systems, Intelligent Document Processing, Video Understanding, Speech & Audio, Time Series & Forecasting, Recommender Systems, Graph ML, Reinforcement Learning, Multimodal AI
  • AWS ML Platform: SageMaker, SageMaker Pipelines, SageMaker Feature Store, SageMaker Model Registry, SageMaker Clarify, Bedrock (Agents, Knowledge Bases, Guardrails, AgentCore, Model Evaluation)
  • Multi-provider LLM: Bedrock, Anthropic API, OpenAI API, Google Gemini API, Azure OpenAI β€” with the judgment to reason across provider tradeoffs in enterprise contexts
  • AWS AI Services: Rekognition, Comprehend, Transcribe, Textract, Translate, Personalize, Neptune, Kinesis Video Streams, Polly
  • Data Platform: Apache Spark / PySpark, Apache Kafka, Amazon Kinesis, Apache Iceberg, Delta Lake, Apache Hudi, AWS Glue
  • Vector Databases: Pinecone, pgvector, Amazon OpenSearch (vector), Weaviate
  • Frameworks: PyTorch, TensorFlow, JAX, Scikit-learn, XGBoost, HuggingFace (Transformers, PEFT, TRL), LangChain, LlamaIndex, DSPy, Ollama
  • MLOps & Governance: MLflow, W&B, Airflow / MWAA (data orchestration), Dagster (asset-based pipelines), Kubeflow Pipelines, CI/CD, IaC (CloudFormation, CDK, Terraform), Docker, Kubernetes, ML Governance (lineage, data contracts, audit), Responsible AI / Bias & Fairness
  • LLM Evaluation & Safety: RAGAS, LLM-as-judge patterns, DeepEval, NeMo Guardrails, Constitutional AI patterns, structured output validation
  • Inference & Optimization: Triton, vLLM, SGLang, Trainium, Inferentia, Quantization (GPTQ, AWQ, bitsandbytes), SageMaker Neo

Benefits

  • 100% remote work
  • Equitable Life - Hybrid Plan
  • 100% Premium Coverage for the employee and dependents
  • Competitive phantom equity
  • Long-Term Disability
  • 4% Pension match (employer contribution)
  • Unlimited Vacations
  • Sick Leave
  • Paid Holidays
  • Parental Leave
  • Paid for exams and certifications
  • Peer bonus awards
  • State of the art laptop and tools
  • Equipment & Office Stipend
  • Individual professional development plan
  • Annual stipend for Learning and Development
  • Work with an amazing worldwide team and in an incredible corporate culture
Before You Apply
️
worldwide Be aware of the location restriction for this remote position: Worldwide
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Back to Remote jobs  >   AI / ML
Principal AI/ML Architect @Caylent
AI / ML
Salary cad 180,000 - 2..
Remote Location
Job Type full-time
Posted 2d ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
️
worldwide Be aware of the location restriction for this remote position: Worldwide
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 152,720+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later