Lead Data Engineer + AI

Remote Full-time
Client: Altimetrik Takeda
Location: Remote
Minimum 3 years of experience as a Lead required.

About the role
We're looking for a Senior Data Engineer to build and scale our Lakehouse and AI data pipelines on Databricks. You'll design robust ETL/ELT, enable feature engineering for ML/LLM use cases, and drive best practices for reliability, performance, and cost.

What you'll do
• Design, build, and maintain batch/streaming pipelines in Python + PySpark on Databricks (Delta Lake, Auto Loader, Structured Streaming).
• Implement data models (Bronze/Silver/Gold), optimize with partitioning, Z-ORDER, and indexing, and manage reliability (DLT/Jobs, monitoring, alerting).
• Enable ML/AI: feature engineering, MLflow experiment tracking, model registries, and model/feature serving; support RAG pipelines (embeddings, vector stores).
• Establish data quality checks (e.g., Great Expectations), lineage, and governance (Unity Catalog, RBAC).
• Collaborate with Data Science/ML and Product to productionize models and AI workflows; champion CI/CD and IaC.
• Troubleshoot performance and cost issues; mentor engineers and set coding standards.

Must-have qualifications
• 10+ years in data engineering with a track record of production pipelines.
• Expert in Python and PySpark (UDFs, window functions, Spark SQL, Catalyst basics).
• Deep hands-on Databricks experience: Delta Lake, Jobs/Workflows, Structured Streaming, SQL Warehouses; practical tuning and cost optimization.
• Strong SQL and data modeling (dimensional, medallion, CDC).
• ML/AI enablement experience: MLflow, feature stores, model deployment/monitoring; familiarity with LLM workflows (embeddings, vectorization, prompt/response logging).
• Cloud proficiency on AWS/Azure/GCP (object storage, IAM, networking).
• CI/CD (GitHub/GitLab/Azure DevOps), testing (pytest), and observability (logs/metrics).

Nice to have
• Databricks Delta Live Tables, Unity Catalog automation, Model Serving.
• Orchestration (Airflow/Databricks Workflows), messaging (Kafka/Kinesis/Event Hubs).
• Data quality & lineage tools (Great Expectations, OpenLineage).
• Vector DBs (FAISS, pgvector, Pinecone), RAG frameworks (LangChain/LlamaIndex).
• IaC (Terraform), security/compliance (PII handling, data masking).
• Experience interfacing with BI tools (Power BI, Tableau, Databricks SQL).