Data Scientist
Metriport
📍San Francisco, California, US
Posted 3d ago · via ashby
Job Description
Data Scientist
San Francisco, CA
On-site
About us
Metriport is an open-source data intelligence platform that helps healthcare organizations access and exchange patient data in real time. We integrate with all major US healthcare IT systems and tap into comprehensive medical data for 300+ million individuals.
We've found product-market fit with multi-million ARR, 100+ customers (including Amazon, Color, and Strive Health), backing from top VCs, and years of runway. We're ready to scale. We're a tight-knit, high-performing team of mostly former founders (including two YC alumni). We're engineering-heavy, operate with minimal bureaucracy and high autonomy, and hire based on competence, not prestige. We push hard—founders work six days a week from our SF office—but give everyone freedom to craft their schedule. We measure output and we're committed to sustainable intensity.
About You
We're looking for a data scientist who thrives in ambiguity and cares as much about getting the fundamentals right as building the next model. At Metriport, you won't be handed a clean dataset and a well-scoped problem — you'll be building the data foundation alongside our engineering team while applying science to some of the messiest, most impactful data in healthcare.
You're entrepreneurial-minded, with an Olympian-level work ethic.
You are obsessed with data integrity. If a metric is off by 1%, it keeps you up at night until you find the root cause.
You believe that high-quality clinical data is the bedrock of excellent healthcare, and you're excited to work at the intersection of ML and patient records.
You have a strong sense of ownership and the ability to lead cross-functional initiatives with minimal direction.
You care about impact over sophistication. You'd rather ship a logistic regression that changes a workflow than a transformer that lives in a notebook.
You're excited to go wide. In a small team, the best data scientist is also a great analyst — and you see that as a feature, not a compromise.
You're a hacker at heart, and you're comfortable writing production-grade code to get the job done.
What You'll Be Doing
After quickly ramping up on our clinical data domain, your goal is to be the person who owns clinical intelligence within Metriport's stack — turning massive volumes of clinical records into predictions, insights, and automated decisions. Day to day, this looks like:
Applying AI/ML to Clinical Data: Building and deploying models to predict patient outcomes, identify gaps in care, or surface anomalies across our data warehouse.
Normalizing Clinical Data at Scale: Using NLP, LLMs, or rule-based systems to transform messy, unstructured clinical records into structured, searchable, trustworthy data.
Owning Analytics When It Matters: You'll share ownership of our analytics stack and data quality alongside the team. When a customer needs an accurate report or the team needs a reliable metric, you're just as accountable as anyone.
Productizing Intelligence: Designing and shipping data-science-powered features as core parts of the Metriport platform — not just internal experiments, but things customers use.
Building the Data Foundation: Contributing to data modeling, warehouse design, and tooling (dbt, DWHs, PostHog, etc.) as we scale our data infrastructure. Science without solid foundations is noise.
Team Alignment: Participating in a daily 30-minute remote standup at 7:30 AM PT, Mon–Fri (our only regular mandatory meeting).
Requirements
4+ years of experience in a data science role, ideally at a high-growth or startup company where you wore multiple hats.
SQL mastery: You can write complex, performant queries in your sleep.
ML/statistical modeling: Practical experience building and deploying models (classification, regression, clustering, NLP) — not just prototyping.
Coding proficiency: Strong in Python (pandas, scikit-learn, and ideally some experience with LLM APIs or frameworks). TypeScript proficiency is a plus — our stack is TypeScript-heavy.
Analytical chops: You're comfortable owning dashboards, data quality, and ad-hoc analysis. You see this as part of the job, not beneath it.
Location: San Francisco / Bay Area (or willing to relocate).
Nice to Have
Experience with healthcare data is strongly preferred — FHIR, HL7, or clinical data. Understanding how a patient moves through the healthcare system is the core of what we do.
Experience with data modeling tools (dbt or similar) and product analytics platforms (PostHog, Mixpanel, Amplitude).
Experience integrating models into backend services or APIs (not just notebooks).
Benefits
Competitive equity + compensation package 🚀
Full family Platinum health insurance, dental, and vision coverage 🦷
401(k) retirement plan + matching 💰
Flexible work from home or in-office 🏢
Healthy lunches are complimentary when working in-office (and breakfast + dinners as needed) 🍏
Quarterly company off-sites with the team ⛷️
MacBook provided by us 💻
Unlimited PTO (we work hard, but trust you to take the time you need to be at your best) 🧘‍♂️
Our tech
Our data lives in PostgreSQL, DynamoDB, S3, Snowflake, and a FHIR server. We use dbt for transformations and PostHog for product analytics. Our infrastructure is managed via AWS CDK, and our core platform is written in TypeScript and Python. We are looking for a generalist who can jump into any part of this stack to extract value.
Metriport provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability, genetics, sexual orientation, gender identity, or gender expression. We are committed to a diverse and inclusive workforce and welcome people from all backgrounds, experiences, perspectives, and abilities.
Details
- Department: Engineering
- Work Type: On-site
- Locations: San Francisco, California, US
- Salary: $160K - $190K
- Posted: April 12, 2026
- Source: Ashby