Principal Data Scientist - Machine Learning Platform

Data Science · Remote, California
Department Data Science
Employment Type Full-Time
Minimum Experience Experienced


We are seeking a Principal Data Scientist - Machine Learning Platform to join our Data Science team to focus on researching and implementing healthcare Deep Learning algorithms at scale to become part of Lumiata’s Machine Learning Platform. This individual will play a critical data science and ML engineering leadership role, by stirring the team to specific experiment and research directions that can take our platform to the next level. We want this individual to lead internally and externally in the industry by contributing to the research and technical communities.

This role would involve a strong understanding of:

  • How AI and machine learning can transform healthcare.
  • How to build and deploy novel machine learning products.
  • How medical information is stored and communicated between different actors in the healthcare system.
  • What modern, open standards have been developed to better communicate and represent medical data.
  • What specific standards must be respected and how to ensure compliance to handle sensitive healthcare data including HIPAA, SOC 2 and HITRUST among others.

As a Principal Data Scientist, strong leadership and executive presence is desired; not only learn and apply the above, but to disseminate and evangelize with members of the team.

Key Responsibilities

  • Participate in cutting edge research in healthcare AI/ML applications.
  • Develop solutions for real world, large scale problems.
  • Drive industry standards and beyond within the DS team
  • Use strong coding chops to drive experiments all the way to production
  • Collaborate very tightly with our engineering team in terms of setting our long term technical strategy that blends engineering and science
  • Mentor the more junior members of the data science organization

Minimum Qualifications

  • Ph.D. in Computer Science or related field, or equivalent industry experience.
  • Experience in Natural Language Understanding, Computer Vision, Machine Learning, Algorithmic Foundations of Optimization, Data Mining or Machine Intelligence (Artificial Intelligence).
  • Programming experience in C, C++, or Python.
  • Contributions to research communities/efforts, including publishing papers in machine learning (JMLR, ICLR, NeurIPS, ICML, ACL, CVPR).

Preferred Qualifications

  • Relevant work experience, including full time industry experience or as a researcher in a lab with Deep Learning on Electronic Health Records or claims data.
  • Experience with Spark/pyspark
  • Experience with Cloud (e.g., AWS, Google Cloud, Azure...etc)
  • Strong publication record and open source contribution.
  • Ability to design and execute on research agenda.

About Lumiata

Lumiata delivers Machine Learning powered health analytics to make healthcare smarter. At the intersection of clinical, operational, and financial functions, Lumiata provides cost and risk analytics to health plans, care providers and employers.  We have over 100+ end-to-end model pipelines to solve use-cases in pricing and underwriting, medical intervention recommendation, hospital readmission risk, and payment integrity.  We are rare in that we have both a strong customer-centric focus, as well as a strong R&D pedigree (for example, we published a paper to appear in the top AI conference AAAI in 2021, which describes how our ML pipelines provide decision support for pricing/underwriting of health insurance plans:

We process TBs of patient data per customer, which we use to train/fine-tune a variety of AI/ML models that are used to solve our customer’s prediction and classification problems. We’ve also built a Big Data / Machine Learning platform for managing PBs of data, as well as providing our data science team capabilities that will allow them to iterate very quickly throughout the ML experimentation lifecycle: data cleansing, feature engineering, training, predict/classify, tune, and repeat. We are driving towards reaching economies of scale via our platform.

Based in Silicon Valley, Lumiata’s team is a diverse, multinational, creative team; join us in building a better medical system for everyone by tackling looming health problems using large scale machine learning. Lumiata is backed by Khosla Ventures, BlueCross BlueShield Venture Fund, Intel Capital, Sandbox Industries and other leaders in healthcare and AI.

Diversity creates a healthier atmosphere: Lumiata is an Equal Employment Opportunity/Affirmative Action employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, national origin, protected veteran status, disability status, sexual orientation, gender identity or expression, marital status, genetic information, or any other characteristic protected by law.

Thank You

Your application was submitted successfully.

  • Location
    Remote, California
  • Department
    Data Science
  • Employment Type
  • Minimum Experience