[Remote] Lead Data Scientist

100% Remote Full-time Open now

Note: This is a remote position open to candidates in the USA. Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications. As a Lead Data Scientist (NLP & Financial Compliance), you will develop NLP and large language model solutions for compliance and surveillance systems, working with data to uncover misconduct and risk while mentoring junior team members.

Responsibilities

  • Collect, analyze, and interpret small and large datasets to uncover meaningful insights that support the development of statistical methods and machine learning algorithms
  • Lead the design, training, and deployment of NLP and transformer-based models for financial surveillance and supervisory use cases (e.g., misconduct detection, market abuse, trade manipulation, insider communication)
  • Develop machine learning models and other analytics following established workflows, while looking for optimization and improvement opportunities
  • Annotate data and review its quality
  • Perform exploratory data analysis and model failure-state analysis
  • Contribute to model governance, documentation, and explainability frameworks aligned with internal and regulatory AI standards
  • Guide clients and prospects through machine learning model and analytic fine-tuning and development processes
  • Provide guidance to junior team members on model development and EDA
  • Work with Product Manager(s) to intake project/product requirements and translate them into technical tasks within the team’s tooling, techniques, and procedures
  • Pursue continued, self-led personal development

Skills

  • Strong understanding of financial markets, compliance, surveillance, supervision, or regulatory technology
  • Experience with one or more data science and machine/deep learning frameworks and tooling, including scikit-learn, H2O, Keras, PyTorch, TensorFlow, pandas, NumPy, caret, tidyverse
  • Command of data science and statistics principles (regression, Bayes, time series, clustering, precision/recall, AUROC, exploratory data analysis, etc.)
  • Strong knowledge of key programming concepts (e.g. split-apply-combine, data structures, object-oriented programming)
  • Solid statistics knowledge (hypothesis testing, ANOVA, chi-square tests, etc.)
  • Knowledge of NLP transfer learning, including word embedding models (GloVe, fastText, word2vec) and transformer models and tooling (BERT, SBERT, Hugging Face, GPT-x, etc.)
  • Experience with natural language processing toolkits like NLTK, spaCy, NVIDIA NeMo
  • Knowledge of microservices architecture and continuous delivery concepts in machine learning and related technologies such as Helm, Docker and Kubernetes
  • Familiarity with Deep Learning techniques for NLP
  • Familiarity with LLMs, using Ollama and LangChain
  • Excellent verbal and written skills
  • Proven collaborator, thriving on teamwork
  • Master's or Doctor of Philosophy degree in Computer Science, Applied Math, Statistics, or a scientific field
  • Familiarity with cloud computing platforms (AWS, GCP, Azure)
  • Experience with automated supervision/surveillance/compliance tools

Company Overview

  • Smarsh helps its customers manage the risk and see the value in their communications data. It was founded in 2001 and is headquartered in Portland, Oregon, USA, with a workforce of 1001-5000 employees. Its website is http://www.smarsh.com.

Company H1B Sponsorship

  • Smarsh has a track record of offering H1B sponsorships, with 16 in 2025, 5 in 2024, 12 in 2023, 22 in 2022, 2 in 2021, and 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.