All jobs

NLP Engineer (Remote)

100% Remote Full-time Open now

ROLE SUMMARY

We are hiring a hands-on NLP Engineer to build robust pipelines that convert policy, regulatory, fintech, and healthcare documents into structured, graph-ready data. You will own the full extraction lifecycle from raw text to clean, schema-validated outputs using classical NLP, deep learning, and LLM APIs. KEY RESPONSIBILITIES

- Pipeline Development: Design and build end-to-end text extraction pipelines for policy, regulatory, fintech, and healthcare documents

- Entity & Clause Extraction: Extract key entities (countries, companies, minerals) and structure policy clauses and obligations

- Deep Learning & Transformers: Fine-tune BERT / RoBERTa for NER, text classification, and relation extraction tasks

- LLM Integration: Leverage LLM APIs with structured output extraction, prompt engineering, and tool/function calling

- Data Engineering: Build scalable Python pipelines for high-volume document processing with robust pre-processing for PDF, DOCX, and HTML

- Schema & Graph Readiness: Define and enforce JSON schemas; ensure outputs are clean and compatible with knowledge graph ingestion

- Accuracy Improvement: Evaluate model performance, track metrics, and implement feedback loops to improve extraction quality over time

REQUIRED SKILLS

- 3–5 years hands-on NLP engineering real production pipelines, not just model experiments

- Strong Python skills: OOP, async programming, packaging, and testing

- NLP frameworks: spaCy, HuggingFace Transformers, NLTK

- Deep learning: fine-tuning transformer models for sequence labeling and classification

- LLM API integration: prompt engineering, structured outputs, and function/tool calling

- Data pipeline experience: ETL, batch processing, and text pre-processing at scale

- JSON schema design and validation using pydantic or json schema

GOOD TO HAVE

- Experience with legal, regulatory, or policy documents (contracts, compliance filings, government publications)

- Familiarity with knowledge graphs or graph databases (Neo4j, RDF)

- Document parsing tools: pdfplumber, Docling, Apache Tika

- Domain knowledge in fintech or healthcare NLP

- Exposure to information extraction benchmarks (CoNLL, DocRED, SciERC)

Apply To This Job

You might also like

Principal Software Engineer (Corporate Systems)

100% Remote Full-time

Customer Support Specialist

100% Remote Full-time

Staff Success Manager

100% Remote Full-time

Bookkeeper

100% Remote Full-time

ATI Heavy Maintenance Materials Specialist II

100% Remote Full-time

ATI Heavy Maintenance Materials Specialist II

100% Remote Full-time

IT Support Engineer

100% Remote Full-time

General Counsel

100% Remote Full-time

Sales Development Representative (m/w/d)

100% Remote Full-time

Director of Sales & Channel Strategy

100% Remote Full-time

CVS Customer Service Representative, Work From Home Opportunity

100% Remote Full-time

Jr .NET Software Engineer

100% Remote Full-time

Experienced Remote Licensed Therapist - Flexible Part-Time Career with Leading Online Mental Health Platform

100% Remote Full-time

Director, Customer Service – Crafting Exceptional Customer Experiences at arenaflex

100% Remote Full-time

Remote Customer Benefits Specialist at Globe Life American Income Schreiter Organization

100% Remote Full-time

Experienced Customer Service Representatives – Remote Work Opportunity with blithequark

100% Remote Full-time

Assistant Vice President, Political Risk & Structured Credit

100% Remote Full-time

Clinical Trials Data Coord

100% Remote Full-time

[Remote] Consultative Pharmacy Technician

100% Remote Full-time

Remote Data Entry Specialist for arenaflex - Unlock a World of Opportunities with arenaflex and Competitive Pay ($30/Hour) and Flexible Work Arrangements

100% Remote Full-time