All jobs

DevOps / Infrastructure Engineer

100% Remote Full-time Open now
About BTSE: BTSE Group is a global leader in fintech and blockchain technology, anchored by three core business pillars: Exchange, Payments, and Infrastructure Development. Serving over 100 corporate clients worldwide, we provide white-label exchange and payment solutions. Our offerings encompass everything from exchange infrastructure hosting and development to custody, wallets, payments, blockchain integration, trading, and more. We are looking for talented professionals in marketing, operations, customer support, and other departments. The roles offered may be on-site, remote, or hybrid, in collaboration with our local partner. About the opportunity: You keep the platform running reliably. For the first client operating in crypto markets, this means 24/7 uptime with zero maintenance windows. You build multi-tenant Kubernetes infrastructure with per-tenant namespace isolation, manage GPU scheduling for AI model serving, set up CI/CD for rapid iteration, and own monitoring and on-call. You also automate tenant provisioning so that scaling from one client to ten is an operational exercise, not an engineering project. About BTSE: BTSE Group is a global leader in fintech and blockchain technology, anchored by three core business pillars: Exchange, Payments, and Infrastructure Development. Serving over 100 corporate clients worldwide, we provide white-label exchange and payment solutions. Our offerings encompass everything from exchange infrastructure hosting and development to custody, wallets, payments, blockchain integration, trading, and more. We are looking for talented professionals in marketing, operations, customer support, and other departments. The roles offered may be on-site, remote, or hybrid, in collaboration with our local partner.   About the opportunity: You own the AI core: model serving, the retrieval-augmented generation (RAG) pipeline, prompt engineering, and the feedback-to-training pipeline. In Phase 1, you make the base model perform as well as possible through context engineering — system prompts, few-shot exemplars, and retrieval optimisation — without modifying model weights. You also design the custom model training workflow so that enterprise clients can train their own fine-tuned models in Phase 2. This is the highest-leverage individual contributor role on the founding team. Responsibilities
  • Set up a multi-tenant Kubernetes cluster: shared services namespace, per-tenant namespaces for isolated workloads, GPU node pools for model inference.
  • Build CI/CD pipeline: source control → container build → automated deployment with zero-downtime rolling updates.
  • Configure GPU management: scheduling, resource quotas per tenant, device plugins.
  • Set up comprehensive monitoring: per-tenant metrics, SLA tracking, data pipeline health, GPU utilisation, API latency percentiles, WebSocket connection stability.
  • Implement backup and disaster recovery: cross-region replication, automated database backups.
  • Build tenant provisioning automation: scripted creation of new tenant namespaces, storage, network policies, and service accounts.
  • Security hardening: network policies between namespaces, vulnerability scanning, audit logging.
  • 24/7 on-call during initial pilot (rotating with Tech Lead).
  • Requirements
  • 4+ years DevOps/SRE; Kubernetes cluster operations including multi-tenant patterns.
  • GPU workloads on Kubernetes (GPU Operator, device plugins, resource scheduling).
  • CI/CD pipelines: GitHub Actions, ArgoCD or FluxCD.
  • Terraform IaC.
  • On-call experience and incident management.
  • Nice to have
  • Kubernetes namespace isolation and network policies for multi-tenancy.
  • 24/7 systems experience (crypto, gaming, or global SaaS).
  • Monitoring WebSocket-heavy architectures and streaming data pipelines.
  • GPU cluster management for ML inference.
  • #LI-MC1 Apply To This Job

    You might also like

    Data Engineer

    100% Remote Full-time

    IN_Bosch Rexroth India_ Executive / Assistant Manager_Technical Sales_Hydraulics

    100% Remote Full-time

    Représentant(e) commercial(e)/marketing sur le terrain – Stage d'été de 4 mois pour étudiants / Field Sales/Marketing Representative – 4 Month Summer Student

    100% Remote Full-time

    Federal Account Executive, Strategic Enterprise

    100% Remote Full-time

    Marketing Consultant – Salesforce Marketing Cloud (SFMC)

    100% Remote Full-time

    Operations Manager - Aviation Analytics Integration

    100% Remote Full-time

    Systems Administration, Lead Associate

    100% Remote Full-time

    VDI Administrator

    100% Remote Full-time

    AR Specialist I

    100% Remote Full-time

    Regional Partner Manager, SLED West

    100% Remote Full-time

    Senior Software Engineer, ML Infrastructure

    100% Remote Full-time

    Manager, US Government Affairs

    100% Remote Full-time

    Experienced Part-Time Work At Home American Express Virtual Assistant - Customer Service & Account Management

    100% Remote Full-time

    Experienced Remote Data Entry Coordinator – Full-time Opportunity for Detail-Oriented Professionals to Join blithequark and Thrive in a Dynamic Work Environment

    100% Remote Full-time

    Compliance Officer - Brokerage - Securities Admin

    100% Remote Full-time

    Senior Cloud Engineer (AWS)

    100% Remote Full-time

    Computer and Information Systems Manager/Team Lead

    100% Remote Full-time

    Remote Travel Agent (Niche Market-Disney Vacations)

    100% Remote Full-time

    Executive Assistant/Office Manager (Virtual)

    100% Remote Full-time

    Site Activation Specialist II - Min 2 years of regulatory experience in Mexico - 100% Remote role

    100% Remote Full-time