Wanted: Senior Python Engineer (Remote) in USA
Job title: Senior Python Engineer (Remote)
Company: Bristol-Myers Squibb
Job description: Working with Us
Challenging. Meaningful. Life-changing. Those aren't words that are usually associated with a job. But working at Bristol Myers Squibb is anything but usual. Here, uniquely interesting work happens every day, in every department. From optimizing a production line to the latest breakthroughs in cell therapy, this is work that transforms the lives of patients, and the careers of those who do it. You'll get the chance to grow and thrive through opportunities uncommon in scale and scope, alongside high-achieving teams rich in diversity. Take your career farther than you thought possible.
Bristol Myers Squibb recognizes the importance of balance and flexibility in our work environment. We offer a wide variety of competitive benefits, services and programs that provide our employees with the resources to pursue their goals, both at work and in their personal lives.
Note - this role can be fully remote.
When you join BMS, you are joining a diverse, high-achieving team united by a common mission.
The Informatics and Predictive Sciences (IPS) mission is to Pioneer, Partner and Predict to drive transformative insights for patient benefit. IPS conducts applied computational research in areas that include genomic, structural and molecular informatics, computational and systems biology, patient selection and translational biomarker research, and broader fields including knowledge science, epidemiology and machine learning, across the full lifecycle of drug discovery and development and across all therapeutic areas at BMS. We do this in close partnership with scientific and clinical experts in the field, both inside and outside the company. We perform innovative science to empower key data-driven decisions across a rich pipeline of next-generation medicines. In doing so, our work transforms the lives of patients, as well as our own lives and careers.
Here, you'll get the chance to grow and thrive through opportunities that are uncommon in scale and scope. You'll pursue innovative ideas while advancing professionally alongside some of the brightest minds in biopharma.
The Research Engineering group of Bristol Myers Squibb seeks a resilient, results-oriented Senior Data Engineer to join our motivated and diverse team focusing on informatics and data enablement initiatives in Research. The individual will play a leadership role in data engineering, will be responsible for helping to define data engineering practices for the group and will build data ingestion pipelines using Python. Additionally, the individual will design and implement ETL pipelines and APIs that focus on data enablement of primary and secondary high-dimensional data in pre-clinical and clinical settings where data is being employed to identify molecular drug targets, characterize MOA, prioritize disease indications, and generate patient selection hypotheses. This hands-on role interfaces closely with data and computational scientists in Informatics and Predictive Sciences and business partners in IT and supports programs spanning both discovery and translational sciences. We are seeking an individual with extensive experience integrating data and building data solutions to make data accessible and meaningful for the Research community.
Responsibilities
- Innovate and advise on the latest technologies and standard methodologies in Data Engineering and be able to identify software solutions that can address hurdles in data enablement
- Design, implement and manage ETL data ingestion pipelines that ingest vast amounts of genomic, phenotypic, and screening data from public, internal and partner sources
- Evaluate database and data storage solutions to find the best way to model, store and retrieve data
- Design and build REST APIs that allow flexible and easy access to ingested data
- Collaborate with data scientist leads to establish the data enablement methods best suited to making the data accessible and interpretable for downstream scientists
- Proactively communicate data ecosystem and pipeline value propositions to partnering scientific collaborators
- Collaborate with colleagues across Informatics and Predictive Sciences to make data, including raw/interim data, available to Research department personnel as the need arises
- Help ensure good engineering practices and code readability across data engineering
Qualifications
- Bachelor's Degree and 7+ years of academic / industry experience in an engineering field
- or Master's Degree and 5+ years of academic / industry experience in an engineering field
- or PhD and 2+ years of academic / industry experience in an engineering field
- Excellent skills and deep knowledge in Python and object-oriented programming are a must, including common Python libraries such as pandas, boto3, flask, sqlalchemy, psycopg2, paramiko
- Excellent skills and deep knowledge of databases such as Postgres, Elasticsearch, Redshift, and Athena, including distributed database design, SQL vs. NoSQL, database optimizations, and database administration
- Solid understanding of ETL pipelines and workflow management tools such as Airflow, AWS Glue, or AWS Step Functions
- Solid understanding of AWS cloud computing services such as S3, EC2, ECS, Batch, Elastic Load Balancer, Lambda, EMR, RDS, CloudWatch
- Solid understanding of the principles of HTTP, SSH, and basic networking concepts
- Experience designing and building REST APIs
- Solid understanding of and experience with containers using Docker, ECS, and ECR
- Proficiency with modern software development methodologies such as Agile, source control, CI/CD, project management and issue tracking with JIRA
- Proficiency with Linux
- Working understanding of and experience with various filesystems and object storage, including Linux filesystems, S3, SFTP, and FTP
- Experience in a life sciences research environment a plus