Data Engineer

CVS Health

Remote

Quick summary

Work type
Remote
Location
Hartford, CT
Posted
6 days ago
Closes
Jul 6, 2026

Market check

Salary context

How this pay compares to similar roles

Salary chart: similar roles marked at $163k on a scale from $110k to $210k.

This listing doesn't post a salary. Most similar roles pay $126,800–$199,050.

Based on 240 similar postings.

Employer

About CVS Health

CVS Health is a leading American healthcare company operating retail pharmacies, pharmacy benefit management services, and a health insurance segment through Aetna, one of the nation's largest health insurers.

Industry: Healthcare & Pharmacy

CVS Health currently has 407 open roles on FindRole.

Listed pay typically runs $118,450–$284,280 across 133 roles with salary data.

At a glance

TL;DR · Data Engineer

Aetna Resources, LLC, a CVS Health company, seeks a Data Engineer in Hartford, CT, with opportunities for telecommuting. The role involves developing and managing large-scale data structures and ETL workflows using technologies such as Spark, PySpark, Scala, MySQL, NoSQL, PowerBI, Tableau, and the PyData ecosystem. The ideal candidate has expertise in designing efficient data pipelines, optimizing queries, and deploying predictive models in cloud environments such as GCP, AWS, or Azure. Proficiency in machine learning, statistical analysis, and NLP with frameworks such as Scikit-learn, SpaCy, PyTorch, and Spark NLP is also required. The position requires a Master’s degree in Computer Science, Data Science, Statistics, Mathematics, Analytics, or a related field, along with relevant coursework and experience in these technologies and tools.

What you'll do

  • Develop and manage large-scale data structures and pipelines using Java, Python, and R.
  • Design and optimize ETL workflows for complex business problems.
  • Implement machine learning models in cloud environments like GCP or Azure.
  • Utilize Spark, PySpark, Scala, and other tools to process big data efficiently.
  • Create visualizations with PowerBI and Tableau to communicate insights effectively.
  • Build and deploy predictive models using Vertex AI and the PyData ecosystem.
  • Design data architectures including distributed computing engines and ML infrastructure.

What we're looking for

  • Master’s degree in Computer Science, Data Science, Statistics, Mathematics, Analytics, or a related field (required).
  • Proficiency in Java, Python, and R programming languages.
  • Expertise in Spark, PySpark, Scala, MySQL, NoSQL, PowerBI, Tableau.
  • Experience designing and optimizing data pipelines and queries.
  • Knowledge of machine learning, statistical analysis, predictive modeling.
  • Skills in NLP tools (Scikit-learn, SpaCy, PyTorch, Spark NLP).
  • Cloud deployment experience for ML systems (GCP, AWS, Azure).

More like this

Similar roles

Data Engineer

CVS Health

Remote (New York, NY) 6 days ago
AWS Azure GCP Python Java R Spark PySpark Scala SAS SQL Hadoop HDFS CI/CD Jenkins GIT Machine learning Statistical analysis Predictive modeling NLP Scikit-Learn Spacy Pytorch Spark NLP ETL Data warehousing Big Data Distributed computing
Remote

Data Engineer

CVS Health

Remote (Woonsocket, RI) 6 days ago
Java Python R CI/CD Jenkins GIT Agile SAFe JIRA Rally Confluence SQL MySQL Hadoop HDFS Hive Big Data DevOps
Remote

Data Scientist

CVS Health

Remote (Wellesley, MA) 6 days ago
Python R SQL Hadoop Spark Airflow Kafka PySpark Scala Java Scikit-Learn Spacy Pytorch Spark NLP Kubernetes AWS Azure GCP
Remote

Data Engineer

Booz Allen Hamilton

Albuquerque, NM 41 days ago $61,900–$141,000
Python PostgreSQL AWS Docker Kubernetes Terraform CI/CD RESTful APIs PySpark CloudFormation CDK Data quality management framework Log monitoring and alerting Data validation Secure access control

Data Engineer

Booz Allen Hamilton

McLean, VA +3 26 days ago $62,000–$141,000
Python Java C++ ETL ELT Spark Databricks Hadoop Hive AWS EMR Kafka UNIX Linux Shell scripting

Data Engineer

Citi

Remote (Jersey City, NJ) +3 26 days ago $142,320–$213,480
AWS Glue AWS Athena AIRFLOW SNOWFLAKE MongoDB Oracle PostgreSQL Unix shell scripting Python PySpark AbInitio ETL BigData CI/CD
Remote