Data Scientist (Big Data Engineer) 3

Remote, USA Full-time
Solicitation Reference Number: 2026C0014DIrect Client: Texas Department of Family and Protective ServicesWorking Title: Data Scientist (Big Data Engineer) 3Work Location: Austin, Tx - Telework JD: The Worker is responsible for developing, maintaining, and optimizing big data solutions using the Databricks Unified Analytics Platform. This role supports data engineering, machine learning, and analytics initiatives within this organization that relies on large-scale data processing. Duties include: • Designing and developing scalable data pipelines • Implementing ETL/ELT workflows • Optimizing Spark jobs • Integrating with Azure Data Factory • Automating deployments • Collaborating with cross-functional teams • Ensuring data quality, governance, and security. CANDIDATE SKILLS AND QUALIFICATIONS Minimum Requirements: Candidates that do not meet or exceed the minimum stated requirements (skills/experience) will be displayed to customers but may not be chosen for this opportunity. Years Required/Preferred Experience 4 Required Implement ETL/ELT workflows for both structured and unstructured data 4 Required Automate deployments using CI/CD tools 4 Required Collaborate with cross-functional teams including data scientists, analysts, and stakeholders 4 Required Design and maintain data models, schemas, and database structures to support analytical and operational use cases 4 Required Evaluate and implement appropriate data storage solutions, including data lakes (Azure Data Lake Storage) and data warehouses 4 Required Implement data validation and quality checks to ensure accuracy and consistency 4 Required Contribute to data governance initiatives, including metadata management, data lineage, and data cataloging 4 Required Implement data security measures, including encryption, access controls, and auditing; ensure compliance with regulations and best practices 4 Required Proficiency in Python and R programming languages 4 Required Strong SQL querying and data manipulation skills 4 Required Experience with Azure cloud platform 4 Required Experience with DevOps, CI/CD pipelines, and version control systems 4 Required Working in agile, multicultural environments 4 Required Strong troubleshooting and debugging capabilities 3 Required Design and develop scalable data pipelines using Apache Spark on Databricks 3 Required Optimize Spark jobs for performance and cost-efficiency 3 Required Integrate Databricks solutions with cloud services (Azure Data Factory) 3 Required Ensure data quality, governance, and security using Unity Catalog or Delta Lake 3 Required Deep understanding of Apache Spark architecture, RDDs, DataFrames, and Spark SQL 3 Required Hands-on experience with Databricks notebooks, clusters, jobs, and Delta Lake 1 Preferred Knowledge of ML libraries (MLflow, Scikit-learn, TensorFlow) 1 Preferred Databricks Certified Associate Developer for Apache Spark 1 Preferred Azure Data Engineer Associate Apply tot his job
Apply Now

Similar Jobs

LEAD Data Engineer - Big Data

Remote, USA Full-time

Principal Data Engineer (Remote)

Remote, USA Full-time

Sr. Big Data Engineer

Remote, USA Full-time

Data Engineer, Life Sciences Technology Solutions

Remote, USA Full-time

Data Engineer (Enterprise Landing Zone) (multiple positions) in Columbus, OH.

Remote, USA Full-time

Remote Bilingual Spanish Representative

Remote, USA Full-time

Customer Service Representative - Bilingual (Spanish)

Remote, USA Full-time

Bilingual Scheduling Center Agent

Remote, USA Full-time

Remote - Bilingual Corporate Resolutions Specialist (Spanish/English)

Remote, USA Full-time

Bilingual Call Center Specialist - Remote Role

Remote, USA Full-time

Experienced Customer Service Representative – Full Time Remote Call Center Chat Specialist for Dynamic Customer Engagement and Support

Remote, USA Full-time

Billing Quality Control Coordinator - REMOTE (Northeast)

Remote, USA Full-time

Experienced Data Entry Professional for Remote Opportunities – Utilizing Microsoft Office and Database Management Skills for Accurate Data Input and Management at arenaflex

Remote, USA Full-time

Senior Payroll Specialist (Remote Opportunity) – 744000016133718-5844

Remote, USA Full-time

Dealer Account Manager

Remote, USA Full-time

PRN Coordinator (Clinical Operations), PT, Anywhere

Remote, USA Full-time

Compliance Analyst II

Remote, USA Full-time

Customer Support & Data Entry Jobs Permanent WFH Roles

Remote, USA Full-time

**Experienced Full Stack EHS Lead – Data Management and Operations**

Remote, USA Full-time

Director, Cyber Governance and Controls

Remote, USA Full-time
Back to Home