Print or save as PDF using your browser's print dialog
Helana Nosratbakhsh
Senior Data Engineer & Advisor
Summary
Senior Data Engineer with over 8 years of experience designing, building, and scaling enterprise-grade data platforms. Specialized in orchestrating complex ERP and CRM integrations, Snowflake-native pipelines, and Data Vault 2.0 implementations. Recognized for bridging the gap between business requirements and highly technical data engineering — successfully migrating legacy environments and validating massive-scale enterprise data to deliver certified, analytics-ready products that empower data science, reporting, and operational efficiency.
Experience
Senior Data Engineer—R1 RCM
March 2024 – Present·Philadelphia, PA (Remote)
- Restored $47.8M in combined monthly revenue reporting accuracy by leading cross-team alignment and resolving systemic data normalization inconsistencies across 1,247 healthcare facilities.
- Spearheaded the implementation of scalable Data Vault 2.0 architectures in Snowflake, orchestrating complex ETL/ELT pipelines across Raw, Refined, and Data Product layers using dbt and Apache Airflow.
- Optimized warehouse cost and performance through rigorous consumption monitoring and query tuning, successfully resolving multi-hour query timeouts and improving infrastructure efficiency.
- Partnered with Data Governance and Analytics teams to deliver certified data products, ensuring documented quality SLAs, lineage, and adherence to enterprise data observability best practices.
Data Engineer II / ETL Developer II—XSOLIS
2020 – March 2024·Philadelphia, PA (Remote)
- Accelerated enterprise data delivery for data science and clinical operations by engineering robust Python and SQL pipelines, supporting the deployment of advanced predictive and deep learning models.
- Optimized high-volume data extraction and Redshift enterprise data warehouse migrations by designing and scheduling interval-based SQL stored procedures from production databases.
- Enhanced executive visibility and clinical application performance by developing interactive Power BI semantic models and automated reporting dashboards.
- Secured enterprise data assets by authoring comprehensive source control documentation, integrating strict HITRUST compliance policies directly into development workflows.
Developer — Data Integration—Tractor Supply Company
August 2017 – February 2020·Nashville, TN
- Engineered Talend data integration jobs to synchronize and validate SAP ERP Item Master tables against enterprise Netezza data warehouses, achieving strict data consistency requirements for downstream analytics.
- Translated enterprise business needs into technical requirements, collaborating closely with business analysts to design logical and physical dimensional models for self-service reporting.
- Supported major promotional marketing initiatives by writing Java and SQL pipelines to transform semi-structured XML customer data into filtered, analytics-ready tables.
Junior Full Stack Software Developer—Nashville Software School
July 2016 – June 2017·Nashville, TN
- Delivered production-quality code by completing an immersive software development program focused on full-stack application development, databases, REST APIs, and object-oriented programming.
Database Analyst & Enterprise Growth Strategist—Emma / Marigold
2015 – 2017·Nashville, TN
- Maximized e-commerce and digital strategy goals for CEOs and CMOs by delivering expert consultative strategy and database analysis for the Emma SaaS platform.
- Expanded the retail market pipeline and bridged the gap between sales and engineering by targeting ideal client profiles across the university, agency, and retail sectors.
Database Analyst & Retail Enterprise Growth Strategist—Listrak
January 2014 – December 2014·Greater Philadelphia Area
- Optimized sales pipeline efficiency by engineering custom Salesforce data extraction queries and generating analytics reports to drive strategic decision-making.
- Accelerated enterprise client acquisition by conducting technical stack analyses (using Datanyze and builtWith) to map out scalable omni-channel integration strategies.
IT Recruiter and Account Executive—The Judge Group
October 2014 – 2015·Conshohocken, PA
- Executed comprehensive talent acquisition strategies for large hiring initiatives by facilitating stakeholder management with senior clients and vendors.
Pharmaceutical Account Manager—Day & Zimmerman
May 2013 – October 2014·Philadelphia, PA
- Streamlined recruitment and managed large national pharmaceutical accounts across the R&D, Clinical, and IT sectors.
Selected Projects
Real-Time Analytics PipelineFortune 500 E-Commerce Platform
Apache KafkaSpark StreamingSnowflakedbtAWS EKSTerraform
- Replaced daily batch jobs with a streaming architecture processing 50,000+ events/sec, cutting analytics latency from 24 hours to under 3 minutes.
- Improved inventory accuracy by 34% and reduced stockout events by 41% through live dashboard availability.
- Reduced infrastructure costs 22% by migrating from over-provisioned servers to auto-scaling EKS containers.
Cloud Data Warehouse ModernizationRegional Healthcare Analytics Provider
AWS Redshift ServerlessAWS GlueS3dbt CloudAirflowTerraform
- Migrated 8 TB of HIPAA-regulated data from on-premise SQL Server to AWS with zero data loss and zero compliance violations.
- Reduced average query time from 4.2 hours to 18 minutes; eliminated an $800K hardware refresh.
- Grew self-service analytics adoption from 6 power users to 40+ analysts within 3 months of go-live.
ML Feature Store PlatformSeries B FinTech Startup
FeastApache SparkBigQueryRedisAirflowGCPDocker
- Built a centralized feature store computing 200+ features, cutting model development cycles from 3–4 weeks to 4–5 days.
- Eliminated training-serving skew across all 12 production models; online serving latency at p99: 2.3 ms.
- Achieved 67% feature reuse rate across new model development efforts.
Technical Skills
Engineering & Modeling
Data Vault 2.0, Dimensional Modeling, Star Schemas, ERP System Integration (SAP), Master Data Validation, Historical BackfillsIntegration & Pipelines
SQL, Python, Java, Talend, dbt Cloud/Core, Apache Airflow, ETL/ELTCloud & Operations
Snowflake (Streams, Tasks, Dynamic Tables), AWS (EC2, S3, Glue, Lambda), Kubernetes Pod Operators, CI/CDQuality & Strategy
Data Contracts, Regression Testing, Lineage Tracking, Stakeholder Management, Agile/ScrumEducation
Bachelor of Science in Corporate Communication—Drexel University
Minor in MusicJunior Developer Bootcamp—Nashville Software School
Front-End & Back-End Development