ericg1212/README.md

Eric Grynspan · Data Engineer

Data Engineer | Fintech & Healthcare | Python · SQL · Snowflake · dbt · Airflow · AWS · Terraform · FHIR

8+ years delivering data systems across regulated environments — production pipelines, cloud-native architecture, and compliance-grade testing — in fintech, capital markets, and healthcare.


Projects

Proprietary AI builders generate a +92.0% Sharpe ratio premium over third-party integrators (Spearman ρ = +0.800, p ≈ 0.005) across 10 major tech stocks — visualized in an interactive Power BI dashboard.
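For readers less familiar with the two statistics quoted above, here is a minimal stdlib-only sketch of how an annualized Sharpe ratio and a Spearman rank correlation can be computed. It is not the pipeline's actual code: the function names, the 252-trading-day annualization, and the toy inputs are all assumptions made for illustration.

```python
import math
from statistics import mean, stdev


def sharpe_ratio(returns, risk_free_rate=0.0, periods_per_year=252):
    """Annualized Sharpe ratio from periodic returns (assumes daily data by default)."""
    per_period_rf = risk_free_rate / periods_per_year
    excess = [r - per_period_rf for r in returns]
    return mean(excess) / stdev(excess) * math.sqrt(periods_per_year)


def average_ranks(values):
    """Rank values 1..n, assigning tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho, computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


def t_statistic(rho, n):
    """t statistic for testing rho != 0, with n - 2 degrees of freedom."""
    return rho * math.sqrt((n - 2) / (1 - rho ** 2))


# Toy inputs, purely illustrative (not project data).
toy_scores = [1, 2, 3, 4, 5]
toy_sharpes = [0.2, 0.1, 0.5, 0.4, 0.9]
print(round(spearman_rho(toy_scores, toy_sharpes), 3))
```

The t statistic can then be compared against a t distribution with n − 2 degrees of freedom to obtain a p-value; a library such as SciPy would normally handle that step.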

Pipelines: 4 production Airflow DAGs (stocks, SEC EDGAR 10-K, FRED macro, analysis)
Storage: Hive-partitioned S3 data lake · Parquet/Snappy · Glue catalog · serverless Athena
Quality: 184 pytest unit tests · moto AWS mocking · GitHub Actions CI/CD
IaC: End-to-end Terraform
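As a rough illustration of what "Hive-partitioned" means for the S3 layout, the sketch below builds an object key with `key=value` path segments, which is the convention Athena and Glue use to discover partitions. The bucket name, dataset name, per-ticker file naming, and year/month/day partition scheme are all hypothetical; the real project may partition differently.

```python
from datetime import date


def partition_key(dataset: str, ticker: str, day: date,
                  bucket: str = "example-data-lake") -> str:
    """Build a Hive-style S3 URI: each partition column becomes a key=value folder.

    The bucket and the year/month/day scheme are assumptions for illustration.
    """
    return (
        f"s3://{bucket}/{dataset}/"
        f"year={day.year}/month={day.month:02d}/day={day.day:02d}/"
        f"{ticker}.parquet"
    )


print(partition_key("stocks", "AAPL", date(2024, 1, 15)))
```

With a layout like this, a query that filters on the partition columns only needs to scan the matching prefixes rather than the whole dataset.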

Classifies 257K denied claims by root cause (systematic denials vs. documentation failures), because the remediation path differs fundamentally for each.

Stack: Synthea FHIR R4 · Python · Snowflake (RAW → staging → mart) · dbt · Dagster
Scale: 495K total claims · 51.9% denial rate · 12 dbt models · 83 automated tests
RWE: T2D/CKD cohort · 104 patients · 54.8% metformin utilization
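To make the root-cause split concrete, here is a minimal sketch of classifying denied claims and computing a denial rate. The reason labels, their assignment to "systematic" or "documentation", and the claim record shape are all hypothetical. The actual project does this work in dbt models on Snowflake, not in Python.

```python
from collections import Counter

# Hypothetical mapping from a denial reason label to a root-cause bucket.
ROOT_CAUSE_BY_REASON = {
    "non_covered_service": "systematic",
    "payer_policy_exclusion": "systematic",
    "missing_documentation": "documentation",
    "invalid_diagnosis_code": "documentation",
}


def classify_denial(reason: str) -> str:
    """Return the root-cause bucket, or 'unclassified' for unknown reasons."""
    return ROOT_CAUSE_BY_REASON.get(reason, "unclassified")


def summarize(claims):
    """Return (denial_rate, counts of denied claims per root-cause bucket)."""
    denied = [c for c in claims if c["status"] == "denied"]
    counts = Counter(classify_denial(c["denial_reason"]) for c in denied)
    denial_rate = len(denied) / len(claims) if claims else 0.0
    return denial_rate, dict(counts)


# Toy records, purely illustrative (not project data).
sample = [
    {"status": "paid", "denial_reason": None},
    {"status": "denied", "denial_reason": "missing_documentation"},
    {"status": "denied", "denial_reason": "non_covered_service"},
    {"status": "denied", "denial_reason": "invalid_diagnosis_code"},
]
rate, buckets = summarize(sample)
print(rate, buckets)
```

Keeping the mapping as data rather than branching logic makes it straightforward to move into a seed or lookup table, one common way to express this kind of rule in a dbt project.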

Stack

Python SQL Snowflake dbt Apache Airflow Dagster AWS Docker Terraform Power BI GitHub Actions PostgreSQL pandas pytest


Connect

LinkedIn · Sharpe Premium Pipeline · Healthcare Claims Pipeline

Pinned

  1. sharpe-premium-pipeline (Public)

    $650B in AI spend, 10 large-cap stocks, 3 years of data. Proprietary AI builders outperform third-party integrators by 92% on risk-adjusted returns (Spearman ρ=+0.800, p≈0.005). Airflow → S3 → Athe…

    Python 1

  2. healthcare-claims-pipeline (Public)

    HL7 FHIR R4 → OMOP CDM → Snowflake → dbt → Dagster. RCM: classifies 257K denied claims by root cause — systematic vs. documentation failures. RWE: T2D+CKD metformin utilization cohort. 12 dbt model…

    Python 1