From lakehouse design and orchestrated pipelines to LLM-powered agents and RAG — built end-to-end by someone who also does the data science behind them.
- Data Platforms — cloud-native lakehouses (Snowflake, BigQuery, DBT, Airflow/Prefect), end-to-end from ingestion to serving. Production scale: 13TB+, 700+ pipelines, 60+ sources.
- AI / LLM Systems — RAG pipelines, agent-based automation, LLM-powered document processing. Recent: HR/payroll resolution agents, workforce planning systems.
- ML & Fraud Detection — Isolation Forest, AWS Fraud Detector, custom statistical models over high-volume transaction data (1B+ monthly transactions).
- Data Engineering — ETL/ELT at scale, CDC pipelines, Kubernetes-based distributed processing, API integrations across SaaS ecosystems.
Data: Snowflake · DBT · Airflow · Prefect · Kafka · Spark · Airbyte
AI/LLM: LangChain · ChromaDB · RAG · Agents · OpenAI · Claude · Ollama
Cloud: AWS (EMR, ECS, ECR, Lambda, SQS, RDS) · GCP
Languages: Python · SQL · Scala · Bash
Infra: Docker · Kubernetes · Terraform · CI/CD
Most of my production work lives in private repositories under NDA — the public repos here are personal experiments, tooling, and academic work.
Current public projects include a structured data extraction tool for Argentine bank statements (simple_santander_rio_parser) and ML research from my Master's in Data Science at ITBA (gravitational_waves_classifiers).
25+ years building data and software systems across fintech, agro, retail, and pharma.
MSc Data Science · MBA · BSc Computer Science.
Previously: Worky · Cobre · Microsoft · Deloitte · Cognizant.

