Available for Work

George Cubas

Machine Learning Engineer

Machine Learning Engineer with 10+ years of combined experience in tech and energy. I build scalable AI solutions that drive revenue — from oil & gas economics to fintech analytics and real estate valuation models.

United States | Python / ML / Energy
Tech Stack

Skills & Expertise

Deep specialization in Python/ML with strong engineering fundamentals spanning backend systems, cloud infrastructure, and energy domain expertise.

ML & Data Science

Python: 96%
PyTorch & Scikit-learn: 92%
Polars & Pandas: 94%
Causal Inference & Stats: 88%

Backend & Cloud

FastAPI & Django: 93%
Apache Spark & Databricks: 90%
AWS & Docker: 88%
PostgreSQL & Weaviate: 91%

Engineering & Domain

Energy Economics: 92%
ETL & Data Pipelines: 94%
REST APIs & Microservices: 91%
Git & CI/CD: 89%

Python, PyTorch, FastAPI, Polars, Spark, Databricks, AWS, Weaviate, PostgreSQL, Django, Docker, Scikit-learn, NumPy, SQLAlchemy, Rust, R

Approach

Work Methodology

How I approach building ML systems that deliver measurable business impact.

01

Discovery & Analysis

Understanding business objectives and data landscapes before writing a single line of code.

02

Scalable Architecture

Building modular, maintainable ML pipelines and APIs designed to handle production traffic.

03

Rapid Iteration

Fast experiment cycles with clear metrics, moving from prototype to production efficiently.

04

Continuous Optimization

Performance tuning, model retraining, and infrastructure improvements as standard practice.

Career Path

Work Experience

A decade of building ML solutions across energy, fintech, real estate, and enterprise — each role compounding domain expertise with engineering depth.

Python Developer

Sterling Lariat Ventures
Mar 2025 - Jun 2025 | 15% Revenue Increase
  • Deployed production-grade FastAPI microservice for Stripe payment processing
  • Migrated invoice generation from Pandas to Polars for parallelized data pipelines
  • Built Polars-powered reporting engine for structured monthly financial summaries
  • Implemented async webhook handlers with PostgreSQL via SQLAlchemy ORM
FastAPI, Polars, Stripe, PostgreSQL, Docker

Python Developer

EOG Resources
Mar 2024 - Sep 2024 | $50M Investment Decisions
  • Developed Oil & Gas economic models for cost recovery and profit split analysis
  • Created DD&A (depreciation, depletion, and amortization) functions enhancing cash flow models with Polars parallelization
  • Implemented NPVI and Profitability Index calculations for accurate cash flow analysis
  • Designed FastAPI endpoints for advanced scenario sensitivity analysis
Polars, FastAPI, Energy Economics, Python
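
The net present value and Profitability Index calculations mentioned above can be illustrated with a minimal sketch. It is not the production model: the function names, the 10% discount rate, and the cash flows are hypothetical, and the Profitability Index is assumed to mean the present value of the future cash flows divided by the initial outlay.

```python
from typing import Sequence


def npv(rate: float, cash_flows: Sequence[float]) -> float:
    """Net present value; cash_flows[0] occurs at t=0 and is not discounted."""
    return sum(cf / (1.0 + rate) ** t for t, cf in enumerate(cash_flows))


def profitability_index(rate: float, cash_flows: Sequence[float]) -> float:
    """PV of the cash flows after t=0 divided by the initial outlay (assumed definition)."""
    initial_outlay = -cash_flows[0]
    if initial_outlay <= 0:
        raise ValueError("cash_flows[0] must be a negative initial investment")
    # Zero out the t=0 entry so only the future cash flows are discounted.
    pv_future = npv(rate, [0.0, *cash_flows[1:]])
    return pv_future / initial_outlay


# Hypothetical project: 100 invested today, 60 received at the end of each of two years.
flows = [-100.0, 60.0, 60.0]
project_npv = npv(0.10, flows)
project_pi = profitability_index(0.10, flows)
print(f"NPV: {project_npv:.2f}, PI: {project_pi:.3f}")
```

Under this definition a Profitability Index above 1 corresponds to a positive NPV, since PI = 1 + NPV / initial outlay.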

Python Developer

Compbuss
Aug 2023 - Present | RAG Systems in Production
  • Built RAG system using Dolphin LLaMA with DSPy and Weaviate vector DB
  • Designed AI services including image classifiers using natural evolution methods
  • Maintained CI/CD pipelines with test-driven development practices
RAG, Weaviate, DSPy, FastAPI, LLM

Python Developer

University of Virginia
Mar 2022 - Dec 2022 | 40% Faster Grant Reporting
  • Engineered ETL pipelines for Office of Sponsored Programs using Pandas & SQLAlchemy
  • Built web crawlers for frontend data quality verification
  • Leveraged multiprocessing and asyncio for parallelized performance
Pandas, SQLAlchemy, ETL, AsyncIO

Python Developer

Kinstone Investment Properties
Jul 2019 - Mar 2022 | 12% Portfolio ROI
  • Built ML valuation models using Pandas, NumPy and Scikit-learn
  • Scraped Freddie Mac & Case-Shiller data with BeautifulSoup & Selenium
  • Applied Monte Carlo analysis for levered and unlevered IRR prediction
Scikit-learn, Monte Carlo, Selenium, NumPy
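
A minimal sketch of a Monte Carlo analysis of levered and unlevered IRR is shown below. It is illustrative only and not the model used in this role: the purchase price, net operating income, interest-only loan terms, and the uniform appreciation range are all hypothetical assumptions, and IRR is found by simple bisection.

```python
import random
from statistics import mean
from typing import List, Sequence, Tuple


def irr(cash_flows: Sequence[float], lo: float = -0.99, hi: float = 1.0) -> float:
    """IRR by bisection; assumes NPV changes sign exactly once on [lo, hi]."""
    def npv(rate: float) -> float:
        return sum(cf / (1.0 + rate) ** t for t, cf in enumerate(cash_flows))

    if npv(lo) * npv(hi) > 0:
        raise ValueError("IRR is not bracketed by [lo, hi]")
    for _ in range(100):
        mid = (lo + hi) / 2.0
        if npv(lo) * npv(mid) <= 0:
            hi = mid  # the root lies in [lo, mid]
        else:
            lo = mid  # the root lies in [mid, hi]
    return (lo + hi) / 2.0


def simulate_irrs(n_trials: int, seed: int = 0) -> Tuple[List[float], List[float]]:
    """Return (unlevered, levered) IRR samples for a hypothetical 5-year hold."""
    rng = random.Random(seed)
    # All figures below are hypothetical placeholders.
    price, noi, years = 1_000_000.0, 60_000.0, 5
    loan = 0.70 * price      # 70% loan-to-value, interest-only, repaid at sale
    interest = 0.05 * loan   # annual interest payment
    equity = price - loan
    unlevered: List[float] = []
    levered: List[float] = []
    for _ in range(n_trials):
        growth = rng.uniform(-0.02, 0.08)  # assumed annual appreciation
        sale = price * (1.0 + growth) ** years
        unlevered.append(irr([-price] + [noi] * (years - 1) + [noi + sale]))
        levered.append(
            irr([-equity]
                + [noi - interest] * (years - 1)
                + [noi - interest + sale - loan])
        )
    return unlevered, levered


unlev, lev = simulate_irrs(1_000, seed=42)
print(f"mean unlevered IRR: {mean(unlev):.2%}")
print(f"mean levered IRR:   {mean(lev):.2%}")
```

The appreciation range is kept narrow so that every simulated cash flow series has a single sign change; a wider distribution would need handling for scenarios where the sale does not cover the loan.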

Python Engineer

Occidental Petroleum
Jan 2017 - Jul 2019 | Drilling Risk Analysis
  • Built Anti-Collision Risk Analysis app using the Django framework
  • Created neural network-based document extraction pipeline
  • Developed ML models for production drawdown using Scikit-learn
Django, Scikit-learn, ArcGIS, Python

Drilling Engineer

BP Alaska
2013 - 2016 | Field Operations
  • Managed drilling operations on the North Slope of Alaska
  • Foundation in energy sector operations and engineering economics
Drilling, Operations, Energy

Selected Work

Projects

Open-source projects showcasing ML engineering, distributed systems, and applied AI research.

RAG Pipeline

Generative AI with Weaviate & DSPy

A production RAG framework that integrates multiple data sources with the Weaviate vector DB and DSPy to enrich LLM knowledge bases and improve contextual accuracy.

Impact: Multi-source retrieval with context-aware generation

Python, Weaviate, DSPy, Ollama, FastAPI

Apache Spark Prediction Pipeline

IoT Data & Pressure Prediction

End-to-end Spark pipeline that ingests IoT sensor data and predicts pressure in real time, using distributed computing for high-throughput processing.

Impact: Real-time prediction at scale with IoT data

Apache Spark, Python, IoT, Databricks

Reinforcement Learning NN

Actor-Critic for CartPole

An Actor-Critic reinforcement learning algorithm that solves the CartPole problem with two networks: an actor network that learns the policy (a probability distribution over actions) and a critic network that estimates state values to evaluate the actor's choices.

Impact: Converged policy that maximizes cumulative reward

Python, PyTorch, OpenAI Gym, RL
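
The core of a one-step actor-critic update can be sketched in plain Python as below. This is an assumption about the general technique rather than the project's actual code: the function name, arguments, and discount factor are hypothetical, and scalar values stand in for the network outputs that PyTorch would produce.

```python
import math
from typing import Tuple


def actor_critic_losses(
    log_prob_action: float,
    value_s: float,
    value_next: float,
    reward: float,
    done: bool,
    gamma: float = 0.99,
) -> Tuple[float, float, float]:
    """Return (td_error, actor_loss, critic_loss) for a single transition."""
    # Bootstrapped target: no future value is added after a terminal step.
    target = reward + (0.0 if done else gamma * value_next)
    # The TD error doubles as a one-step advantage estimate.
    td_error = target - value_s
    # Actor: raise the log-probability of actions with positive advantage.
    # In an autograd framework td_error would be treated as a constant here.
    actor_loss = -log_prob_action * td_error
    # Critic: squared error between the value estimate and the target.
    critic_loss = td_error ** 2
    return td_error, actor_loss, critic_loss


# Hypothetical transition: the action had probability 0.5 under the policy.
td, a_loss, c_loss = actor_critic_losses(
    log_prob_action=math.log(0.5),
    value_s=1.0,
    value_next=2.0,
    reward=1.0,
    done=False,
    gamma=0.9,
)
print(td, a_loss, c_loss)
```

A positive TD error means the step went better than the critic expected, so minimizing the actor loss makes that action more likely.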

Open for Opportunities

Let's build something impactful together

Full-Time / Contract / Consulting — available for ML engineering, data pipeline architecture, and AI product development.