Skip to content
View Sam-24-dev's full-sized avatar

Highlights

  • Pro

Block or report Sam-24-dev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Sam-24-dev/README.md

👋 About Me

I don't just analyze data. I build the systems that make analysis possible.

Junior Data Engineer & Analyst | 7th Semester ESPOL, Ecuador

I am an engineer focused on the complete data lifecycle: from building robust architectures (ETL/SQL) to analyzing trends and deploying Machine Learning models.

Unlike a traditional analyst, my technical background allows me to not only visualize data but also build and optimize the systems behind it. My goal is to transform complex raw data into clear, actionable strategies that drive business growth.

💼 What I Bring to the Table:

  • Data Engineering: Automating ETL pipelines and optimizing database queries (40% performance boost).
  • Machine Learning: Building predictive models for dynamic pricing and real-world scenarios.
  • Business Intelligence: Identifying financial gaps ($16k+) and visualizing KPIs for decision-making.

🔭 Currently

🔨 Building Technology Trend Analysis Platform — End-to-end multi-source ETL pipeline tracking developer trends across GitHub, StackOverflow, and Reddit. Features Pandera quality gates, a DuckDB analytics engine, and fully automated CI/CD workflows (133 passing tests) powering a cross-platform Flutter dashboard.
📚 Learning Cloud (AWS/GCP) & dbt
👀 Open to Junior Data Engineer / Data Analyst roles
📍 Based in Guayaquil, Ecuador

🏆 Certifications & Awards

🎖️ Certification / Award 🏢 Issuer 📅 🔗
🌍 Galactic Problem Solver — Global Nominee NASA Space Apps Challenge Oct 2025 📄 Certificate
📊 PL-300: Power BI Data Analyst (In Progress) Microsoft 2026 🔄
📗 MO-210: Excel Associate (In Progress) Microsoft 2026 🔄

🚀 Featured Project — Highlight

End-to-End Data Engineering & Machine Learning Project

Simulating price optimization for ride-hailing apps using a data architecture with 1.2 Million records.

  • 🔧 ETL Architecture: Engineered an automated Python pipeline to ingest 1.2M+ raw records, using complex SQL JOINs to clean and consolidate a final dataset of ~600k verified trips in SQLite.
  • 🤖 Machine Learning: Trained a Random Forest Regressor to predict dynamic pricing (Baseline RMSE: $9.00).
  • 📊 Key Insight: Feature importance analysis revealed distance (>0.6) and surge_multiplier as the absolute dominant factors, proving granular weather data added unnecessary noise.
  • Tech Stack: Python, SQL, Pandas, Scikit-Learn, Plotly.

📁 Other Key Projects

Award: Galactic Problem Solver (Global Nominee)

  • Innovation: Built a full-stack web app analyzing 10 years of NASA satellite data across 195+ countries with <2s response time on interactive maps.
  • Impact: Developed MVP in a 48-hour hackathon, integrating real-time APIs to predict global extreme weather probabilities.
  • Tech: Python (Flask), React, TypeScript, Leaflet, Plotly.
 

End-to-end Data Engineering for Agriculture

  • Result: Engineered a Python ETL pipeline (covered by 14 unit tests) that modeled a strategic turnaround, projecting an ROI improvement from -5.58% to +15% (+20.6 pts) and a +75% boost in productivity.
  • Architecture: Built a robust MySQL -> Python -> JSON pipeline feeding a 5-page interactive dashboard for operational tracking.
  • Tech: MySQL, Python, Pandas, Pytest, JS/Bootstrap.
 

SQL Database Design & Query Optimization

  • Achievement: Optimized a 3NF MySQL database with composite indexes (idx_competencias_tipo_compid), reducing execution time by 40% for complex multi-table queries.
  • Scope: Processed historical performance data for 15 teams across 8 LATAM countries managing a $325,000 total prize pool.
  • Tech: MySQL 8.0, Advanced SQL (CTEs, Window Functions), Vanilla JS, Chart.js.
 

Business Intelligence

  • Insight: Analyzed sales distribution across 23 active sellers ($28.4K avg), uncovering a critical $16.66K performance gap between top and bottom performers.
  • Impact: Identified "Meat" as the top revenue driver ($80.05K) and Tulsa as the premier market (20 top clients), delivering actionable KPIs for data-driven decisions.
  • Tech: Power BI, DAX, Excel.

Scientific Research & Data Modeling

  • Validation: Built an automated R pipeline to validate a Negative Binomial Distribution model (k=3, p=0.3) on 309 observations, achieving a statistically significant p-value of 0.660.
  • Impact: Tracked a mean serve time of 1.945s (<2s threshold) and exported JSON/PNG assets into a dynamic JS web dashboard.
  • Tech: R (Tidyverse, ggplot2), HTML/CSS/JS.
 

🛠️ Technical Stack

Category Technologies
🔧 Data Engineering & Analysis Python SQL DuckDB Pandas Pandera
🤖 Machine Learning Scikit-Learn R
📊 Visualization & BI Power BI Tableau Plotly Excel
🌐 Web & App React Flask Flutter TypeScript
☁️ Cloud & DevOps GitHub Actions SQLite Vercel Git

📊 GitHub Stats


📈 Contribution Trend


⏱️ Weekly Coding Activity

Real-time stats powered by WakaTime — tracking every line of code I write.

**I'm a Night 🦉**
🌞 Morning                0 commits           ░░░░░░░░░░░░░░░░░░░░░░░░░   00.00 % 
🌆 Daytime                255 commits         █████████░░░░░░░░░░░░░░░░   35.56 % 
🌃 Evening                369 commits         █████████████░░░░░░░░░░░░   51.46 % 
🌙 Night                  93 commits          ███░░░░░░░░░░░░░░░░░░░░░░   12.97 % 

📅 I'm Most Productive on Saturday

Monday                   72 commits          ███░░░░░░░░░░░░░░░░░░░░░░   10.04 % 
Tuesday                  89 commits          ███░░░░░░░░░░░░░░░░░░░░░░   12.41 % 
Wednesday                140 commits         █████░░░░░░░░░░░░░░░░░░░░   19.53 % 
Thursday                 120 commits         ████░░░░░░░░░░░░░░░░░░░░░   16.74 % 
Friday                   28 commits          █░░░░░░░░░░░░░░░░░░░░░░░░   03.91 % 
Saturday                 186 commits         ██████░░░░░░░░░░░░░░░░░░░   25.94 % 
Sunday                   82 commits          ███░░░░░░░░░░░░░░░░░░░░░░   11.44 % 

📊 This Week I Spent My Time On

💬 Programming Languages: 
Markdown                 57 mins             ██████████████████░░░░░░░   71.85 % 
Dart                     18 mins             ██████░░░░░░░░░░░░░░░░░░░   22.84 % 
Python                   4 mins              █░░░░░░░░░░░░░░░░░░░░░░░░   05.31 % 

🐱‍💻 Projects: 
Technology-trend-analysis1 hr 20 mins        █████████████████████████   100.00 % 

Last Updated on 05/03/2026 01:04:09 UTC


🤝 Open to Opportunities

I'm a 7th-semester Computer Engineering student at ESPOL actively looking for Junior Data Engineer or Data Analyst roles where I can contribute from day one.

Pinned Loading

  1. Technology-trend-analysis-platform Technology-trend-analysis-platform Public

    Data intelligence platform for technology trends across GitHub, StackOverflow, and Reddit using Python ETL, Pandera quality gates, DuckDB trend engine, and Flutter Web.

    Python

  2. Analisis-Ping-Pong Analisis-Ping-Pong Public

    Automated statistical analysis pipeline using R to model ping pong serve precision with Negative Binomial distribution (309 observations). Includes interactive web dashboard.

    HTML 1

  3. Analisis-Cultivo-Arroz Analisis-Cultivo-Arroz Public

    End-to-end data engineering platform for agricultural analytics. ETL pipeline (Python) + Interactive dashboard (Chart.js) with KPIs, financial analysis, and strategic insights.

    HTML

  4. easyparker-pwa easyparker-pwa Public

    EasyParker es una PWA para reservar parqueo en Guayaquil | Modos: Conductor y Anfitrión | Chat tiempo real | Eventos con surge pricing | Calificaciones etc| React + TypeScript + Tailwind

    TypeScript

  5. eSports-Analytics-Dashboard eSports-Analytics-Dashboard Public

    JavaScript

  6. RideFare-ETL-Pipeline RideFare-ETL-Pipeline Public

    Jupyter Notebook