Skip to content
View Elly-su's full-sized avatar
:octocat:
building
:octocat:
building
  • Nairobi
  • 10:44 (UTC +03:00)

Block or report Elly-su

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Elly-su/README.md

Elly Ochieng' Kimbero

Big Data Engineer | Architecting Scalable, Reliable Data Platforms


Core Competencies

Languages & Querying

Core scripting, systems programming, and advanced querying for data engineering.


Data Processing & Streaming

Building robust batch and real-time pipelines at scale.


Cloud & Data Platforms

Multi-cloud architecture and modern warehousing solutions.


APIs, Orchestration & Infrastructure


Engineering Practices

  • Orchestration and Workflow: Apache Airflow, Prefect, Dagster
  • Data Transformation: dbt, SQL
  • Infrastructure and Deployment: Docker, Kubernetes, Terraform
  • CI/CD: GitHub Actions, Azure DevOps, Jenkins
  • Version Control: Advanced Git workflows
  • Monitoring and Reliability: Structured logging, validation, alerting
  • Documentation: Pipeline lineage, runbooks, data dictionaries

Featured Project

AI-Powered ETL Data Engineering Pipeline

Designed and architected a modular ETL pipeline using Airflow, Docker, Python, and PostgreSQL with an interactive Streamlit interface.

Impact:

  • Automated ingestion, transformation, and validation workflows
  • Integrated LLM-powered metadata enrichment and reporting
  • Containerized services for reproducibility and deployment readiness

Key Engineering Focus:

  • DAG-based orchestration
  • Idempotent transformations
  • Structured logging and pipeline observability
  • Production-grade project structure

Collaboration

Open to building scalable data systems and discussing distributed systems, cloud architecture, and systems programming.

Pinned Loading

  1. Login Login Public

    Secure login/registration system with PHP, MySQL, bcrypt hashing, and SQL injection protection.

    PHP 5

  2. traffic-congestion-prediction traffic-congestion-prediction Public

    ML-powered urban traffic prediction system achieving 82% accuracy. Built with Python and scikit-learn using real traffic, weather, and event data. Includes complete data science pipeline and visual…

    Python 4

  3. tiktok-reach-analysis tiktok-reach-analysis Public

    Analyze TikTok reach patterns using Python, ML & statistics. Predicts video performance with 70%+ accuracy. 📊

    Jupyter Notebook 1