Skip to content
View timothynn's full-sized avatar
🤓
Locked In
🤓
Locked In

Block or report timothynn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
timothynn/README.md
╔══════════════════════════════════════════════════════════════════════════════╗
║                                                                              ║
║                ████████╗██╗███╗   ███╗ ██████╗ ████████╗██╗  ██╗██╗   ██╗    ║
║                ╚══██╔══╝██║████╗ ████║██╔═══██╗╚══██╔══╝██║  ██║╚██╗ ██╔╝    ║
║                   ██║   ██║██╔████╔██║██║   ██║   ██║   ███████║ ╚████╔╝     ║
║                   ██║   ██║██║╚██╔╝██║██║   ██║   ██║   ██╔══██║  ╚██╔╝      ║
║                   ██║   ██║██║ ╚═╝ ██║╚██████╔╝   ██║   ██║  ██║   ██║       ║
║                   ╚═╝   ╚═╝╚═╝     ╚═╝ ╚═════╝    ╚═╝   ╚═╝  ╚═╝   ╚═╝       ║
║                                                                              ║
║                           Timothy Nduati · @timothynn                        ║
║                                                                              ║
╚══════════════════════════════════════════════════════════════════════════════╝

CS Grad • Azure Certified Data Scientist • Software Engineer → Data Engineer

Building the bridge between code and insights


🎯 The Journey

From writing algorithms to architecting data flows. From debugging code to debugging insights.

I'm on a mission to transform how organizations leverage their data—combining software engineering rigor with data science intuition to build systems that don't just work, but scale.

Current Chapter: Transitioning from software engineering into data engineering, bringing battle-tested engineering practices to the data world.


🧰 Arsenal

🔬 Data Science

skills = {
    'ml_frameworks': ['scikit-learn', 'tensorflow', 'pytorch'],
    'cloud_ml': ['Azure ML Studio', 'Azure Cognitive Services'],
    'analysis': ['pandas', 'numpy', 'scipy'],
    'visualization': ['matplotlib', 'seaborn', 'plotly'],
    'bi_tools': ['Power BI', 'Tableau']
}

Certified: Microsoft Azure Data Scientist Associate

⚙️ Data Engineering

toolkit = {
    'orchestration': ['Airflow', 'Azure Data Factory'],
    'processing': ['Spark', 'Kafka', 'dbt'],
    'storage': ['PostgreSQL', 'Azure SQL', 'Cosmos DB'],
    'cloud': ['Azure', 'AWS', 'GCP'],
    'streaming': ['Kafka', 'Azure Event Hubs']
}

Learning: Modern data stack, real-time pipelines

💻 Software Engineering

foundation = {
    'languages': ['Python', 'Java', 'C++', 'JavaScript'],
    'paradigms': ['OOP', 'Functional', 'Concurrent'],
    'practices': ['TDD', 'CI/CD', 'Code Review'],
    'tools': ['Git', 'Docker', 'Kubernetes']
}

Philosophy: Clean code, scalable systems

🛠️ DevOps & Infrastructure

automation = {
    'containers': ['Docker', 'Kubernetes'],
    'cicd': ['GitHub Actions', 'Azure DevOps'],
    'monitoring': ['Prometheus', 'Grafana'],
    'iac': ['Terraform', 'ARM Templates']
}

Focus: MLOps, DataOps, automation


💭 Philosophy

Code is temporary. Data systems are forever.

Good data engineering isn't just about moving data from A to B—it's about building resilient, self-healing systems that handle chaos gracefully. It's software engineering principles applied to the messiest resource we have: real-world data.

My Principles:

  • 🎯 Quality over speed → But never sacrifice both
  • 🔄 Automate everything → Including the automation
  • 📊 Data tells stories → Make sure they're true
  • 🧪 Test in production → Just kidding. Test before production.
  • 🚀 Ship iteratively → Perfect is the enemy of shipped

🎨 Currently Crafting

  • 🏗️ Building a real-time data pipeline for streaming analytics
  • 📚 Deep diving into distributed systems and data modeling
  • 🤖 Implementing MLOps practices for production ML systems
  • 🌱 Contributing to open-source data tools
  • 📝 Writing about lessons learned in data engineering

📊 By The Numbers

Profile Views


🤝 Let's Connect

I'm always interested in discussing data architecture, ML systems, or how to make data pipelines that don't keep you up at night.

LinkedIn Twitter Mastodon Matrix Email


🎓 Learning Path

graph LR
    A[Software Engineering] --> B[Data Science]
    B --> C[Machine Learning]
    C --> D[Data Engineering]
    D --> E[Distributed Systems]
    
    style A fill:#0D1117,stroke:#58A6FF,stroke-width:2px
    style B fill:#0D1117,stroke:#58A6FF,stroke-width:2px
    style C fill:#0D1117,stroke:#58A6FF,stroke-width:3px
    style D fill:#0D1117,stroke:#F97316,stroke-width:3px
    style E fill:#0D1117,stroke:#F97316,stroke-width:2px
Loading

Current Focus: Data Engineering & Distributed Systems
Next Up: Stream Processing & Real-time Analytics


💡 Featured Projects

Project Description Tech Stack
🐧 Palmer Penguins Clustering ML clustering analysis on penguin species data Python, Scikit-learn, Pandas
🔄 Data Pipeline Framework ETL framework for automated data ingestion Python, Airflow, Azure
📊 Real-time Analytics Dashboard Live data visualization platform Python, Kafka, Power BI

More projects coming soon...


"The best time to start building data systems was yesterday. The second best time is now."

⬆ back to top

Pinned Loading

  1. customer-churn-prediction-system customer-churn-prediction-system Public

    Predict which customers are likely to close their accounts and identify retention strategies.

    Python

  2. mini-data-warehouse mini-data-warehouse Public

    First mini data warehouse project

    HTML

  3. trading-pipeline trading-pipeline Public

    High-performance data pipeline processing market data streams with Apache Kafka, Redis, and PostgreSQL. Features risk management, backtesting framework, and real-time analytics dashboard.

    Python

  4. credit-risk-prediction-model credit-risk-prediction-model Public

    🏦 A comprehensive machine learning solution for predicting loan default probability using customer financial data, built with Nix Flakes for reproducible development.

    Python