Headshot of Jeffrey Brian Thompson

Building reliable data pipelines and turning complex data into clear, trusted insights.

About Me

I’m a data engineering and analytics consultant with a strong mathematical background. I enjoy working at the intersection of data engineering, analytics, and statistics, with a focus on building systems people can trust and understand.

I hold a Master of Advanced Study in Pure Mathematics (Part III) from the University of Cambridge and a First-Class BSc in Mathematics from the University of Surrey. This background gives me a rigorous approach to problem-solving and a strong intuition for data, structure, and modelling.

Alongside technical delivery, I place a strong emphasis on communication: explaining complex ideas clearly, supporting stakeholders, and producing work that is easy for others to maintain and build upon.

  • Python
  • SQL
  • Snowflake
  • dbt
  • Tableau
  • Power BI
  • AWS
  • Documentation

Skills & Tools

Programming & Data

  • Python, SQL
  • Data modelling and transformation
  • Statistical analysis and data exploration

Data Engineering

  • Automated data pipelines
  • Bronze–Silver–Gold / medallion architectures
  • Change Data Capture (CDC)

Platforms & Analytics

  • Snowflake, dbt
  • Tableau, Power BI, Alteryx

Cloud & Tooling

  • AWS (S3, EC2), Azure
  • GitHub

Selected Projects

Deep Neural Network (from scratch)

Python • Maths-first implementation

A Python implementation of a deep neural network with a focus on understanding the mathematics behind forward propagation, backpropagation, and optimisation.

  • Emphasis on clarity and learning rather than black-box usage
  • Built to deepen understanding of neural networks from first principles
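As a rough illustration of the ideas named above (not the repository's actual code), the sketch below runs forward propagation, backpropagation, and one gradient-descent step for a network with a single hidden layer. The layer sizes, the sigmoid activation, the squared-error loss, and all function names are assumptions chosen to keep the example short.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(params, x):
    """Forward pass: input -> sigmoid hidden layer -> sigmoid output."""
    W1, b1, W2, b2 = params
    h = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(W1, b1)]
    y = sigmoid(sum(w * hi for w, hi in zip(W2, h)) + b2)
    return h, y


def loss(params, x, target):
    """Squared-error loss 0.5 * (y - target)^2 (an assumed choice)."""
    _, y = forward(params, x)
    return 0.5 * (y - target) ** 2


def backward(params, x, target):
    """Backpropagation: apply the chain rule layer by layer."""
    W1, b1, W2, b2 = params
    h, y = forward(params, x)
    # dL/dz at the output unit, using sigmoid'(z) = y * (1 - y)
    delta_out = (y - target) * y * (1 - y)
    grad_W2 = [delta_out * hi for hi in h]
    grad_b2 = delta_out
    # Propagate the error back through the output weights to each hidden unit
    delta_hidden = [delta_out * w * hi * (1 - hi) for w, hi in zip(W2, h)]
    grad_W1 = [[d * xi for xi in x] for d in delta_hidden]
    grad_b1 = delta_hidden
    return grad_W1, grad_b1, grad_W2, grad_b2


def sgd_step(params, grads, lr):
    """Optimisation: one plain gradient-descent update."""
    W1, b1, W2, b2 = params
    gW1, gb1, gW2, gb2 = grads
    new_W1 = [[w - lr * g for w, g in zip(row, grow)]
              for row, grow in zip(W1, gW1)]
    new_b1 = [b - lr * g for b, g in zip(b1, gb1)]
    new_W2 = [w - lr * g for w, g in zip(W2, gW2)]
    new_b2 = b2 - lr * gb2
    return new_W1, new_b1, new_W2, new_b2
```

Calling `backward` and then `sgd_step` with a small learning rate should reduce `loss` on the same example; comparing the analytic gradients against finite differences is a simple way to check the maths.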

AI-based Vehicle Tracking & Counting

Computer vision • YOLO • Analytics

An end-to-end AI vehicle detection and tracking pipeline using YOLO, with motion analysis, border-based counting, unsupervised lane estimation, and analytics-ready outputs.

  • Persistent IDs, velocity vectors, and border-based events
  • Unsupervised lane estimation via clustering
  • Dimensional model designed for downstream analysis
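One way border-based events could be derived from tracked positions is sketched below. This is a minimal illustration, not the project's implementation: it assumes each persistent ID maps to a list of centroid coordinates in frame order, treats the border as an infinite line through two points, and uses hypothetical names (`side_of_line`, `count_crossings`).

```python
def side_of_line(point, a, b):
    """Return +1, -1 or 0 depending on which side of the line a->b the point lies."""
    (px, py), (ax, ay), (bx, by) = point, a, b
    cross = (bx - ax) * (py - ay) - (by - ay) * (px - ax)
    return (cross > 0) - (cross < 0)


def count_crossings(tracks, a, b):
    """Emit one event each time a track's centroid changes side of the border.

    tracks: dict mapping track ID -> list of (x, y) centroids in frame order
            (an assumed input shape).
    Returns a list of (track_id, frame_index, from_side, to_side) tuples.
    """
    events = []
    for track_id, points in tracks.items():
        prev_side = None
        for frame_index, point in enumerate(points):
            side = side_of_line(point, a, b)
            if side == 0:
                continue  # exactly on the border: wait for a clear side
            if prev_side is not None and side != prev_side:
                events.append((track_id, frame_index, prev_side, side))
            prev_side = side
    return events
```

Keeping the direction of each crossing (`from_side`, `to_side`) in the event makes it possible to count inbound and outbound traffic separately downstream.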

SQL & Data Preparation Work

SQL • Data prep • Querying

A collection of repositories and exercises focused on data preparation, querying, and structuring data for analysis and reporting.

Data Engineering

A small ecosystem of projects that fit together as a pipeline: API extraction (Bronze) → modelling in dbt (Silver/Gold). The extraction repos produce raw JSON data designed to feed the dbt analytics layer.

How these repos connect

Bronze → Silver → Gold

Tip: when viewing the repos, start with the dbt project for the “final shape”, then follow links upstream to the extraction pipelines.

dbt Cloud Analytics

Transformations • Modelling • Documentation

A dbt Cloud project snapshot modelling Amplitude and Mailchimp data into trusted, analytics-ready models using layered patterns, documentation, lineage, and testability.

Amplitude Extraction

API ingestion • Parsing • S3

Extracts raw, event-level product analytics data from the Amplitude Export API, normalises raw files, and uploads to S3 for downstream modelling.

Mailchimp Extraction

Campaigns • Engagement • S3

Extracts campaign metadata and email engagement activity from the Mailchimp Marketing API, writes JSON, uploads to S3, and logs retries/errors.
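The retry and error logging mentioned above could look something like the following sketch. It is a generic pattern rather than the repository's code: the helper name `call_with_retries`, the exponential backoff schedule, and the logger name are all assumptions, and `fn` stands in for whatever function performs the API call.

```python
import logging
import time

logger = logging.getLogger("extraction")  # hypothetical logger name


def call_with_retries(fn, max_attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call fn(), retrying on any exception with exponential backoff.

    Each failure is logged as a warning; the final failure is logged as an
    error and re-raised so the caller can decide how to handle it.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except Exception as exc:
            logger.warning("attempt %d/%d failed: %s", attempt, max_attempts, exc)
            if attempt == max_attempts:
                logger.error("giving up after %d attempts", max_attempts)
                raise
            # Wait base_delay, then 2x, 4x, ... before the next attempt
            sleep(base_delay * 2 ** (attempt - 1))
```

Passing `sleep` as a parameter keeps the helper easy to test without real waiting. In practice it would usually make sense to retry only on transient errors such as timeouts or rate limits, not on every exception.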

TfL BikePoint Ingestion

API → JSON → S3

A lightweight ingestion example demonstrating extraction, cloud loading, and a repeatable pattern suitable for scheduling — designed to feed downstream analytics modelling.
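The API → JSON → S3 pattern can be sketched roughly as below. This is an illustration under stated assumptions, not the repository's code: the endpoint URL, the date-partitioned key layout, and the function names are hypothetical, and `s3_client` is assumed to be a boto3 S3 client (or any object with a compatible `put_object` method).

```python
import json
import urllib.request
from datetime import datetime, timezone

BIKEPOINT_URL = "https://api.tfl.gov.uk/BikePoint"  # assumed endpoint


def fetch_json(url=BIKEPOINT_URL, timeout=30):
    """Extract: download and parse the JSON response."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


def build_key(prefix, now):
    """Build a date-partitioned object key so each run writes a new file."""
    return f"{prefix}/{now:%Y/%m/%d}/bikepoint_{now:%Y%m%dT%H%M%SZ}.json"


def ingest(s3_client, bucket, prefix="bikepoint", fetch=fetch_json, now=None):
    """Load: serialise the payload and upload it to S3; returns the key used."""
    now = now or datetime.now(timezone.utc)
    payload = fetch()
    key = build_key(prefix, now)
    body = json.dumps(payload).encode("utf-8")
    s3_client.put_object(Bucket=bucket, Key=key, Body=body)
    return key
```

Because every run writes to a timestamped key, the function can be scheduled repeatedly without overwriting earlier extracts, which suits a raw (Bronze) landing area.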

Tutorials and follow-alongs

Short, practical guides you can fork and run locally. These are written as “follow-along” repos to help you get set up quickly and learn by doing.

Getting started with GitHub and Python

Git • VS Code • Windows

A quick, fully self-contained setup guide for getting started with Python and GitHub in VS Code, including configuring your first project with a virtual environment and Git repository.

Gemini API in Python

Python • VS Code • .env

A walkthrough for setting up a Python environment in VS Code and making Gemini API calls, including one-shot prompts, chat, and streaming output.

Draft Portfolio Website

HTML/CSS • GitHub Pages

A clean, minimal personal website template designed to be easy to customise and deploy. Includes a fast workflow for filling content using AI tools.

LaTeX in VS Code

Windows • LaTeX workflow

A quick setup guide for writing LaTeX in VS Code on Windows, with a repo structure that keeps PDFs tracked and auxiliary files ignored. Includes a small SymPy example to support maths workflows.

Tables in MATLAB

MATLAB • Data structures

A quick intro to creating, storing, reading, and manipulating MATLAB tables (including mixed data types), with console output to support learning.

Writing & Knowledge Sharing

Data & Analytics Writing

Blog posts • Practical insight

I regularly write about data topics with a focus on accessibility and practical understanding. Topics include:

  • Data preparation and modelling
  • Granularity, aggregation, and joins
  • Storytelling with data
  • BI best practices

Mathematical Writing

Pure mathematics • Part III

A self-contained mathematical essay proving the Hodge Decomposition Theorem, with a geometry-first extension to manifolds with boundary.

Tutoring & Educational Materials

A-level mathematics

Supplementary A-level mathematics notes developed through tutoring, focusing on method selection, reasoning, and clear worked solutions.

Research & Academic Work

Graph theory • arXiv

Co-author of a graph theory research paper exploring structural properties of graphs: Graphs with equal girth and circumference.

Experience Highlights

Data Engineering & Analytics Consulting

Client-facing delivery • Pipelines • BI
  • Delivering automated data pipelines for analytics and BI.
  • Client-facing analytics projects across multiple tools and domains.
  • Experience leading and coordinating project work.
  • Strong documentation and handover practices.

Teaching & Communication

Tutoring • Stakeholders • Documentation
  • Mathematics tutoring (A-level).
  • Explaining technical concepts to non-technical audiences.
  • Producing educational content and internal documentation.

Certifications

Tableau • Alteryx
  • Tableau Certified Data Analyst
  • Alteryx Designer Core Certification

Music & Outside the Office

Music

Producer • Published work

Beyond data and code, I’m a music producer with published work.

Buy me a coffee

If you’ve found my work helpful, you can support me here—thank you!

☕ Buy me a coffee

Your support helps me dedicate time to open-source projects, writing, and learning.

Contact

Reach me via: LinkedIn • GitHub • Blog