Problem

Demonstrate an end-to-end, reproducible workflow for market data analysis that cleanly separates modeling from interpretation.

Approach

This project implements a reproducible time-series analytics pipeline for commodity price data. The workflow ingests public market data, stores it in a local SQLite database, engineers time-series features, trains a simple baseline model, and evaluates performance on a held-out test set.

A local LLM (via Ollama) is used strictly for interpretation and reporting, generating an analyst-style note based on model outputs and metrics — not for prediction.

Impact

Shows practical data science skills: structured pipelines, baseline modeling, transparent evaluation, and analyst-style reporting with LLM assistance.

Tools and Methods

  • Python
  • Time-series analysis
  • SQLite
  • Snakemake
  • LLMs (Ollama)