Back to Ideas 1 min read

Stage, Then Transform: Why Your Crawler Should Never Touch Production Tables

E
EkoHacks

Stage, Then Transform: Why Your Crawler Should Never Touch Production Tables

When you pull data from an external source, a public register, a CSV feed, a scraped site, there is one decision that quietly determines whether the pipeline will still be working in six months. It is not “which library”, “which schedule”, or “which database”. It is: where does the data land first?

Most pipelines answer that question badly. They crawl a source, massage the rows in-memory, and write straight into the production table the app already reads from. It works on day one. It breaks on day ninety.

This post is the pattern I wish I had internalised before I wrote my first crawler, explained with the pipeline we are building right now at Propi: matching first-time buyers to UK conveyancers using public SRA and Law Society data.

The shortcut that bites

E

Written by

EkoHacks

More from Ideas

·6 min read

The Website That Told the Same Story Twice

Our home and story pages felt like one story stretched across two screens. Rather than argue about it by taste, we modelled our own words as data, asked a small database one question, and let it show us exactly what we were repeating.

E
EkoHacks
The Workflow Is the Lesson
·4 min read

The Workflow Is the Lesson

We are moving our content out of a cloud CMS and into git. Here is why a school that teaches a keyboard first, code first way of working should practise it in the open, and the path we are taking to get there.

E
EkoHacks
Dockerising a NestJS and Nuxt Monorepo with pnpm
·6 min read

Dockerising a NestJS and Nuxt Monorepo with pnpm

A practical guide to Dockerising a NestJS and Nuxt monorepo using pnpm, with separate multi stage Dockerfiles for development and production and a single Docker Compose setup for consistent, reproducible environments.

O
Ogochukwu Okpala
EkoHacks

EkoHacks is a coding institute teaching employable software engineering through real projects. We build civic technology and train developers, born in Nigeria, launching in rural Greece. Global tech, local impact.

Stay Tuned

Be the first to hear about new dojos, insights, and opportunities from EkoHacks.