I just trained a physics-based earthquake forecasting model on a $1000 GPU

13 points by ArchitectAI 20 hours ago

So I've been working on this seismic intelligence system (GSIN) and I think I accidentally made data centers kind of obsolete for this type of work. Let me explain what happened. The Problem: Earthquake forecasting sucks. The standard models are all statistical bullshit from the 80s. They don't understand physics, they just pattern match on historical data. And the few ML attempts that exist? They need massive compute clusters or AWS bills that would bankrupt a small country. I'm talking researchers spending $50k on cloud GPUs to train models that still don't work that well. Universities need approval from like 5 committees to get cluster time. It's gatekept as hell. What I Built: I took 728,442 seismic events from USGS and built a 3D neural network that actually understands how stress propagates through rock. Not just pattern matching - it learns the actual physics of how earthquakes trigger other earthquakes. The architecture is a 3D U-Net that takes earthquake sequences and outputs probability grids showing where aftershocks are likely. It's trained on real data spanning decades of global seismic activity. Here's the crazy part: The entire training pipeline runs on a single RTX 5080. $1000 GPU. Not a cluster. Not AWS. Just one consumer card.

Pre-loads all 15GB of training data into RAM at startup Zero disk reads during training (that's the bottleneck everyone hits) Uses only 0.2GB of VRAM somehow Trains 40 epochs in under 3 hours Best validation Brier score: 0.0175

For context, traditional seismic models get Brier scores around 0.05-0.15. Lower is better.

codemusings 5 hours ago

Right. Because compute power and/or a physics based model is the limiting factor for accurately predicting when a seismic event happens. Training on historic data is hardly the problem that need's solving.

It's the leading indicators that are actually measurable that are missing. You know the ones that allow for evacuations and other protective measures.

qmarchi 17 hours ago

Given the swath of sensors that Japan has, and the long history of a lot of them. I do wonder what the result of training off their datasets would be.

Grosvenor 19 hours ago

This is interesting. Can you share the model/github?

Woberto 17 hours ago

So when/where is the next big one coming?

highd 14 hours ago

How are you doing your train/test split?