Discern or define
Door Gert-Jan van de Streek / feb 2012 / 1 Min
Door Gert-Jan van de Streek / / 1 min
The classic way that we use databases is oriented towards the amount of available storage that we have for data. It's time to rethink that pattern, now the cost of storage sinks so fast that it must converge to free or nearly free at some point.
Consider the following record in a database:
residence | Oldebroek |
The traditional pattern for updating this record is to replace the value in place:
residence |
Why have we always done this? Well, in fact we haven't. When storage was still simply a piece of paper, we simply slipped in a new piece of paper in the folder and that would be the new valid file, plus as a bonus, you had a history over time.
Now that the cost of storage is not a problem anymore, we should go back that paradigm. The above example would translate to:
residence | point in time 1 | Oldebroek |
residence | point in time 2 | Mastenbroek |
Note that we added a perception of time to the record to be able to figure out what the prevailing version of the record is. Think a bit what this means for your application. You simply store a new fact at a point in time without erasing the past.
You can implement this paradigm into your existing database by adding it to your design, or you can use database technology that does this by design. You might find Sesame or Apache Jena interesting. Alternatively you could look into Datomic.
Interested in Datomic?
|
Door Gert-Jan van de Streek / okt 2024
Dan denken we dat dit ook wat voor jou is.