I haven't read the whole thing, but I really enjoyed the introduction and agree completely. The RDBMS I've spent the most time with over the years is Microsoft SQL Server (from the moment 2000 was released), and over the years I've been surprised by two things: (a) how many developers work with databases every day yet haven't the foggiest idea how to do anything the GUI tooling doesn't handle "easily", and (b) how quickly a developer can go from zero to "able to keep things performing well for all but the ugliest scenarios" knowledge-wise.
I've watched that last part unfold on many occasions. Usually I'd be brought in because the developers had done everything up to and including "copying the data to denormalized tables[0]" to try to sort out slowness. Most of the time, just blindly investigating the schema will turn up something frightening that the ORM did, or a mess of inappropriate indexes. In one case, adding two indexes to a table used by nearly every query in the database turned 30-second requests into sub-second responses[1]. I'm never willing to walk in and promise that, but I can't think of a time it hasn't happened when similar circumstances were presented to me.
So if you're dragging your feet about learning SQL, take this as my encouragement: it's one of those things where the rewards come quickly and the effort is far less than you probably expect.
[0] ... with a broken sync process that has to be run carefully b/c it hammers the already over-sized database every time it fires.
[1] If memory serves, it was an account table ... the app and the database passed a GUID back and forth, but the primary key was an auto-incrementing integer field which was used as the FK in other tables. All I remember is adding a unique index to the GUID field and including the e-mail address/name columns that appeared in every query. Ran the fix in production and it felt like "the dam broke".
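For the curious, here's a hypothetical reconstruction of that kind of fix, using SQLite so it's self-contained (the original was SQL Server; the table and column names here are invented):

```python
import sqlite3

# The situation described in [1]: the app looks rows up by GUID, but only
# the integer PK is indexed, so every lookup walks the whole table.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE account (
        id    INTEGER PRIMARY KEY AUTOINCREMENT,  -- FK target for other tables
        guid  TEXT NOT NULL,                      -- what the app queries by
        email TEXT,
        name  TEXT
    )
""")

query = "SELECT email, name FROM account WHERE guid = ?"

# Before the fix: the planner has no choice but a full table scan.
before = conn.execute("EXPLAIN QUERY PLAN " + query, ("abc",)).fetchall()
assert any("SCAN" in row[3] for row in before)

# The fix: a unique index on guid that also carries the columns every
# query selects, so lookups never touch the base table. SQLite lacks SQL
# Server's INCLUDE clause, so the columns go into the index key instead
# (note: that makes the whole tuple unique, not guid alone).
conn.execute("CREATE UNIQUE INDEX ix_account_guid ON account (guid, email, name)")

after = conn.execute("EXPLAIN QUERY PLAN " + query, ("abc",)).fetchall()
assert any("COVERING INDEX" in row[3] for row in after)
```

"Covering index" is the search term if you want the general technique: the index alone answers the query, so the 30-seconds-to-sub-second jump isn't as magical as it sounds.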
1. Trace your application, so you know which query is running slowly in production.
2. Using this exact query, run `EXPLAIN ANALYZE` or whatever your RDBMS equivalent is.
3. Read the output and see which step is taking the longest. Use Google to help.
4. Google-fu until you find out which index might help you.
Over time, (3) and (4) require less and less Google, because there are really only a few common cases that give you 100% of the speedup in 80% of cases.
Once you know this path to improvement exists, it's trivial to progress down it habitually. The `EXPLAIN ANALYZE` output looks like Greek at first, but quickly becomes as familiar to parse as a compiler error, etc.
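The loop above can be sketched concretely. Here's a toy version, with SQLite's `EXPLAIN QUERY PLAN` standing in for `EXPLAIN ANALYZE` and an invented table and query:

```python
import sqlite3

# Steps 2-4 in miniature: run the planner on the exact slow query, spot
# the full scan, add the index that fixes it. (SQLite stands in for your
# real RDBMS; table and column names are made up.)
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)")
conn.executemany("INSERT INTO orders (customer_id, total) VALUES (?, ?)",
                 [(i % 100, i * 1.5) for i in range(1000)])

slow_query = "SELECT total FROM orders WHERE customer_id = ?"

# Step 3: read the plan. A detail line containing "SCAN" means every row
# in the table gets visited for each request.
plan_before = conn.execute("EXPLAIN QUERY PLAN " + slow_query, (42,)).fetchone()[3]
print(plan_before)

# Step 4: an index on the filtered column turns the scan into a search.
conn.execute("CREATE INDEX ix_orders_customer ON orders (customer_id)")
plan_after = conn.execute("EXPLAIN QUERY PLAN " + slow_query, (42,)).fetchone()[3]
print(plan_after)
```

The plan output really is that short for simple queries, which is why the "Greek" phase passes quickly.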
DB 'expert' here - agree completely. It's about profiling then banging your head against it and learning more from books/the docs/web. That's really it.
If you're really starting out from square 1, I might read this book first: https://www.amazon.com/Database-Design-Mere-Mortals-Annivers... (although I read an earlier edition). The Art of SQL is a great book, but I wish I'd read something a bit...easier...first, lol. I struggled with it.
Yeah well, I'm old school (or at least old), so I already had a good grasp of SQL, but I think the book is good because it explains the overarching concepts very well.
The details depend heavily on what specific database you’re working with, but in broad strokes, you’ll want to know: indices, query plans, when and where to cache expensive computations, every f—- word of your DB manual’s chapter on concurrency control, and the fastest method(s) for yeeting data in and out from your language/toolkit of choice.
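On that last point, the single biggest "yeeting data in" win is usually batching. A hedged sketch, again with SQLite (on Postgres you'd reach for COPY, on SQL Server for BCP/bulk insert; the table name here is invented):

```python
import sqlite3

# Bulk load: one executemany inside a single transaction, instead of an
# auto-committed INSERT per row. Per-row commits force a sync/round-trip
# each time, which is what usually kills naive import scripts.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER, payload TEXT)")

rows = [(i, f"event-{i}") for i in range(10_000)]
with conn:  # one transaction for the whole batch
    conn.executemany("INSERT INTO events VALUES (?, ?)", rows)

count = conn.execute("SELECT COUNT(*) FROM events").fetchone()[0]
print(count)  # 10000
```

Whatever your toolkit calls it (executemany, batch statements, bulk copy), it's worth finding before you write a loop of single-row inserts.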
The last few years I've been working on a fairly big django project written by someone learning the problem domain and python/django at the same time.
I have a lot of similar stories, and you are absolutely right to encourage people to learn the basics, at least (looking at you, indexes and perf tools).
We're talking about our CPU hitting 80-90% and the internal temperature rising to 70-80°C, while executing a task that's normally performed tens or hundreds of times during the workday.
A couple of carefully planned indexes and a bit of SQL shuffling is all it takes sometimes.
The upside of working with cheap bare metal servers is that you catch these things early on (5 concurrent users and your server is toasted). Fun times.