nate berkopec's latest activity

Author, The Complete Guide to Rails Performance. Co-maintainer of Puma. speedshop.co (he/him)

Here's a demonstration of how IO/CPU interact with the GVL to affect the throughput of your Puma or Sidekiq application. Give it a run (gem install parallel first) and see what happens! You can also try removing the GVL by making Parallel use processes instead of threads.

gist.github.com/nateberkopec/b
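A minimal version of the same experiment, using bare threads instead of the parallel gem (workload sizes here are illustrative, not from the gist):

```ruby
require "benchmark"

# CPU-bound work holds the GVL: only one thread runs Ruby code at a time.
cpu_work = -> { 200_000.times { Math.sqrt(rand) } }
# IO-bound work (here, sleep) releases the GVL, so threads overlap fully.
io_work  = -> { sleep 0.1 }

def run_threads(n, work)
  Benchmark.realtime { Array.new(n) { Thread.new(&work) }.each(&:join) }
end

# Four 100ms sleeps overlap: wall time stays near 0.1s, not 0.4s.
# Four CPU jobs serialize on the GVL: wall time is roughly 4x one job.
puts format("io: %.2fs  cpu: %.2fs", run_threads(4, io_work), run_threads(4, cpu_work))
```

Swapping Thread.new for fork (or Parallel's in_processes:) takes the GVL out of the picture and lets the CPU-bound case scale too.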

This is why your load test is a lie.

This is what real prod traffic looks like: 200 rps one second, 500 rps the next, ping-ponging around from moment to moment. Uneven arrivals like this are much harder to deal with than fake, synthetic load-test requests.

If you've got *1 million* concurrent users, saving $2 million/year in infra is hopefully not the most important thing for that business.

Had a little “lost my yubikey” scare. Now I’ve done what I should have done in the first place: made it hard to misplace and have two to begin with!

The easiest way to spout bullshit about performance is to talk in relative terms only (this is 3x faster than before!) without reference to the absolute.

Great, your new code is 3x faster. But it runs at 3 million iterations/sec and we only call it once.

Is there a compelling argument for _not_ always using YJIT locally/in development?

I think most people aren't using it.
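One low-friction way to try it locally is the RUBY_YJIT_ENABLE=1 environment variable (Ruby 3.2+). You can verify what you're actually running with a quick check like this:

```ruby
# Report whether this interpreter was built with YJIT, and whether it's on.
if defined?(RubyVM::YJIT)
  puts RubyVM::YJIT.enabled? ? "YJIT enabled" : "YJIT available but off (set RUBY_YJIT_ENABLE=1)"
else
  puts "this Ruby was built without YJIT"
end
```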

Imagine that one of the DBs for your app suddenly had 100ms added to every call. You currently access this DB 1 to 30 times per transaction.

What would you do to compensate for this added latency?

I've written a ~500 line web application load simulator in Ruby. You give it the number of servers, processes, threads, p50 and p95 response times, # of db VCPU, and I/O wait %, and it Monte Carlo simulates your maximum possible req/sec.

Deploying as a tool for retainer clients soon.
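Not the tool itself, but the core idea fits in a few lines: fit a distribution to the p50/p95, sample service times, and divide worker count by the mean. Everything below (the function name, the lognormal choice, the parameters) is my illustrative assumption, not the speedshop simulator:

```ruby
# Toy Monte Carlo throughput estimate. Assumes lognormal service times
# fitted to two percentiles; z(0.95) is about 1.645.
def estimated_max_rps(workers:, p50:, p95:, samples: 20_000)
  mu    = Math.log(p50)                  # lognormal median = exp(mu)
  sigma = (Math.log(p95) - mu) / 1.645   # solve for sigma from the p95
  total = samples.times.sum do
    # Box-Muller transform: two uniforms -> one standard normal.
    z = Math.sqrt(-2 * Math.log(1 - rand)) * Math.cos(2 * Math::PI * rand)
    Math.exp(mu + sigma * z)             # one sampled service time, seconds
  end
  workers / (total / samples)            # each worker does 1/mean req/sec
end

puts estimated_max_rps(workers: 16, p50: 0.05, p95: 0.3).round
```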

Underrated/missed change from Dima Fatko to basecamp/marginalia:

github.com/basecamp/marginalia

I'd profiled the previous version, which used caller, and concluded that capturing line numbers was too expensive as a result. caller_locations is a newer API, and this change should make a big difference!
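The difference is easy to see in a microbenchmark: caller formats every frame into a String, while caller_locations returns lightweight Thread::Backtrace::Location objects (the recursion depth and iteration counts below are arbitrary):

```ruby
require "benchmark"

# Build a deep call stack, then capture a backtrace at the bottom of it.
def deep(n, &blk)
  n.zero? ? blk.call : deep(n - 1, &blk)
end

# caller allocates a formatted String per frame; caller_locations does not.
strings = Benchmark.realtime { 5_000.times { deep(50) { caller } } }
locs    = Benchmark.realtime { 5_000.times { deep(50) { caller_locations } } }
puts format("caller: %.3fs  caller_locations: %.3fs", strings, locs)
```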

The cost of setting pools too low is obvious: high latency, caused by concurrent threads blocking while checking out a connection.

The cost of setting them too high is that you don't catch leaks. But leaks have been far less of an issue in recent years, and there are probably better ways to detect them.
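The too-low cost is easy to reproduce with a toy pool: a Queue standing in for ActiveRecord's connection pool, with sizes chosen purely for illustration:

```ruby
require "benchmark"

pool = Queue.new
2.times { |i| pool << "conn-#{i}" }   # a pool of size 2

# Four threads each hold a connection for 100ms. With only two
# connections, two threads block on checkout, doubling wall time.
elapsed = Benchmark.realtime do
  Array.new(4) {
    Thread.new do
      conn = pool.pop    # checkout: blocks when the pool is empty
      sleep 0.1          # the "query"
      pool << conn       # checkin
    end
  }.each(&:join)
end
puts format("%.2fs", elapsed)   # roughly 0.2s here; ~0.1s with a pool of 4
```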

sorry, RMT = RAILS_MAX_THREADS or whatever you use to set your puma/sidekiq concurrency

I'm wondering if database pools should always be set to 25 conns.

Puma/Sidekiq is not the only source of concurrency. load_async, Parallel, Thread.new, fibers, etc. So RMT + 5 doesn't make sense.

25 is low enough to catch leaks, high enough to allow concurrency

Check out this before/after shot of our retainer client deploying a bunch of missing foreign key indexes identified by ids_must_be_indexed.

github.com/speedshop/ids_must_

mosh --predict=experimental is CRAZY good for removing latency on SSH connections. I will probably never use ssh again.

If you want to limit concurrency to an external HTTP API, create a remote gateway class and put the limiter THERE, not on background jobs that access the API.

It's really common for teams to end up with a spaghetti of locks on jobs that end up over or under throttling the API calls. Have one lock, in one place, not on the job.
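A sketch of the shape I mean, with a hypothetical gateway class and limit, and a SizedQueue of tokens standing in for whatever limiter you actually use:

```ruby
# Hypothetical gateway: the ONLY place that talks to the external API,
# and therefore the only place the concurrency limit lives.
class WeatherApiGateway
  TOKENS = SizedQueue.new(5)    # at most 5 calls in flight
  5.times { TOKENS << :token }

  def self.fetch(city)
    token = TOKENS.pop          # blocks when 5 calls are already in flight
    # A real Net::HTTP call would go here; we fake the response.
    { city: city, temp_c: 21 }
  ensure
    TOKENS << token if token    # always return the token
  end
end

# Jobs just call the gateway; none of them carry their own lock.
puts WeatherApiGateway.fetch("Lisbon")
```

Because every caller funnels through one choke point, adding a new job class can never over- or under-throttle the API.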

You should know about hyperfine:

github.com/sharkdp/hyperfine
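A typical invocation, comparing two commands with warmup runs (the commands shown are just examples):

```shell
# Warm caches with 3 untimed runs, then benchmark each command;
# hyperfine reports mean and standard deviation, and which command
# is how many times faster.
hyperfine --warmup 3 'ruby -e ""' 'ruby --yjit -e ""'
```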

VERY common error with newbies and profiling:

They don't check that the thing they're profiling actually does what they think it does.

You end up profiling a command and accidentally profiling an error pathway instead of the real thing. ALWAYS check the output!

TIL: Bundler's job parallelization uses threads, not processes

github.com/rubygems/rubygems/b

The default is "the number of available processors", but processor count has nothing to do with the optimal number here: because of the GVL, only one processor will ever be used.
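Since installs are mostly network-bound, you can tune the job count to your network rather than your core count, e.g.:

```shell
# A thread count well above the core count is fine for network-bound
# installs; the GVL means they'd never use more than one core anyway.
bundle install --jobs 8

# Or persist it for this project:
bundle config set --local jobs 8
```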

1
Share
Share on Mastodon
Share on Twitter
Share on Facebook
Share on Linkedin
Replies