Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Bjoern: A screamingly fast Python WSGI server written in C. (github.com/jonashaag)
52 points by jgalvez on Dec 24, 2010 | hide | past | favorite | 30 comments


I think all emphasis on server software speed is overrated. In pretty much everything but hello world, the vast majority of the time is spent inside the web app, not in the server.

Yes, lowering the web server overhead is a good thing. But I think in most cases it's already so low that further reducing this overhead results in no noticeable impact in real-life scenarios. At some point you'll just be benchmarking how fast the kernel is at doing connect(), read() and write() - in other words how fast your computer can do nothing.

For example, let's consider this thought experiment:

Someone here mentioned Mongrel2 getting 4000 req/sec. Let's replace the name "Mongrel2" with "Server A" because this thought experiment is not limited to Mongrel2, but all servers. I assume he's benchmarking a hello world app on his laptop. Suppose that a hypothetical Server B gets "only" 2000 req/sec. One might now (mistakenly) conclude that:

- Server B is a lot slower.

- One should use Server A instead of Server B in high-traffic production environments.

Now put Server A behind HAProxy. HAproxy is known as a high-performance HTTP proxy server with minimal overhead. Benchmark this setup, and watch req/sec drop to about 2000-3000 (when benchmarked on a typical dual core laptop).

What just happened? Server B appears to be very slow. But the reality is that both Server A and Server B are so fast that doing even a minimum amount of extra work will have a significant effect on the req/sec number. In this case, the overhead of an extra context switch and a read()/write() call to the kernel is already enough to make the req/sec number drop by half. Any reasonably complex web app logic will make the number drop so much that the performance difference between the different servers become negligible.


Why on earth should developers ever say "Ok, it's fast enough. We're done."

I mean, maybe temporarily done so that you can patch up some higher priority (more broken?) stuff, but DONE? Pushing the envelope is how you progress, and in this case also how you can learn.


Because at some point you will pass the point of diminishing returns. Suppose 3% of the time is spent in the server, and 97% in the web app. Do you really want to spend 200 hours trying to reduce that 3% to 2.5% instead of spending 10 hours on making your app twice as fast? You've made the server 20% faster, but what have you really gained in practice when you look at the overall picture?

Even assuming unlimited developer resources, will you really notice a 0.5% difference in production, especially when taking things like network latency into account?

My point is that hyping over raw performance is misleading at best. If performance is the only thing that matters then we wouldn't have operating systems with abstractions - everybody would program against the bare metal in assembly. One should weight the pros and cons carefully and reach a balanced trade-off.


> 200 hours trying to reduce that 3% to 2.5%

It's about progression over time. If someone didn't do it, we'd be leaving significant performance gains behind. That means more servers, more electricity and more money for everyone.

I'm not going to do that for my app but it's nice that there are people making these improvements for the benefit of everyone that uses their server.

When you take into account many small improvements, across all deployments, over time, they can and do add up to significant improvements.



Totally agree with you. You dont really get 1000's of reqs/sec in a real world environment. All you need is a good server which is fast and memory efficient. Rest of the efforts should be focussed on the webapp side, more importantly DB. If you can work on your IO you will have a fast and efficient application.


Because increasing benchmark speed provides no business value (assuming your business isn't self-promotion, getting into flamewars, etc).

There's a extremely well known saying that speaks to this issue, goes something like "Preoptimiztion is root of all evil."


This looks like a really cool project, but the comparisons of other webservers are uh, a bit on the negative side. No code is perfect, even if it's better than the rest. While I find myself falling into the same thinking, it's rarely a good idea, and it certainly doesn't help your project's image.


I'm wondering why it says that it has "no threads, coroutines or other crap", and then complains about difficulty with "tr[ying] to patch Fapws so that it would support threading".


I feel the same way. His arrogance is strangely reassuring though heh.


>..."Fapws done right"

I know if I were the author of Fapws I would probably find that section a little irritating.


Only if he wouldn't concede that his software (like everyone else's) has deficiencies.


So he started off a bit aggressively, so what. I looked at the code and it's very spartan and light. It's quite the opposite of the competition and it works for him. Good job on what looks like to be an advanced coders Hello World: web server.


I've been doing some research on small web servers, trying to understand didiwiki's code (a web server and wiki engine in ~2k lines of C):

http://c2.com/cgi/wiki?DidiWiki

This is a nice introduction to writing a minimal web server:

http://www.ibm.com/developerworks/systems/library/es-nweb/in...


Thanks for pointing that out -- I'll want to look didiwiki over since I also wrote a webserver/wiki in <2k lines of C, long ago: http://wry.me/~darius/hacks/ikiwiki.tar.gz

There's usually something to learn from anyone else tackling the same problem.


Any experiences with Bjoern's Django compatibility?



Interesting. I think I am gonna benchmark this against gunicorn with a simple flask app.


Ok took 30 minutes to run some benchmarks on a AWS instance. I used a small instance run a simple Flask Hello World app with gunicorn (1 worker and 4 workers) and bjoern. Frankly I was not able to fund any speed difference between both the servers. Here is the gist of the benchmark results

https://gist.github.com/753998


> n. Frankly I was not able to fund any speed difference between both the servers.

That is because of this http://news.ycombinator.com/item?id=2037060



What about memory consumption?


Is anyone else confused by "1 worker and 4 workers"?


He did 3 benchmarks: gunicorn with 1 worker process, gunicorn with 4 worker processes, bjoern.


Well I run gunicorn in production with 4 workers so just wanted to give it a try. Also I chose to add network latency by doing it on amazon instance.


My guess is he meant 1 master & 4 workers.


results look promising. has anyone tried WSGI on mongrel2?


Is the name a reference to the main character of Peggle?

(Also, how does this compare speed-wise to Mongrel2? It easily handles the 4000 requests/second that my PSGI handler can do. I am sure it could do more if the backend was faster.)


Björn is hardly an uncommon name, as I understand it.


Thanks for this. One of my goals for 2011 is to get into network programming. At less than 1000 lines of code, I think I may just have to go through the source.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: