Image Hosting

hi there,

we run an image hosting service, and of course we’re using nginx!

We’re now having a problem with disk I/O, with 200 GB of images.
Are there any recommendations for this? nginx configuration, or
hardware/other software configuration?

We are already using a dual quad-core 3 GHz, 8 GB RAM, 6x73 GB 15k SCSI server.

You should be able to use a cache, either with nginx’s own caching or with Squid Cache.
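A minimal sketch of the nginx-side caching idea (all names, paths and sizes here are illustrative, not from the thread; assumes nginx with the proxy_cache machinery, fronting a hypothetical backend that holds the images):

```nginx
http {
    # hypothetical cache location and sizing
    proxy_cache_path  /var/cache/nginx/images  levels=1:2
                      keys_zone=images:64m  max_size=20g  inactive=7d;

    server {
        listen 80;

        location /images/ {
            proxy_pass        http://127.0.0.1:8080;  # backend actually serving the files
            proxy_cache       images;
            proxy_cache_valid 200 7d;                 # cache successful responses for a week
        }
    }
}
```

Whether this helps depends entirely on the hit ratio; for truly random access across 200 GB, a cache smaller than the working set buys little.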



From: SplitIce [email protected]
To: [email protected]
Sent: Thu, October 14, 2010 2:47:55 PM
Subject: Re: Image Hosting

Try any of these:

  1. Compile nginx with AIO support
  2. Try directio option
  3. Try sendfile option
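Those three knobs, as they might appear in an nginx config (a sketch only; `aio` requires nginx built with file-AIO support, and the directio threshold is an example value):

```nginx
location / {
    sendfile  on;    # kernel-side file-to-socket copy for static files
    aio       on;    # asynchronous disk reads; needs a build with AIO support
    directio  2m;    # example threshold: read files >= 2 MB with O_DIRECT
}
```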

On Thu, Oct 14, 2010 at 6:36 PM, Indo P. [email protected] wrote:

hi there,


If it’s just serving static files, nginx should perform better without
caching, unless they are calling PHP for all images (in which case that’s
the root of their performance problem).

  1. Install the status module and get more information.
  2. Turn down keepalive; it’s useful, but if you are keeping that number of
     connections open then it’s bound to hit problems.

Also, you should probably send the core part of your nginx config; there might be some
optimisations there.
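The two suggestions above might look like this in a config (a sketch; assumes nginx was built with the stub_status module, and the timeout value is only illustrative):

```nginx
http {
    keepalive_timeout 15;    # default is 75s; shorter frees idle connections sooner

    server {
        listen 80;

        # status page, restricted to localhost
        location /nginx_status {
            stub_status on;
            allow 127.0.0.1;
            deny  all;
        }
    }
}
```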

Here’s the count of established connections:

netstat -n | grep :80 | grep ESTABLISHED | wc -l

20676
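For reference, the same counting pipeline run against a couple of fabricated netstat-style rows (the grep chain keeps only lines that are both on port 80 and ESTABLISHED):

```shell
# two made-up rows: one established connection, one in TIME_WAIT
printf 'tcp 0 0 10.0.0.1:80 192.0.2.4:51000 ESTABLISHED\ntcp 0 0 10.0.0.1:80 192.0.2.5:51001 TIME_WAIT\n' \
  | grep :80 | grep ESTABLISHED | wc -l    # one of the two sample rows counts
```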


From: SplitIce [email protected]
To: [email protected]
Sent: Thu, October 14, 2010 3:33:45 PM
Subject: Re: Image Hosting

Hmm, well, talking from experience, nginx can serve upwards of 5,000 req/s
for static files if configured right. I’ll leave this for Igor or someone
more experienced to reply to.

On Thu, Oct 14, 2010 at 7:28 PM, Indo P. [email protected] wrote:

i tried to use AIO, but it seems my server is getting slower



Actually the images are around 100 KB each. The server is pushing about
250 Mbps of traffic.
I already described the disks I’m using: 15K RPM SCSI in RAID 0.

Sent from my BlackBerry
powered by Sinyal Kuat INDOSAT

But if those 5,000 req/s each hit a different 1 MB image, then you need to
read 5 GB of data from your storage, which, if your storage is just a plain
old SATA disk, is going to be a huge problem.

Unfortunately, Indo P. is not providing nearly enough information to give
any sort of advice. If he is lucky, then putting lots of RAM in the machine
for the page cache can help here, if the cache hit ratio is good; but if it
is not, then he probably has to distribute the I/O across more spindles and
go for a RAID with lots of disks.

Regards,
Dennis


Hi there,

I run something like image hosting too, almost 220 GB (651k files, images).
Since most users request quite random content, there is no way to cache the
most used files in RAM.

We got the best performance with ext4 and the BFQ I/O scheduler. The whole
system is Gentoo-based, etc., so I think you may want to try optimizing it
at the OS level.

Dual-core Xeon 2.5 GHz, 4 GB RAM and a standalone RAID 10 array, two nginx
workers.

– Piotr.


What OS do you use?
Try increasing the number of worker_processes, for example, to 20.
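As a config sketch (20 is the illustrative figure from the suggestion; the right number for a disk-bound box depends on how many workers end up blocked on I/O at once):

```nginx
worker_processes  20;    # more workers than cores helps when workers block on disk

events {
    worker_connections  1024;
}
```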


Igor S.
http://sysoev.ru/en/

What a great result. Would you be so kind as to share your
configuration?

Ah, sorry, I somehow didn’t catch that last line with the actual HW
setup.

If my math skills don’t fail me (and they very well might), the traffic
and image size figures mean that you serve about 300 images per second.
Can you post a few lines of vmstat output, and perhaps the output of
“iostat -d 60 2”?
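That back-of-the-envelope estimate can be checked quickly with integer shell arithmetic (250 Mbps and 100 KB are the figures from the thread):

```shell
mbps=250
image_kb=100
bytes_per_s=$(( mbps * 1000000 / 8 ))          # 31,250,000 bytes/s
echo $(( bytes_per_s / (image_kb * 1000) ))    # requests per second: prints 312
```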

Also, how are the hits distributed across the whole pool of images? Are
these hits truly random, or are some images hit significantly more often
than others?

More RAM would obviously take some pressure off the disks, if they are
really the problem.

Regards,
Dennis


So, do you see iowait (by running ‘iostat’ or ‘top’)? That would mean
the bottleneck is the disk system, and then the only way to improve the
situation is either getting more disks, or adding memory for caching
(either just via the plain Linux VM file cache, or by adding memory-only
proxies like Varnish).

Usually it’s good to try other servers for comparison, like Apache or
lighttpd; if their default configurations show the same results, it’s not
the web server at fault, and therefore nothing is wrong with the nginx
config.

But still, this is too little data to give any solutions or hints: for
example, the nginx version and configuration (maybe too few workers), some
I/O metrics, the filesystem (ext, XFS, …), file attributes and directory
structure, and the network load. In short: be more detailed about the
problem you have.

rr

On Thu, Oct 14, 2010 at 12:26:59PM +0000, [email protected] wrote:

Actually the images are around 100 KB each. The server is pushing about
250 Mbps of traffic.
I already described the disks I’m using: 15K RPM SCSI in RAID 0.

What is the stripe size of the RAID?


Igor S.
http://sysoev.ru/en/

On Thu, Oct 14, 2010 at 01:36:20PM +0000, [email protected] wrote:

The disks are 6x73 GB SCSI, 15K RPM…

I meant the stripe size of the RAID0.
I do not know, however, how to see it on Linux.
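For Linux software RAID (md), the chunk (stripe) size is reported in /proc/mdstat; for hardware RAID you would need the controller’s own CLI tool instead. A sketch against a fabricated mdstat line:

```shell
# fabricated /proc/mdstat content; on a real box you would run:
#   grep -o '[0-9]*k chunks' /proc/mdstat
printf 'md0 : active raid0 sdb1[1] sda1[0]\n      286748160 blocks 64k chunks\n' \
  | grep -o '[0-9]*k chunks'    # prints "64k chunks"
```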



Igor S.
http://sysoev.ru/en/

Hello!

On Thu, Oct 14, 2010 at 12:26:59PM +0000, [email protected] wrote:

Actually the images are around 100 KB each. The server is pushing about
250 Mbps of traffic.
I already described the disks I’m using: 15K RPM SCSI in RAID 0.

Basic tunings you have to apply when serving static content which doesn’t
fit into memory are:

If you use sendfile:

  • Make sure your OS uses an appropriate read-ahead for sendfile to
    avoid thrashing disks with small requests (and seeks). For FreeBSD
    8.1+ it should be enough to set the read_ahead directive in the nginx
    config (0.8.18+).

http://sysoev.ru/nginx/docs/http/ngx_http_core_module.html#read_ahead
(in Russian)

Using bigger socket buffers (listen … sndbuf=… in nginx
config) may help too.

  • If serving large files, make sure you use an appropriate
    sendfile_max_chunk to avoid blocking an nginx worker on disk for too
    long.

  • Consider switching sendfile off if you can’t persuade it to read
    large blocks from disk.
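Put together, the sendfile-side tunings above might look like this (the sizes and the docroot are illustrative, not recommendations; read_ahead needs nginx 0.8.18+ and takes effect on FreeBSD 8.1+):

```nginx
http {
    sendfile            on;
    read_ahead          512k;   # let sendfile read big blocks instead of seek-heavy small ones
    sendfile_max_chunk  512k;   # cap how long one sendfile() call can pin a worker on disk

    server {
        listen 80 sndbuf=128k;  # bigger socket send buffers
        root   /var/www/images; # hypothetical docroot
    }
}
```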

If not using sendfile:

  • Tune output_buffers (again, to avoid thrashing disks with small
    requests and seeks). The default is 2x32k, which will result in 4
    disk requests for a 100 KB file. Changing it to 1x128k would mean
    2x the memory usage but 4x fewer disk requests; this is probably
    a good thing to do if you are disk-bound.
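As a config fragment (1x128k is the figure from the text):

```nginx
# with sendfile off, responses are read from disk through these buffers:
# one 128 KB buffer, so a 100 KB file needs a single disk read
sendfile        off;
output_buffers  1 128k;
```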

In both cases:

  • Using aio may help a lot (“aio sendfile” is only available under
    FreeBSD) by adding more concurrency to the disk load and generally
    improving nginx interactivity. Though right now it requires
    patches to avoid socket leaks; see here:

http://nginx.org/pipermail/nginx-devel/2010-October/000498.html

  • Using directio may help to improve disk cache effectiveness by
    excluding large files (if you have some) from the cache. Though keep
    in mind that it disables sendfile, so if you generally tune for
    sendfile you may have to apply the output_buffers tunings as well.
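A sketch of how those last two could sit together in a location block (the 4 MB cutoff and the path are illustrative; aio needs nginx built with file-AIO support):

```nginx
location /images/ {
    aio       on;             # async disk reads; "aio sendfile" is FreeBSD-only
    directio  4m;             # files >= 4 MB bypass the page cache entirely
    output_buffers 1 128k;    # directio disables sendfile, so buffers matter again
}
```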

It’s hard to say anything more than this without knowing lots of
details.

Maxim D.

Thanks, Maxim, for the clear explanation. I will try your suggestions and
will let you know.

What other information do you need to help with this situation?

Hi Dennis,

For the output of vmstat and iostat I’ll update you soon, because right
now I’m on my mobile.

The hits cover almost 90% of the pool of images.