Are Range requests for dynamic content supported?

Gurdipe_D · May 1, 2012, 11:54pm

Hi,

I’m wondering if nginx supports Range requests for dynamic content
(say, from a Rails app via Passenger).

I’ve tried it out with static content, and it works as expected. When
hitting a URL with dynamic content, however, nginx seems to ignore the
Range header, and it does not include a Accept-Ranges header in the
response. I did notice that Rails was not returning a Content-Length
header, so I fixed that. However, even with this header, I get the
same results.

I’m thinking the answer to this question is no, but I would like to
get some confirmation. Thanks!

-David

–
David van Geest
Software Engineer
Spindance, Inc.
616.355.7000 x113

David_van_Geest · May 2, 2012, 3:45am

Your backend is responsible for returning the proper data, it can’t
return the entire data and then expect nginx to send only part of it.

What you should actually do, if at all possible, is to use the
x-accel-redirect to direct nginx to the content location so that it can
read the file itself instead of getting it from your backend.

Posted at Nginx Forum:

David_van_Geest · May 2, 2012, 3:23pm

On Tue, May 1, 2012 at 9:45 PM, Ensiferous [email protected] wrote:

Your backend is responsible for returning the proper data, it can’t
return the entire data and then expect nginx to send only part of it.

Why can’t nginx do this? Technically it seems entirely feasible.
Partial Content is an HTTP feature so it seems logical to keep this
with the HTTP server.

What you should actually do, if at all possible, is to use the
x-accel-redirect to direct nginx to the content location so that it can
read the file itself instead of getting it from your backend.

There is no file or content location, this is dynamic content pulled
from a DB. Maybe I’m misunderstanding your suggestion though…

Thanks for the reply!

David_van_Geest · May 2, 2012, 7:58pm

On Wed, May 2, 2012 at 1:31 PM, Lukas T. [email protected]
wrote:

Its about stale data in the client. If the HTTP client request an exact
byte-range, then this particular byte-range was chosen because of a reason, like
the moov atom in mp4 files; the header of a pdf file or similar stuff (depending
on the content-type). Here is the problem: when the client reads the first bytes
of the PDF today, and the user scrolls down tomorrow (adobe reader makes heavy use
of range requests iirc), the pdf on the server needs to be exactly the same (bit
for bit). If its not, the byte-range request must not be successful, otherwise the
application will get corrupt data (how can the byte offset still be the same with
the file from yesterday if the content changed on the server). This is the reason
why the HTTP server needs to validate the client-side cache with things like
filemtime. If we can’t validate the client cache, we can’t serve 206 partial
content. In case of dynamic content we have no way to do this (theoretically it
would be doable with etag stro
ng validation, but nginx doesn’t support it and your application surely doesn’t
either).

Yeah, this is a reason why it shouldn’t be done in most cases…
technically it’s still feasible, but I guess I’ll accept “shouldn’t”
as “can’t”. We manage to get around this kind of situation by using
checksums to validate the entire HTTP body once it’s been delivered in
its entirety. The checksums are originally for something else
entirely, but that’s a different story.

Iirc, (please correct me if I’m wrong), nginx, when configured as a caching
reverse proxy serves 206 only when the object is already in the local nginx cache.
If its not there, full file will be served.

I don’t think nginx as we have it configured would qualify as a
caching reverse proxy - it’s a pretty standard nginx + passenger
install. I did try it with a static file and Range requests worked as
expected.

Also read HTTP/1.1 specs in [1].

I’ve read it, and am using it for reference, but I don’t see anything
that specifically addresses the question of dynamic content.

Can you tell us more about your use case? Is your dynamic content really that
big? Maybe you are approaching this from the wrong side, x-accel-redirect can
probably help here, as Ensiferous already posted.

Yeah, a bit of context would probably help here. The HTTP client in
this case is (always) a very particular embedded device with limited
resources and a constrained operating environment. A response at the
extreme end of the scale could be 200 K, and this apparently causes
the client to croak (for reasons that are a little unclear to me). It
was suggested that if we could break up the response into smaller
pieces, it may solve the issue. This is just one avenue we’re
exploring.

For the moment, I did get this to work by adapting the Rack middleware
Racknga::Middleware::Range I found here:
http://groonga.rubyforge.org/. It has yet to be seen whether this
will solve the client crashing issues, so for now I’m going to leave
it.

Thanks for your response!

David_van_Geest · May 2, 2012, 7:32pm

Its about stale data in the client. If the HTTP client request an exact
byte-range, then this particular byte-range was chosen because of a
reason, like the moov atom in mp4 files; the header of a pdf file or
similar stuff (depending on the content-type). Here is the problem: when
the client reads the first bytes of the PDF today, and the user scrolls
down tomorrow (adobe reader makes heavy use of range requests iirc), the
pdf on the server needs to be exactly the same (bit for bit). If its
not, the byte-range request must not be successful, otherwise the
application will get corrupt data (how can the byte offset still be the
same with the file from yesterday if the content changed on the server).
This is the reason why the HTTP server needs to validate the client-side
cache with things like filemtime. If we can’t validate the client cache,
we can’t serve 206 partial content. In case of dynamic content we have
no way to do this (theoretically it would be doable with etag stro
ng validation, but nginx doesn’t support it and your application surely
doesn’t either).

Iirc, (please correct me if I’m wrong), nginx, when configured as a
caching reverse proxy serves 206 only when the object is already in the
local nginx cache. If its not there, full file will be served.

Also read HTTP/1.1 specs in [1].

Can you tell us more about your use case? Is your dynamic content really
that big? Maybe you are approaching this from the wrong side,
x-accel-redirect can probably help here, as Ensiferous already posted.

[1] RFC 2616 - Hypertext Transfer Protocol -- HTTP/1.1

David_van_Geest · May 2, 2012, 8:51pm

On Wed, May 2, 2012 at 2:42 PM, Lukas T. [email protected]
wrote:

The HTTP client in
this case is (always) a very particular embedded device with limited
resources and a constrained operating environment.

Ok, so this is about working around a broken user-agent. I assume the vendor of
that embedded device can’t fix the crashing client?

Correct :-).

How do you force the client to request the content in Byte-Ranges and not at
once? Even if you are serving static files from the server (where nginx can
provide most of the HTTP/1.1 features), the client actually needs to request it
that way.

The developer of the embedded application has some control over the
HTTP requests, and thinks he can force it to use Byte-Ranges.

It has yet to be seen whether this
will solve the client crashing issues, so for now I’m going to leave
it.

Meaning you still aren’t sure whether range support will fix your client
crashing issue? I would look for other workarounds, like heavy gzipping, (if
x/html) minifying and outsourcing as much as possible in other files (css and
stuff), etc. You already implemented checksumming on the client-side, maybe you
can implement some segmenting in the application code as well, instead of relaying
on HTTP to do that job via Range requests.

Right. Gzipping may be another option… the response is purely JSON,
but should be very compressible. The question would probably come
down to space on the embedded device for a gzip library (sigh…). We
want to avoid large application changes but segmenting there is
another option. Thanks for the suggestions!

David_van_Geest · May 2, 2012, 8:42pm

The HTTP client in
this case is (always) a very particular embedded device with limited
resources and a constrained operating environment.

Ok, so this is about working around a broken user-agent. I assume the
vendor of that embedded device can’t fix the crashing client?

How do you force the client to request the content in Byte-Ranges and
not at once? Even if you are serving static files from the server (where
nginx can provide most of the HTTP/1.1 features), the client actually
needs to request it that way.

It has yet to be seen whether this
will solve the client crashing issues, so for now I’m going to leave
it.

Meaning you still aren’t sure whether range support will fix your client
crashing issue? I would look for other workarounds, like heavy gzipping,
(if x/html) minifying and outsourcing as much as possible in other files
(css and stuff), etc. You already implemented checksumming on the
client-side, maybe you can implement some segmenting in the application
code as well, instead of relaying on HTTP to do that job via Range
requests.

BR,

Lukas