transfer response handling #12480

icing · 2023-12-07T10:58:28Z

The last PR in the "client writer" series. This clarifies the handling of server responses by folding the code for the complicated protocols into their protocol handlers. This concerns mainly HTTP and its bastard sibling RTSP.

Nomenclature

The terms "read" and "write" are often used without clear context if they refer to the connect or the client/application side of a transfer. This PR uses "read/write" for operations on the client side and "send/receive" for the connection, e.g. server side. If this is considered useful, we can revisit renaming of further methods in another PR.

Protocol Handler Interface

Curl's protocol handler readwrite() method been changed:

-  CURLcode (*readwrite)(struct Curl_easy *data, struct connectdata *conn,
-                        const char *buf, size_t blen,
-                        size_t *pconsumed, bool *readmore);
+  CURLcode (*write_resp)(struct Curl_easy *data, const char *buf, size_t blen,
+                         bool is_eos, bool *done);

The name was changed to clarify that this writes reponse data to the client side. The parameter changes are:

conn removed as it always operates on data->conn
pconsumed removed as the method needs to handle all data on success
readmore removed as no longer necessary
is_eos as indicator that this is the last call for the transfer response (end-of-stream).
done TRUE on return iff the transfer response is to be treated as finished

This change affects many files only because of updated comments in handlers that provide no implementation. The real change is that the HTTP protocol handlers now provide an implementation.

HTTP/RTSP write_resp()

The HTTP protocol handlers write_resp() implementation will get passed all raw data of a server response for the transfer. The HTTP/1.x formatted status and headers, as well as the undecoded response body. Curl_http_write_resp_hds() is used internally to parse the response headers and pass them on. This method is public as the RTSP protocol handler also uses it.

HTTP/1.1 "chunked" transport encoding is now part of the general content encoding writer stack, just like other encodings. A new flag CLIENTWRITE_EOS was added for the last client write. This allows writers to verify that they are in a valid end state. The chunked decoder will check if it indeed has seen the last chunk.

General `transfer.c` handling

The general response handling in transfer.c:466 happens in function readwrite_data(). This mainly operates now like:

static CURLcode readwrite_data(data, ...)
{
  do {
    Curl_xfer_recv_resp(data, buf)
    ...
    Curl_xfer_write_resp(data, buf)
    ...
  } while(interested);
  ...
}

All the response data handling is implemented in Curl_xfer_write_resp(). It calls the protocol handler's
write_resp() implementation if available, or does the default behaviour.

All raw response data needs to pass through this function. Which also means that anyone in possession of such data may call Curl_xfer_write_resp(). This was implemented for HTTP/2 in #12468 to demonstrate the effect this has on transfer handling.

icing · 2024-01-04T13:16:52Z

@monnerat, could I interest you in taking a glance at this PR? It is basically an extensions of the content encoder stack that, I believe, was done by you. Would be nice to get some feedback.

bagder · 2024-01-07T13:38:51Z

@icing this merge conflicts now

wip

- use Curl_xfer_recv_resp() to receive response data from connection - used Curl_xfer_write_resp() to write response data to client

…done

* code spelling in content_encoding.c * single-use functions made static

…al api

monnerat · 2024-01-08T08:51:17Z

lib/content_encoding.c

-/* supported content encodings table. */
-static const struct Curl_cwtype * const encodings[] = {
+/* supported general content decoders. */
+static const struct Curl_cwtype * const general_decoders[] = {


When i wrote this, I preferred using "unencode" rather than "decode" to emphasize the context of encoding handling. So I'm not a big fan of the renaming.

I can change that back. I thought "decode" was the opposite of "encode", but English is not my first language.

It is. "unencode" was always very weird to read in my eyes. What does it mean? In English you encode one way and you decode the other way.

I thought "decode" was the opposite of "encode"

Yes it is indeed. And the word "unencode" is not very elegant I admit.
This was just because we are dealing with encodings.
Thus I may be ok with decoding if you change it everywhare for consistency!

Renamed as proposed.

monnerat · 2024-01-08T09:04:18Z

lib/content_encoding.c

@@ -851,6 +851,13 @@ static const struct Curl_cwtype * const encodings[] = {
  NULL
 };

+/* supported content decoders only for transfer encodings */
+static const struct Curl_cwtype * const transfer_decoders[] = {


Wouldn't it be easier to keep a single table and add bit flags in the Curl_cwtype structure to specify the allowed phase(s) and if it may be skipped or not? Just a suggestion.

I prefer to keep them separate and void testing a flag while iterating. Matter of taste, probably.

lib/content_encoding.c

github-actions bot added the tests label Dec 7, 2023

icing added performance tidy-up labels Dec 7, 2023

icing force-pushed the cw-part5 branch from 6fc91ea to 722a2af Compare December 15, 2023 10:44

icing force-pushed the cw-part5 branch from 722a2af to 33dad65 Compare December 28, 2023 11:28

icing added 10 commits January 8, 2024 09:08

Curl_xfer_write_resp

e362690

wip

Add Curl_xfer_recv_resp()

745c193

- use Curl_xfer_recv_resp() to receive response data from connection - used Curl_xfer_write_resp() to write response data to client

Rename handler->readwrite() to handler->write_resp()

595d47e

On multiplex only stop receiving on EOS

2944e46

http handler, delay h2 101 switch after that response has been fully …

55a589b

…done

fixes detected in CI

4764e59

* code spelling in content_encoding.c * single-use functions made static

http, always initialize bool *done parameter

471ab9a

fix typo, rename eos parameter in Curl_xfer_recv_resp() to clarify

9886abc

add Curl_xfer_write_resp to script/singleuse.pl as part of the intern…

9ad1644

…al api

remove buf update as it is never read

e5d84a8

icing force-pushed the cw-part5 branch from 33dad65 to e5d84a8 Compare January 8, 2024 08:10

monnerat reviewed Jan 8, 2024

View reviewed changes

lib/content_encoding.c Outdated Show resolved Hide resolved

rename content decoders back to unencoders after review by monnerat

c2277fa

bagder closed this in d7b6ce6 Jan 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

transfer response handling #12480

transfer response handling #12480

icing commented Dec 7, 2023 •

edited

icing commented Jan 4, 2024

bagder commented Jan 7, 2024

monnerat Jan 8, 2024

icing Jan 8, 2024 •

edited

bagder Jan 8, 2024

monnerat Jan 8, 2024

icing Jan 8, 2024

monnerat Jan 8, 2024

icing Jan 8, 2024

transfer response handling #12480

transfer response handling #12480

Conversation

icing commented Dec 7, 2023 • edited

Nomenclature

Protocol Handler Interface

HTTP/RTSP write_resp()

General transfer.c handling

icing commented Jan 4, 2024

bagder commented Jan 7, 2024

monnerat Jan 8, 2024

Choose a reason for hiding this comment

icing Jan 8, 2024 • edited

Choose a reason for hiding this comment

bagder Jan 8, 2024

Choose a reason for hiding this comment

monnerat Jan 8, 2024

Choose a reason for hiding this comment

icing Jan 8, 2024

Choose a reason for hiding this comment

monnerat Jan 8, 2024

Choose a reason for hiding this comment

icing Jan 8, 2024

Choose a reason for hiding this comment

icing commented Dec 7, 2023 •

edited

General `transfer.c` handling

icing Jan 8, 2024 •

edited