Curl_client_write, disambiguate flag semantics #11885

icing · 2023-09-19T10:35:49Z

use CLIENTWRITE_BODY only when data is actually body data
add CLIENTWRITE_INFO for meta data that is not a HEADER
debug assertions that BODY/INFO/HEADER is not used mixed
move data->set.include_header check into Curl_client_write so protocol handlers no longer have to care
add special in FTP for data->set.include_header for historic, backward compatible reasons
move unpausing of client writes from easy.c to sendf.c, so that code is in one place and can forward flags correctly

- use CLIENTWRITE_BODY *only* when data is actually body data - add CLIENTWRITE_INFO for meta data that is *not* a HEADER - debug assertions that BODY/INFO/HEADER is not used mixed - move `data->set.include_header` check into Curl_client_write so protocol handlers no longer have to care - add special in FTP for `data->set.include_header` for historic, backward compatible reasons - move unpausing of client writes from easy.c to sendf.c, so that code is in one place and can forward flags correctly

bagder · 2023-09-20T08:26:48Z

lib/ftp.c

+   * output. */
+  CURLcode result;
+  int save = data->set.include_header;
+  data->set.include_header = TRUE;


Where within Curl_client_write() is data->set.include_header checked? I can't find it, so I'm not following why it needs to be set like this!

Never mind. I see it now, this patch adds the check.

Would it not be nicer to pass in the "include_header" bool as a new argument to Curl_client_write instead of changing the variable like this?

Never mind. I see it now, this patch adds the check.

Would it not be nicer to pass in the "include_header" bool as a new argument to Curl_client_write instead of changing the variable like this?

My thinking is that FTP is the sole exception here and that every other protocol does not have to care about this. I'd rather spare all others the "burden" to think about it than make FTP look nicer.

Ah right. Sounds fair.

bagder · 2023-09-20T12:54:50Z

lib/sendf.h

+ * confusion on how to interpret/format/convert the data.
+ */
+#define CLIENTWRITE_BODY    (1<<0) /* non-meta information, BODY */
+#define CLIENTWRITE_INFO    (1<<1) /* meta information, not a HEADER */


The only user of this (CLIENTWRITE_INFO) now is response lines in the pingpong handling. Does it really need to be set special? It has been set a "header" up until now and it will be sent to the header callback...

Yeah, that is what I thought first as well. But then test cases failed. There are cases, where data->req.include_header is set, but pingpong never wrote with `CLIENTWRITE_BODY´. Other protocol handler parts did, though. I believe IMAP is such a case, if memory serves me.

So, before this PR, we had calls to Curl_client_write() that:

added CLIENTWRITE_BODY when data->req.include_header was TRUE

did not add CLIENTWRITE_BODY although data->req.include_header was TRUE

added CLIENTWRITE_BODY irregardless of data->req.include_header

Case 1 is now automatically handles in Curl_client_write(). Case 2 is now changed to CLIENTWRITE_INFO. Case 3 is handled via the special flag set in FTP (and only there because FTP via HTTP PROXY does not want to see the CONNECT headers).

Why do all this? Because with this PR, the flags say what the written data is - and no longer to which callback it shall be passed to. So CLIENTWRITE_BODY really is only used for body data.

When this holds true, it is possible to move several things into Curl_client_write():

content encoding writers, e.g. data->req.write_stack

chunked decoding

progress updates

making life for transfer loop and protocol handlers easier.

To explain the case some more: we have this nice writer stack in content_encoding.[ch]. That is also called for Transfer-Encoding headers. But chunked is not implemented there. Why not?

Well, chunked changes the amount of bytes in a response. The chunk framing is not counted against Content-Length and download progress updates. In order to count the bytes correctly, it needs to know what of its buffer is really the content.

So, when writing received transfer data, one needs to if() check for chunked encoding, call it, get back an updated buffer position and length and use that for client writes and progress updates.

If chunked were a content_encoding writer, we could just write the received data through the writer stack and it would take care of all this. Add a "progress" writer after "chunked" and updates just work for everyone.

bagder · 2023-09-21T06:57:33Z

Thanks!

- use CLIENTWRITE_BODY *only* when data is actually body data - add CLIENTWRITE_INFO for meta data that is *not* a HEADER - debug assertions that BODY/INFO/HEADER is not used mixed - move `data->set.include_header` check into Curl_client_write so protocol handlers no longer have to care - add special in FTP for `data->set.include_header` for historic, backward compatible reasons - move unpausing of client writes from easy.c to sendf.c, so that code is in one place and can forward flags correctly Closes curl#11885

icing mentioned this pull request Sep 19, 2023

dfilters, a client writer stack per easy handle #11851

Closed

icing added 2 commits September 20, 2023 09:30

fix bitfield assignment type

28da2a5

icing force-pushed the client_write_flags_peas branch from b750933 to 28da2a5 Compare September 20, 2023 07:42

bagder reviewed Sep 20, 2023

View reviewed changes

bagder approved these changes Sep 21, 2023

View reviewed changes

bagder closed this in 8898257 Sep 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Curl_client_write, disambiguate flag semantics #11885

Curl_client_write, disambiguate flag semantics #11885

icing commented Sep 19, 2023

bagder Sep 20, 2023

bagder Sep 20, 2023

icing Sep 20, 2023

bagder Sep 20, 2023

bagder Sep 20, 2023

icing Sep 20, 2023 •

edited

icing Sep 20, 2023

bagder commented Sep 21, 2023

Curl_client_write, disambiguate flag semantics #11885

Curl_client_write, disambiguate flag semantics #11885

Conversation

icing commented Sep 19, 2023

bagder Sep 20, 2023

Choose a reason for hiding this comment

bagder Sep 20, 2023

Choose a reason for hiding this comment

icing Sep 20, 2023

Choose a reason for hiding this comment

bagder Sep 20, 2023

Choose a reason for hiding this comment

bagder Sep 20, 2023

Choose a reason for hiding this comment

icing Sep 20, 2023 • edited

Choose a reason for hiding this comment

icing Sep 20, 2023

Choose a reason for hiding this comment

bagder commented Sep 21, 2023

icing Sep 20, 2023 •

edited