curl_multibyte: always return a heap-allocated copy of string #6602

jay · 2021-02-13T05:53:58Z

Change the Windows char <-> UTF-8 conversion functions to return an
allocated copy of the passed-in string instead of the original.

Prior to this change the curlx_convert_ functions would, in what I
assume was an optimization, not make a copy of the passed-in string if
no conversion was required. No conversion is required in non-UNICODE
Windows builds since our tchar strings are of type char and already in
UTF-8.

In contrast, the UNICODE Windows builds require conversion
(wchar <-> char) and do return a copy. That inconsistency could lead to
programming errors where the developer expects a copy and does not
realize that won't happen in all cases.
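
To make the ownership difference concrete, here is a minimal sketch in C. The names `demo_convert_passthrough`, `demo_convert_copy` and `demo_convert_free` are hypothetical stand-ins invented for illustration; they are not the actual curlx_convert_ functions and do no real encoding conversion.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical "old style" helper: when no conversion is needed it hands
   back the input pointer itself, so the caller does NOT own the result. */
static const char *demo_convert_passthrough(const char *str)
{
  return str;
}

/* Hypothetical "new style" helper: always returns a heap-allocated copy
   (or NULL on allocation failure), so the caller always owns the result. */
static char *demo_convert_copy(const char *str)
{
  size_t len;
  char *copy;

  assert(str != NULL);
  len = strlen(str) + 1;     /* include the terminating null byte */
  copy = malloc(len);
  if(copy)
    memcpy(copy, str, len);
  return copy;
}

/* Matching release function for demo_convert_copy. */
static void demo_convert_free(char *ptr)
{
  free(ptr);
}
```

With the copy variant, a caller can release the result unconditionally with `demo_convert_free`, regardless of whether a conversion actually took place. With the passthrough variant, releasing the result would free the caller's own input.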

Closes #xxxx

MarcelRaad

Doesn't build yet, but I like the usage simplification. Can't curlx_unicodefree just be removed completely?

ghost · 2021-02-15T07:26:45Z

Congratulations 🎉. DeepCode analyzed your code in 9.778 seconds and we found no issues. Enjoy a moment of no bugs ☀️.

jay · 2021-02-15T07:32:15Z

> Doesn't build yet, but I like the usage simplification. Can't curlx_unicodefree just be removed completely?

OK, I replaced the calls with Curl_safefree. I don't understand why (free) was used in the curlx_unicodefree macro instead of free, is there a reason to avoid a function-like macro version of free?

MarcelRaad · 2021-02-15T08:23:27Z

> I don't understand why (free) was used in the curlx_unicodefree macro instead of free, is there a reason to avoid a function-like macro version of free?

There was some CI failure after using the macro for non-Windows too, but I don't remember the details.

jay · 2021-02-18T19:12:20Z

> > I don't understand why (free) was used in the curlx_unicodefree macro instead of free, is there a reason to avoid a function-like macro version of free?
>
> There was some CI failure after using the macro for non-Windows too, but I don't remember the details.

So it appears I can't get rid of curlx_unicodefree. Since the conversion functions are curlx functions, they are not tracked by memdebug, which is why (free) was in parentheses. I've reverted the change to Curl_safefree and added comments explaining why the function-like macros in multibyte use parentheses.
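
The parentheses matter because a function-like macro only expands when its name is immediately followed by an opening parenthesis. The sketch below illustrates that preprocessor rule with hypothetical names (`demo_free`, `demo_tracked_free`); it is not curl's memdebug code, only a small model of a tracked and an untracked release path.

```c
#include <assert.h>
#include <stdlib.h>

static int tracked_calls = 0; /* releases that went through the macro */

/* Hypothetical tracking wrapper, standing in for a debug allocator hook. */
static void demo_tracked_free(void *ptr)
{
  tracked_calls++;
  free(ptr);
}

/* Plain function that releases memory without any tracking. */
static void demo_free(void *ptr)
{
  free(ptr);
}

/* Function-like macro with the same name as the function above. It only
   expands when the name is directly followed by an opening parenthesis. */
#define demo_free(ptr) demo_tracked_free(ptr)

/* Goes through the macro, so the release is tracked. */
static void release_tracked(void *ptr)
{
  demo_free(ptr);
}

/* The parentheses around the name stop macro expansion, so this calls the
   real demo_free function and bypasses tracking. */
static void release_untracked(void *ptr)
{
  (demo_free)(ptr);
}

/* Releases one buffer through each path and returns how many releases
   were tracked. */
static int demo_run(void)
{
  void *a = malloc(8);
  void *b = malloc(8);

  assert(a && b);
  tracked_calls = 0;
  release_tracked(a);
  release_untracked(b);
  return tracked_calls;
}
```

In this sketch `demo_run()` returns 1: only the call written as `demo_free(ptr)` is redirected, while `(demo_free)(ptr)` reaches the underlying function directly.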

- Change the Windows char <-> UTF-8 conversion functions to return an allocated copy of the passed-in string instead of the original.

  Prior to this change the curlx_convert_ functions would, in what I assume was an optimization, not make a copy of the passed-in string if no conversion was required. No conversion is required in non-UNICODE Windows builds since our tchar strings are of type char and remain in whatever encoding was passed in, which is assumed to be UTF-8 but may be another encoding.

  In contrast, the UNICODE Windows builds require conversion (wchar <-> char) and do return a copy. That inconsistency could lead to programming errors where the developer expects a copy and does not realize that won't happen in all cases.

  Closes #xxxx

MarcelRaad

👍

sergio-nsk · 2021-04-22T23:04:33Z

It seems this change causes the NTLM authentication to crash. Crashes happen in Curl_auth_build_spn -> _CrtIsValidHeapPointer.

jay · 2021-04-23T04:23:29Z

> It seems this change causes the NTLM authentication to crash. Crashes happen in Curl_auth_build_spn -> _CrtIsValidHeapPointer.

Please try #6938

curlx_convert_UTF8_to_tchar must be freed by curlx_unicodefree, but prior to this change some uses mistakenly called free. I've reviewed all other uses of curlx_convert_UTF8_to_tchar and curlx_convert_tchar_to_UTF8.

Bug: curl#6602 (comment)
Reported-by: sergio-nsk@users.noreply.github.com
Closes #xxxx
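
The general rule behind this fix is that memory must be released through the function that matches its allocator. The following sketch models that rule under stated assumptions: `app_malloc`/`app_free` are hypothetical application-supplied callbacks, and `demo_convert`/`demo_unicodefree` are hypothetical helpers that always use the system allocator. None of these are curl's real functions.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

static int app_allocs = 0; /* blocks handed out by the app allocator */
static int app_frees = 0;  /* blocks returned to the app allocator */

/* Hypothetical application-supplied allocator callbacks. */
static void *app_malloc(size_t size)
{
  app_allocs++;
  return malloc(size);
}

static void app_free(void *ptr)
{
  if(ptr)
    app_frees++;
  free(ptr);
}

/* Hypothetical conversion helper that always uses the system allocator
   and always returns a copy (or NULL on failure). */
static char *demo_convert(const char *str)
{
  size_t len = strlen(str) + 1;
  char *copy = malloc(len);

  if(copy)
    memcpy(copy, str, len);
  return copy;
}

/* Matching release for demo_convert: also the system allocator. */
static void demo_unicodefree(char *ptr)
{
  free(ptr);
}

/* Allocates one block from each allocator, releases each with its
   matching function, and returns the app allocator's outstanding count. */
static int demo_balance(void)
{
  char *conv = demo_convert("HTTP/host");
  char *buf = app_malloc(16);

  assert(conv && buf);
  demo_unicodefree(conv); /* matches demo_convert */
  app_free(buf);          /* matches app_malloc */
  return app_allocs - app_frees;
}
```

Here `demo_balance()` returns 0 because every block goes back to the allocator it came from. Passing `conv` to `app_free` instead would only skew the counters in this sketch, but with allocators backed by different heaps the same mistake can fail a heap validity check or crash.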

sergio-nsk · 2021-04-23T04:47:05Z

#6938 fixes the issue I commented about. Thank you!

curlx_convert_UTF8_to_tchar must be freed by curlx_unicodefree, but prior to this change some uses mistakenly called free. I've reviewed all other uses of curlx_convert_UTF8_to_tchar and curlx_convert_tchar_to_UTF8.

Bug: #6602 (comment)
Reported-by: sergio-nsk@users.noreply.github.com
Closes #6938

jay added libcurl API tidy-up labels Feb 13, 2021

jay requested review from captain-caveman2k and MarcelRaad February 13, 2021 05:53

jay added the Windows label Feb 13, 2021

MarcelRaad reviewed Feb 15, 2021

jay force-pushed the improve_multibyte branch from 38071e6 to dda90a0 Compare February 18, 2021 19:10

jay force-pushed the improve_multibyte branch from dda90a0 to e59dea7 Compare February 18, 2021 19:16

jay force-pushed the improve_multibyte branch from e59dea7 to ce0e186 Compare February 18, 2021 19:24

MarcelRaad approved these changes Feb 19, 2021

jay closed this in 0936350 Feb 20, 2021

jay deleted the improve_multibyte branch February 20, 2021 19:45

arvids-kokins-bidstack mentioned this pull request Apr 14, 2021

TestMemoryCallbacksRealServer failing bidstack-group/curl#1

Merged

jay mentioned this pull request Apr 23, 2021

lib: fix some misuse of curlx_convert_UTF8_to_tchar #6938

Closed
