Is it possible to get GCC to compile UTF-8 with BOM source files?

前端 未结 2 1602
滥情空心
滥情空心 2020-12-10 13:24

I develop C++ cross platform using Microsoft Visual Studio on Windows and GCC on uBuntu Linux.

In Visual Studio I can use unicode symbols like "π

2条回答
  •  一整个雨季
    2020-12-10 14:21

    While unicode identifiers are supported in gcc, UTF-8 input is not. Therefore, unicode identifiers have to be encoded using \uXXXX and \UXXXXXXXX escape codes. However, a simple one-line patch to the cpp preprocessor allows gcc and g++ to process UTF-8 input provided a recent version of iconv that support C99 conversions is also installed. Details are present at

    https://www.raspberrypi.org/forums/viewtopic.php?p=802657

    However, the patch is so simple it can be given right here.

    diff -cNr gcc-5.2.0/libcpp/charset.c gcc-5.2.0-ejo/libcpp/charset.c
    *** gcc-5.2.0/libcpp/charset.c  Mon Jan  5 04:33:28 2015
    --- gcc-5.2.0-ejo/libcpp/charset.c  Wed Aug 12 14:34:23 2015
    ***************
    *** 1711,1717 ****
        struct _cpp_strbuf to;
        unsigned char *buffer;
    
    !   input_cset = init_iconv_desc (pfile, SOURCE_CHARSET, input_charset);
        if (input_cset.func == convert_no_conversion)
          {
            to.text = input;
    --- 1711,1717 ----
        struct _cpp_strbuf to;
        unsigned char *buffer;
    
    !   input_cset = init_iconv_desc (pfile, "C99", input_charset);
        if (input_cset.func == convert_no_conversion)
          {
            to.text = input;
    

    Even with the patch, two command line options are needed to enable UTF-8 input. In particular, try something like

    $ /usr/local/gcc-5.2/bin/gcc \
        -finput-charset=UTF-8 -fextended-identifiers \
        -o circle circle.c
    

提交回复
热议问题