Mime Encoded-Word parsing in Objective-C

不羁的心 提交于 2019-12-12 14:23:42

问题


Do OS X or iOS provide an API to parse MIME Encoded-Word? These lovely strings:

=?iso-8859-1?Q?=A1Hola,_se=F1or!?=

Alternatively, is there a known open-source library that does this?


回答1:


I took @NickolayO.'s answer, added base-64 support with QSStrings and shortened the code by using componentsSeparatedByString: and stringByReplacingOccurrencesOfString:withString:.

I've made the code available on GitHub. Here's a snippet for convenience:

@implementation NSString (MimeEncodedWord)

- (BOOL) isMimeEncodedWord
{
    return [self hasPrefix:@"=?"]  && [self hasSuffix:@"?="];
}

+ (NSString*) stringWithMimeEncodedWord:(NSString*)word
{ // Example: =?iso-8859-1?Q?=A1Hola,_se=F1or!?=
    NSArray *components = [word componentsSeparatedByString:@"?"];
    if (components.count < 5) return nil;

    NSString *charset = [components objectAtIndex:1];
    NSStringEncoding encoding = CFStringConvertEncodingToNSStringEncoding(CFStringConvertIANACharSetNameToEncoding((CFStringRef)charset)); // TODO: What happens if the encoding is invalid?

    NSString *encodingType = [components objectAtIndex:2];
    NSString *encodedText = [components objectAtIndex:3];
    if ([encodingType isEqualToString:@"Q"])
    { // quoted-printable
        encodedText = [encodedText stringByReplacingOccurrencesOfString:@"_" withString:@" "];
        encodedText = [encodedText stringByReplacingOccurrencesOfString:@"=" withString:@"%"];
        NSString *decoded = [encodedText stringByReplacingPercentEscapesUsingEncoding:encoding];
        return decoded;
    } else if ([encodingType isEqualToString:@"B"])
    { // base64
        NSData *data = [QSStrings decodeBase64WithString:encodedText];
        NSString *decoded = [[NSString alloc] initWithData:data encoding:encoding];
        return decoded;
    } else {
        NSLog(@"%@ is not a valid encoding (must be Q or B)", encodingType);
        return nil;
    }    
}

@end



回答2:


Here is the code for quoted-printable version (Mime Encoded-word can be quoted-printable or base-64 encoded). For base64 encoded you should do similarly, replacing quoted-printable decoding with base64-decoding. Probably this code requires some testing. Also, it supports only NSString's encodings.

- (NSString*) decodeMimeEncodedWord:(NSString*)word
{
    if (![word hasPrefix:@"=?"] || ![word hasSuffix:@"?="])
        return nil;

    int i = 2;
    while ((i < word.length) && ([word characterAtIndex:i] != (unichar)'?'))
        i++;

    if (i >= word.length - 4)
        return nil;

    NSString *encodingName = [word substringWithRange:NSMakeRange(2, i - 2)];
    NSStringEncoding encoding = CFStringConvertEncodingToNSStringEncoding(CFStringConvertIANACharSetNameToEncoding((CFStringRef)encodingName));
    // warning! can return 'undefined something' if encodingName is invalid or unknown

    NSString *encodedString;

    if ([[word substringWithRange:NSMakeRange(i + 1, 2)] isEqualToString:@"Q?"])
    {
        // quoted-printable
        encodedString = [word substringWithRange:NSMakeRange(i + 3, word.length - i - 5)];
        NSMutableData *binaryString = [[[NSMutableData alloc] initWithLength:encodedString.length] autorelease];
        unsigned char *binaryBytes = (unsigned char*)[binaryString mutableBytes];
        int j = 0;
        char h;     

        for (i = 0; i < encodedString.length; i++)
        {
            unichar ch = [encodedString characterAtIndex:i];
            if (ch == (unichar)'_')
                binaryBytes[j++] = ' ';
            else if (ch == (unichar)'=')
            {
                if (i >= encodedString.length - 2)
                    return nil;

                unsigned char val = 0;

                // high-order hex char
                h = [encodedString characterAtIndex:++i];
                if ((h >= '0') && (h <= '9'))
                    val += ((int)(h - '0')) << 4;
                else if ((h >= 'A') && (h <= 'F'))
                    val += ((int)(h + 10 - 'A')) << 4;
                else
                    return nil;
                // low-order hex char
                h = [encodedString characterAtIndex:++i];
                if ((h >= '0') && (h <= '9'))
                    val += (int)(h - '0');
                else if ((h >= 'A') && (h <= 'F'))
                    val += (int)(h + 10 - 'A');
                else
                    return nil;

                binaryBytes[j++] = val;             
            }
            else if (ch < 256)
                binaryBytes[j++] = ch;
            else
                return nil;
        }

        binaryBytes[++j] = 0;
        [binaryString setLength:j];

        NSString *result = [[NSString alloc] initWithCString:[binaryString mutableBytes] encoding:encoding];        
        // warning! can return 'undefined something' if encoding is invalid or unknown

        return result;
    }
    else if ([[word substringWithRange:NSMakeRange(i + 1, 2)] isEqualToString:@"B?"])
    {
        // base64-encoded       
        return nil;
    }
    else
        return nil;
}


来源:https://stackoverflow.com/questions/14014738/mime-encoded-word-parsing-in-objective-c

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!