UnicodeEncodeError: 'gbk' codec can't encode character: illegal multibyte sequence

前端未结

关注

 3  1676

伪装坚强ぢ 2020-12-05 17:20

I want to get html content from a url and parse the html content with regular expression. But the html content has some multibyte characters. So I met the error described in

3条回答

自闭症患者 (楼主)

2020-12-05 17:38
Try
```
open(file, 'r', encoding='utf-8')
```
instead of
```
open(file, 'r')
```
0 讨论(0)

查看其它3个回答
发布评论:

提交评论
- 加载中...