Normalize whitespace with Python

前端 未结 3 1455
野趣味
野趣味 2021-01-12 11:57

I\'m building a data extract using scrapy and want to normalize a raw string pulled out of an HTML document. Here\'s an example string:

  Sapphire RX460 OC           


        
3条回答
  •  灰色年华
    2021-01-12 12:16

    You can use a function like below with regular expression to scan for continuous spaces and replace them by 1 space

    import re
    
    def clean_data(data):
        return re.sub(" {2,}", " ", data.strip())
    
    product_title = clean(product.css('h3::text').extract_first())
    

    And then improve clean function anyway you like it

提交回复
热议问题