文章/答案/技术大牛

发布

问过滤网站上的特定评论
EN

Stack Overflow用户

提问于 2018-08-16 13:47:41

回答 2查看 58关注 0票数 0

#!/usr/bin/env python
# -*- coding: utf-8 -*-
import urllib2
#import re
from BeautifulSoup import BeautifulSoup

headers = {'User-Agent': 'Mozilla/5.0'}

req = urllib2.Request('https://www.sikayetvar.com/onedio', 
None,headers)
resp  = urllib2.urlopen(req)
html = resp.read()
soup = BeautifulSoup(html)

complaints = soup.findAll('p', attrs = {'class' : 'complaint-summary'})


for complaint in complaints:
   if complaint.text.find("genç") is not -1:
      print complaint.text

我想过滤某些网站上有特定单词的投诉，但我无法搜索其中包含nonascii字符的单词。我用的是python2.7和漂亮的汤。知道为什么会这样吗？

beautifulsoup

web

python

回答 2

Stack Overflow用户

回答已采纳

发布于 2018-08-16 14:14:10

如果您的测试在p标记内，YouTube应该将od语句更改为

#!/usr/bin/env python
# -*- coding: utf-8 -*-
import urllib2
from BeautifulSoup import BeautifulSoup

headers = {'User-Agent': 'Mozilla/5.0'}

req = urllib2.Request('https://www.sikayetvar.com/onedio', 
None,headers)
resp  = urllib2.urlopen(req)
html = resp.read()
soup = BeautifulSoup(html)

complaints = soup.findAll('p', attrs = {'class' : 'complaint-summary'})

for complaint in complaints:
    if b"genç".decode("utf-8") in complaint.text:
        print(complaint.text)

票数 0

Stack Overflow用户

发布于 2018-08-16 17:13:33

请勿使用python2。他们将在未来几年停止对它的支持。

import requests
from bs4 import BeautifulSoup 

response = requests.get('https://www.sikayetvar.com/onedio',headers = {'User-Agent': 'Mozilla/5.0'})

soup = BeautifulSoup(response.content,'lxml')

complaints = soup.select('p.complaint-summary')
for complaint in complaints:
    if "genç" in complaint.text:
        print(complaint.text.strip())

输出将是

Ne yazık ki bir sosyal sitede ahlak dışı içerikli haberler durulmuyor. Çocuk ve gençler için sakıncalı olduğunu düşünüyorum. Fotoğraflarda saçma başlıkları görebilirsiniz. Başlıklardan anlaşılacağı üzere cinsel…

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/51870338

复制

相似问题

问过滤网站上的特定评论
EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问过滤网站上的特定评论EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问过滤网站上的特定评论
EN