GerapyPlaywright issues

playwright._impl._api_types.Error: Browser closed.

1

这种报错会是什么原因呢... 2022-03-23 06:21:06 [scrapy.core.scraper] ERROR: Error downloading Traceback (most recent call last): File "/usr/local/lib/python3.8/dist-packages/twisted/internet/defer.py", line 1656, in _inlineCallbacks result = current_context.run( File "/usr/local/lib/python3.8/dist-packages/twisted/python/failure.py", line 489, in throwExceptionIntoGenerator return g.throw(self.type, self.value,...

yyyy777

gzip.BadGzipFile: Not a gzipped file (b'<!')

2

使用Sscrapy的时候，利用 yield PlaywrightRequest(article_url, callback=self.parse_result, wait_until='domcontentloaded', headers=self.headers) 出现错误：gzip.BadGzipFile: Not a gzipped file (b'

legend-zl

actions可以带参数吗

![image](https://user-images.githubusercontent.com/47239789/231372831-7f2f73dd-c622-45bc-ba65-da84d1d91310.png) 想要在actions那里给page对象传参数，不知道应该怎么写才好

duanyaozong

在添加代理的时候需要修改一下（如下图），否则会报错

![image](https://user-images.githubusercontent.com/43235339/209943799-136f2bfb-c07c-4133-b1b4-3f9d4430da5b.png)

oldsiks

好像不支持windows

1

源码在windows 平台运行报错

oldsiks

你好，程序报错，帮忙看看哪里有问题

7

**程序时原封不动的运行我的scrapy版本时2.5** ` 2021-12-29 14:10:14 [scrapy.utils.log] INFO: Scrapy 2.5.0 started (bot: example) 2021-12-29 14:10:14 [scrapy.utils.log] INFO: Versions: lxml 4.6.3.0, libxml2 2.9.5, cssselect 1.1.0, parsel 1.6.0, w3lib 1.22.0, Twisted 21.2.0, Python...

superniao666

Method is not Json serializable for actions

Hello, I am running into an issue with 'yield' for gerapy-playwright when I need to access a login page. I try to run: yield PlaywrightRequest(login_page_url, self.parse_login, actions = self.login_action) in...

BenzTivianne

出现gzip.BadGzipFile: Not a gzipped file (b'<!') 的解决办法。一处bug

3

如何爬去的一个网站返回的response里面的headers包含了 content-encoding: "gzip"的话，那么就会报上述错误，虽然作者在 downloadermiddlewares.py 的代码段中去掉了这个属性： # Necessary to bypass the compression middleware # 这个地方只能去掉 headers 中的content-encoding，但是response.headers中的依然存在，所以下面应该直接改为 headers=headers, headers = response.headers headers.pop('content-encoding', None) headers.pop('Content-Encoding', None) response = HtmlResponse( page.url, status=response.status, headers=response.headers,...

legend-zl

GERAPY_PLAYWRIGHT_PRETEND = True 对CSDN不起作用

1

使用大神代码中伪装的脚本，添加到Playwright后还是无法骗过CSDN

legend-zl

This package does not work on Windows

3

![image](https://user-images.githubusercontent.com/49141693/147732578-42efcbea-ccef-4722-a3b7-27ec1fa9ed5c.png) 环境说明： scrapy 2.5.1 gerapy_playwright 0.2.2 Python 3.6

xiaoxiao0819

GerapyPlaywright
GerapyPlaywright copied to clipboard

Metadata

playwright._impl._api_types.Error: Browser closed.

gzip.BadGzipFile: Not a gzipped file (b'<!')

actions可以带参数吗

在添加代理的时候需要修改一下（如下图），否则会报错

好像不支持windows

你好，程序报错，帮忙看看哪里有问题

Method is not Json serializable for actions

出现gzip.BadGzipFile: Not a gzipped file (b'<!') 的解决办法。一处bug

GERAPY_PLAYWRIGHT_PRETEND = True 对CSDN不起作用

This package does not work on Windows

← Metadata

Owner

Metadata

GerapyPlaywright GerapyPlaywright copied to clipboard

Metadata

← Metadata

Owner

Metadata

GerapyPlaywright
GerapyPlaywright copied to clipboard