May cause a dead loop when decoding some website.
var httpService = new HTTPService(null); var strData =await httpService.Get(new Aiursoft.Pylon.Models.AiurUrl("https://bsaber.com/"),false); var htmlDoc= HtmlConvert.DeserializeHtml(strData);
就解析 https://bsaber.com/ 这个网站 DeserializeHtml 一直不返回,测试过最长时间 大概 10分钟
nuget上最新的版本
Ok. Will investigate.
It seems that it was because the long script tag which caused the regex stop working. We will try this solution to resolve this:
- Remove all script tags in the HTML.
- Remove all link tags in the HTML
And after that and start decoding it might solve that.
We will release the resolved edition in several days. Please be patient for the next release.
By the way, thanks for your contribution!

It seems that those tags in scripts which caused the code stop working.
It seems that those tags in scripts which caused the code stop working.
所以 我应该在处理html的时候去处理掉这些标记.或者能够修改正则表达式 忽略掉它们.(啥时候更视频~),23333.
The regex expression is very very hard to change and may impact current running projects. I may try to write another extension methods to remove all scripts and links in HTML which may impact my HTML analyzing.
BTW, Video updates will be in several days.
好的 谢谢,那么 我想想咋手动替换它们 哈哈哈.
It seems that it was not related to those scripts. I am trying to debug this.
As I found, the following code will cause this project stacked.
<!DOCTYPE html>
<html>
<head>
<title>Edi.Wang</title>
<meta name="theme-color" content="#2a579a">
<meta charset="utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1, user-scalable=no" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
<meta name="keywords" content="eeee" />
</head>
<body></body>
</html>
I am still wondering why.
我已经定位了Bug。在一个tag下有太多并行的同种tag时,类似上面的情况,就会导致程序判断标签起始点的时间复杂度呈指数上升,从而极大的消耗时间。
现在直接在master的代码的Example3中就能重现此bug。
啊这个Bug太难了……不会(摊手