GerapyAutoExtractor
Bug in the function preprocess4content_extractor
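In the function preprocess4content_extractor, the loop
for child in children(element):
only iterates over the direct children rather than over all nodes. Should it instead be the following?
for descendant in element.iterdescendants():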
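
To make the distinction concrete, here is a minimal sketch of child iteration versus descendant iteration. It uses the standard library's xml.etree.ElementTree rather than lxml, so that it runs without extra dependencies. The helpers direct_children and all_descendants are hypothetical stand-ins written for this illustration; they are not the project's children() helper, whose actual behavior should be checked in the source. In lxml, element.iterdescendants() plays the role of all_descendants here.

```python
import xml.etree.ElementTree as ET

# Small sample tree: <span> and <li> are nested two levels below <div>.
SAMPLE = "<div><p>a<span>b</span></p><ul><li>c</li></ul></div>"


def direct_children(element):
    # Hypothetical stand-in: yields only the immediate children.
    return list(element)


def all_descendants(element):
    # Element.iter() walks the subtree in document order and starts with
    # the element itself, so the first item is skipped to mimic the
    # semantics of lxml's iterdescendants().
    nodes = element.iter()
    next(nodes)
    return list(nodes)


root = ET.fromstring(SAMPLE)
child_tags = [e.tag for e in direct_children(root)]
descendant_tags = [e.tag for e in all_descendants(root)]

print(child_tags)       # ['p', 'ul']
print(descendant_tags)  # ['p', 'span', 'ul', 'li']
```

If the preprocessing loop is meant to reach nested nodes such as the span and li above, a child-only loop would miss them unless the helper recurses internally.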