Slow speed comparing to Python lxml #1087

gryznar · 2024-05-13T07:46:26Z

I am previous user of lxml. Unfortunatelly parsing using lxml was much faster comparing to html. Maybe there are places to improve it via applying some solutions from lxml? The biggest drop is in creating Document from String, especially for big sites

The text was updated successfully, but these errors were encountered:

HosseinYousefi · 2025-04-29T14:24:07Z

Recently a PR with some performance improvements has been merged. Can you check if the performance is now more comparable? If not, could you provide an example preferably with the benchmark harness for python as well that demonstrates the difference in speed so I can take a look into it?

gryznar · 2025-05-02T16:19:39Z

Yeah, I'll try it in free time. Thanks for the improvements!

mosuem transferred this issue from dart-archive/html Oct 29, 2024

mosuem added the package:html label Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Slow speed comparing to Python lxml #1087

Slow speed comparing to Python lxml #1087

gryznar commented May 13, 2024

HosseinYousefi commented Apr 29, 2025

gryznar commented May 2, 2025

Slow speed comparing to Python lxml #1087

Slow speed comparing to Python lxml #1087

Comments

gryznar commented May 13, 2024

HosseinYousefi commented Apr 29, 2025

gryznar commented May 2, 2025