Loading an HTML file with the following content results in a stack overflow and stops the running application:
Calling DocumentModel.Load() on the file and then GetPaginator() on the loaded document reproduces the issue.
This also applies to setting the width or other style elements in %, as long as the parent element does not contain a style with an absolute value (in px for example). So setting the body width to 1000px solves the issue.
Is there a possibility to set the body width/height of the document before getting the Paginator? Or is there a way to throw an exception before the application stops due to the stack overflow?
I was unable to reproduce your issue, I tried the following:
string html = @"<html>
var document = new DocumentModel();
I also tried loading that content from the “input.html” file, still no issue occurred.
with another HTML file I have a similar issue now. The application does not get a stack overflow, but the paginating (= loading the document and then calling GetPaginator().Pages) takes more than 5 minutes which causes my application to throw a Timeout Exception.
The HTML file is not large, it is the body of a newsletter mail. I uploaded it → here ← .
I tried it with the newest version of GemBox.Document. Could you have a look at it with this HTML content?
thank you for the suggested workaround.
Since we try to avoid analysing the content of the documents that we process with our application in detail, we will continue to solve the problem by catching the TimeoutException.
Should I create a Feature Request for faster processing of HTML files with nested tables? Or is the issue already being worked on?
We already have a backlog ticket for this and I’ve added your report to it in order to increase its priority.
But at the moment I cannot say when this could become available.
Please note that we prioritize greater time investments by the number of users requesting them and currently we’re working on some other features that have greater priority.