You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In a sample of 2 pdfs that I converted into TEI with Grobid, I noticed that text lines preceding a table were sometimes dropped and did not appear in the final TEI file. The following screenshots come from two consecutive pages of a pdf file:
The text underlined in yellow was ignored by Grobid, as you can see in the TEI file. The text of the table was ignored as well by Grobid.
The text was updated successfully, but these errors were encountered:
@panagiotis-tsolakis the text missing was a bug in 0.8.1, which has been fixed in the current master, so the text is not lost. The table, unfortunately is blended into the text, the recognition should hopefully improve with #963
In a sample of 2 pdfs that I converted into TEI with Grobid, I noticed that text lines preceding a table were sometimes dropped and did not appear in the final TEI file. The following screenshots come from two consecutive pages of a pdf file:
The text underlined in yellow was ignored by Grobid, as you can see in the TEI file. The text of the table was ignored as well by Grobid.
The text was updated successfully, but these errors were encountered: