| Version | Date Released | Status | Release Notes |
| --- | --- | --- | --- |
| 5.2.0 | Mar 2, 2012 | New Release | · iText 5.2.0: better PDF parsing · We received plenty of feedback regarding PDF parsing, and we've taken into account almost all the issues that were reported. As a result, PDF-to-text conversion with iText has improved dramatically. Soon the Belgian IRS will start using iText to parse thousands of documents, looking for a national number on the first page. We use two strategies to do this: we parse the text at a specific position if we know it, or we parse the whole page looking for a pattern if the number can be anywhere on the page. We've also improved the parsing of PDF documents in languages such as Chinese, Korean, and Japanese. |
| 5.1.3 | Dec 4, 2011 | New Release | · iText 5.1.3 brings the long-awaited support for PAdES-LTV. Examples of how to use this functionality, which will be part of the upcoming PDF-2 standard, will follow soon. We've also improved the code for extracting images from PDFs; this now takes much less code than before. We've added support for JavaScript when converting bookmarks to XML (or vice versa), and some convenience methods to ColumnText. · Furthermore, we've fixed plenty of bugs that were reported since the previous release, for instance a bug that garbled the output when parsing PDFs with fonts using a /ToUnicode map, and an issue with page numbers that were shown even when the page label should have been empty. · Finally, we've made iText more tolerant towards PDFs that weren't created correctly, for instance a PDF with a binary zero in the Producer name, or PDFs where the /Outlines dictionary isn't an indirect reference. |
| 5.1.2 | Aug 17, 2011 | New Release | · Added high-level API interfaces Indentable and Spaceable. This makes it easier to apply styles in XML Worker. · Extended Logging to work like logging frameworks do. · Added a simple System.out logger: SysoLogger. · Added a WriterOperation interface, and an abstract WritableDirectElement class that should implement it. This object allows XML Worker to do operations on a writer/document, similar to what VerticalPositionMark does, but different. · An incomplete ToUnicode cmap would prevent correct text extraction. · Fixed a problem where an Exception was thrown in the edge case where renderInfo.getText() returns an empty string. · Bugfix Rectangle: the parameters defining the lower-left coordinate weren't taken into account. · Restore of some changed XMLs. Fix by Jérôme Marant for Latin (see version 2.1 of the extrajars). · Normally, a PDF should have the startxref value in the final 1024 bytes. |
| 5.0.6 | Feb 7, 2011 | New Release | · Added an event to signal that a table will continue on the next page. · HTMLWorker: relative cell widths are supported in tables (Samuli Tuomola). · Tagged XML extraction didn't consider indirect /K objects or page contents with an array. · PdfStamper: non-PDFDocEncoding characters were being wiped out from the producer name. · TiffImage: images with 1 bit per pixel may also use a palette. · Added RectangleReadOnly.setRotation() override to throw an exception if a read-only Rectangle is changed. Parsing PDF: · CMapAwareDocumentFont: Some PDFs have a diff map bigger than 256 characters. · CMapParser: Fix to read broken ToUnicode cmaps. · Cache the fonts used in text extraction. · Added basic support for rich text values in form fields. No appearance support as yet. · Added support for web hex colors without a leading '#', and some tests. · The meaning of transparent (0 or 0x00) and opaque (1, 255, or 0xFF) was switched in the WebColors class. |
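
The 5.0.6 notes mention web hex colors without a leading '#'. As a rough illustration of that kind of lenient parsing, here is a minimal sketch that uses only the standard Java library. The class name `WebHexColorSketch` and the method `parseHex` are hypothetical and are not iText's `WebColors` API; the three-digit shorthand handling is likewise an assumption added for the example.

```java
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of iText.
final class WebHexColorSketch {

    // Parses "#rrggbb", "rrggbb", "#rgb" or "rgb" into {red, green, blue},
    // each component in the range 0-255.
    static int[] parseHex(String text) {
        String hex = text.trim();
        // The leading '#' is optional: strip it when present.
        if (hex.startsWith("#")) {
            hex = hex.substring(1);
        }
        // Accept only 3 or 6 hexadecimal digits (this also rejects signs).
        if (!hex.matches("[0-9a-fA-F]{3}|[0-9a-fA-F]{6}")) {
            throw new IllegalArgumentException("Not a web hex color: " + text);
        }
        // Expand the shorthand form, e.g. "f80" becomes "ff8800".
        if (hex.length() == 3) {
            StringBuilder expanded = new StringBuilder(6);
            for (char c : hex.toCharArray()) {
                expanded.append(c).append(c);
            }
            hex = expanded.toString();
        }
        int rgb = Integer.parseInt(hex, 16);
        return new int[] {(rgb >> 16) & 0xFF, (rgb >> 8) & 0xFF, rgb & 0xFF};
    }

    public static void main(String[] args) {
        // Both spellings should yield the same components.
        System.out.println(Arrays.toString(parseHex("#ff8000")));
        System.out.println(Arrays.toString(parseHex("ff8000")));
    }
}
```

Validating the digits before calling `Integer.parseInt` keeps inputs such as `-fffff` from being accepted as a negative number.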