Projects / PDFTextStream / Releases

All releases of PDFTextStream

  •  09 Aug 2012 10:34
Avatar

    Release Notes: PDFTextStream is now free for use in single-threaded applications; all previous "evaluation" limitations no longer apply when PDFTextStream is operated without a license file. A new OutputHandler is now available: com.snowtide.pdf.SelectionOutputTarget, implementing text extraction based on a "selection coordinates", as commonly found in user-facing PDF viewer UIs.

    •  02 Aug 2012 13:20
    Avatar

      Release Notes: This release adds support for decryption of AES-encrypted PDF documents (including support for 256-bit and variable bit length ciphers), and adds dozens of performance and PDF document compatibility enhancements and fixes. PDFTextStream for Java now requires version 1.5.0 or higher of the JVM/JRE, and PDFTextStream.NET now ships with IKVM 0.46.0.1 and requires .NET 2.0 or higher. PDF merge capability (com.snowtide.pdf.util.MergeUtil) has been deprecated, as has memory-mapping of opened PDF files (now disabled by default).

      •  15 Sep 2011 10:56
      Avatar

        Release Notes: This release includes a variety of fixes made to ensure PDFTextStream is capable of extracting text from PDF documents that are nonconforming to the PDF specification. It also includes a variety of performance enhancements.

        •  23 Apr 2009 13:59
        Avatar

          Release Notes: An .isStruckThrough() method was added to com.snowtide.pdf.TextUnit, indicating whether a character has a strikethrough drawn through it. PDFTextStream's support for embedded character mappings was improved. The calculation of whitespace between words has been fixed to properly account for whitespace that is explicitly encoded in the source PDF documents. PDFTextStream's handling of composite content encodings was improved, which previously could fail resulting in some ranges of PDF content being "ignored" during extraction.

          •  30 Dec 2008 19:16
          Avatar

            Release Notes: This release adds support for extracting XFA forms data as XML. It significantly improves the performance of text extraction using VisualOutputTarget. Support for PDF documents larger than 2GB. A fix for a bug where the encodings from embedded Type1 fonts were previously not being applied properly in some circumstances. A fix for a problem where newer content in updated PDF documents was sometimes being ignored. A fix for a problem where PDFDocEncoding-encoded bookmarks and metadata were not being decoded properly. A .getDestinationName() method in com.snowtide.pdf.Bookmark.

            •  05 Apr 2007 15:59
            Avatar

              Release Notes: Support was added for updating text, checkbox, radio button, and choice interactive form fields. Support was added for Kodak print job data extraction (%KDK commands) via com.snowtide.pdf.util.KodakPrintData. The AcroFormField.isReadOnly() function was exposed. ByteBuffer-based buildPDFDocument() functions were added to com.snowtide.pdf.lucene.PDFDocumentFactory. The documentation was improved significantly.

              •  28 Mar 2007 18:36
              Avatar

                Release Notes: This release fixes handling of text spacing that was causing some columnated text to overrun column boundaries improperly. It fixes a problem where text from adjacent lines would be inappropriately intermingled. Unlicensed functionality has been changed so that evaluation use does not require a special evaluation license file; specifically, PDFTextStream will randomize some digits in text extracts when it is operating unlicensed, and the 8-page extract limitation has been removed.

                •  07 Dec 2006 23:12
                Avatar

                  Release Notes: This release adds a com.snowtide.pdf.RegionOutputTarget to support region-specific content extraction. It adds the ability to derive encoding and spatial metrics of Type3 fonts. It adds a pdfts.type3.derive system property to disable derivation if necessary. A problem with com.snowtide.pdf.VisualOutputTarget, where lines would sometimes be inappropriately combined, has been fixed.

                  •  30 Aug 2006 14:55
                  Avatar

                    Release Notes: Indication of corrupted or otherwise unreadable PDF files was improved (com.snowtide.pdf.FaultyPDFException). The pipe(OutputHandler) function was added to com.snowtide.pdf.layout.Line. The "pdfts.mmap.disable" system property option was added to disable memory-mapping of PDF files, which avoids a JDK bug.

                    •  16 Aug 2006 18:15
                    Avatar

                      Release Notes: This release adds builds for .NET and Python, supports the extraction of Chinese, Japanese, and Korean Text, and boosts performance significantly.

                      Screenshot

                      Project Spotlight

                      episoder

                      A tool to tell you about new episodes of your favourite TV shows.

                      Screenshot

                      Project Spotlight

                      BalanceNG

                      A modern software IP load balancer.