How To Extract Pdf Using WebDriver

No more concern for reading the PDF doc.We can extract PDF using selenium.This post helps you out
Download pdfbox-1.8.8 Jar file and add it to your BuildPath

import org.apache.pdfbox.cos.COSDocument;
import org.apache.pdfbox.pdfparser.PDFParser;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.util.PDFTextStripper;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
public class SimpleTest {
public static void main(String args[]) throws IOException {
PDFTextStripper pdfStripper = null;
PDDocument pdDoc = null;
COSDocument cosDoc = null;
File file = new File();
PDFParser parser = new PDFParser(new FileInputStream(file));
parser.parse();
cosDoc = parser.getDocument();
pdfStripper = new PDFTextStripper();
pdDoc = new PDDocument(cosDoc);
String parsedText = pdfStripper.getText(pdDoc);
System.out.println(parsedText);
}
}

Become a Software Tester

How To Extract Pdf Using WebDriver

Post a Comment

Post a Comment

Most Popular

Fillo is an Excel API for Java and you can query xls & xlsx files. Now, it supports SELECT, UPDATE & INSERT queries with or without WHERE clause.

Top 10 Interview Questions and Answers

Test Plan Template

Tags

Popular

Recent Posts

Contact Form

How To Extract Pdf Using WebDriver

You might like

Post a Comment

Post a Comment

Most Popular

Fillo is an Excel API for Java and you can query xls & xlsx files. Now, it supports SELECT, UPDATE & INSERT queries with or without WHERE clause.

Top 10 Interview Questions and Answers

Test Plan Template

Tags

Popular

Recent Posts

Contact Form