l o a d i n g

Automated PDF Data Scraping (Excel Output)

Oct 12, 2024 - Junior

$192.00 Fixed

Description: We are looking for an experienced freelancer or team to extract data from a large volume of PDF files. Due to the scale of this task, we prefer candidates who can develop automated scripts for efficient data extraction or those with a team capable of handling bulk processing. Requirements: • Expertise in Python or other relevant programming languages for PDF parsing • Experience with libraries such as PyPDF2, PDFMiner, or similar tools • Ability to extract structured data accurately (tables, text, etc.) • Handling of different PDF formats and layouts • Data output must be in Excel (.xlsx) format per template provided. Preferred Qualifications: • Prior experience with large-scale data extraction • Strong attention to detail and data accuracy • Ability to process PDFs with varying structures • Team availability for high-volume processing (if applicable) Selection Process: • We will conduct interviews to understand each applicant’s approach and past projects • Sample PDFs will be provided for test scraping before making a hiring decision Additional Information: • Please share relevant past projects or samples • Estimated time frame for completion • Your approach to automating the process Looking forward to your proposals!
  • Proposal: 0
  • 80 days
AuthorImg
Rukhmani Mukhopadhyay Inactive
,
Member since
Jun 24, 2024
Total Job
3