
feat: Add PDF Table Extract Tool

Open • sachinspanicker opened this issue 1 year ago • 1 comment

PDF Table Extract Tool

Description

Add a new PDFTableExtractTool that extracts tables from PDF documents and converts them to markdown format.

Features

  • Extract tables from PDF documents
  • Convert tables to markdown format
  • Handle multiple tables and large tables
  • Support both sync and async operations
  • Comprehensive error handling
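
The extract-then-convert flow and the sync/async pairing listed above could look roughly like the sketch below. This is not the code from this PR: the function names (rows_to_markdown, extract_tables_as_markdown, extract_tables_as_markdown_async) are hypothetical, the markdown rendering is done by hand rather than through pandas and tabulate, and the extraction step assumes a PyMuPDF version that provides Page.find_tables() (1.23 or later).

```python
import asyncio
from typing import List, Optional, Sequence

Row = Sequence[Optional[str]]


def rows_to_markdown(rows: Sequence[Row]) -> str:
    """Render rows (first row is the header) as a pipe-delimited markdown table."""
    if not rows:
        return ""
    width = max(len(row) for row in rows)
    if width == 0:
        return ""

    def fmt(row: Row) -> str:
        # Pad short rows and turn None cells into empty strings.
        cells = [c if c is not None else "" for c in row] + [""] * (width - len(row))
        # Escape pipes and flatten newlines so each row stays on one line.
        cells = [str(c).replace("|", "\\|").replace("\n", " ").strip() for c in cells]
        return "| " + " | ".join(cells) + " |"

    header, *body = rows
    separator = "| " + " | ".join(["---"] * width) + " |"
    return "\n".join([fmt(header), separator, *(fmt(r) for r in body)])


def extract_tables_as_markdown(pdf_path: str) -> List[str]:
    """Return one markdown string per table found in the PDF (hypothetical helper)."""
    # PyMuPDF is imported lazily so rows_to_markdown stays usable without it.
    import fitz

    results: List[str] = []
    with fitz.open(pdf_path) as doc:
        for page in doc:
            # find_tables() returns a finder object; .tables is the list of hits.
            for table in page.find_tables().tables:
                results.append(rows_to_markdown(table.extract()))
    return results


async def extract_tables_as_markdown_async(pdf_path: str) -> List[str]:
    """Async variant: run the blocking extraction in a worker thread."""
    return await asyncio.to_thread(extract_tables_as_markdown, pdf_path)
```

Keeping the rendering step separate from the PDF parsing makes the conversion testable without a sample PDF, and the async variant simply delegates to the sync one so both paths share the same behavior.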

Implementation

  • Added PDFTableExtractTool class
  • Added comprehensive test suite
  • Added documentation with usage examples
  • Implemented proper error handling
  • Added type hints and docstrings

Dependencies

Added to pyproject.toml:

  • PyMuPDF
  • pandas
  • tabulate

Testing

All tests passing:

  • Basic functionality
  • Error handling
  • Edge cases
  • Async operations

Documentation

  • Added detailed README
  • Added usage examples
  • Added inline documentation

sachinspanicker, Nov 27 '24 11:11

Looks good, but it's missing an __init__ import, if you don't mind adding it :)

joaomdmoura, Dec 05 '24 14:12