Edward Kim

Results 32 issues of Edward Kim

## Description ## Checklist - [ ] I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/nv-ingest/blob/main/CONTRIBUTING.md). - [ ] New or existing tests cover these changes. - [ ] The documentation is...

## Description ## Checklist - [ ] I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/nv-ingest/blob/main/CONTRIBUTING.md). - [ ] New or existing tests cover these changes. - [ ] The documentation is...

## Description Resolves #722 This PR introduces a new capability to the nv-ingest-client.Ingestor class, allowing users to save the results of an ingestion process directly to disk. This is particularly...

## Description Resolves #722 This PR introduces a new capability to the nv-ingest-client.Ingestor class, allowing users to save the results of an ingestion process directly to disk. This is particularly...

## Description ## Checklist - [ ] I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/nv-ingest/blob/main/CONTRIBUTING.md). - [ ] New or existing tests cover these changes. - [ ] The documentation is...

### Is this a new feature, an improvement, or a change to existing functionality? New Feature ### How would you describe the priority of this feature request Significant improvement ###...

feature request

## Description This PR enhances the extraction capabilities for Microsoft Office documents (`.docx` and `.pptx`) by introducing LibreOffice into the container environment. It introduces two major features: 1. OLE object...

## Description ```shell docker compose -f docker-compose.yaml -f docker-compose.oss-ocr.yaml \ --profile table-structure --profile retrieval up -d --build ``` If you get rate-limited by Hugging Face, add your HF token in...

## Description ## Checklist - [ ] I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/nv-ingest/blob/main/CONTRIBUTING.md). - [ ] New or existing tests cover these changes. - [ ] The documentation is...

## Description This PR introduces a new method `ingest_in_chunks` to the Ingestor class. ### Before Currently, users who need to process very large datasets that don't fit on disk must...