extractcode
extractcode copied to clipboard
A mostly universal file extraction library and CLI tool to extract almost any archive in a reasonably safe way on Linux, macOS and Windows.
Hi, I'm running into a problem with certain .lz4 and also .jar files. Example (lz4): ``` $:~/SCAN_IMAGES/release-1.13.zip-extract$ ~/scancode-toolkit/extractcode ./release/deploy_artifacts/router.tar.gz-extract/router.tar-extract/0f6a1467d0c8a8fce8ea65eedd0d2ee6e23f979498d128a0101318c7549f90a6/layer.tar-extract/var/lib/apt/lists/deb.debian.org_debian_dists_bullseye_main_binary-amd64_Packages.lz4 Extracting archives... [####################] 4 ERROR extracting: /home/joe/SCAN_IMAGES/release-1.13.zip-extract/release/deploy_artifacts/router.tar.gz-extract/router.tar-extract/0f6a1467d0c8a8fce8ea65eedd0d2ee6e23f979498d128a0101318c7549f90a6/layer.tar-extract/var/lib/apt/lists/deb.debian.org_debian_dists_bullseye_main_binary-amd64_Packages.lz4: Unrecognized archive format Extracting...
Use the underscore name 'console_scripts' instead. By 2023-Sep-26, you need to update your project and remove deprecated calls or your builds will no longer be supported.
See: - https://www.bamsoftware.com/hacks/zipbomb/ - https://github.com/python/cpython/commit/d05bac0b74153beb541b88b4fca33bf053990183 - https://github.com/python/cpython/issues/109858
The extractcode doc at https://scancode-toolkit.readthedocs.io/en/stable/tutorials/how_to_extract_archives.html doc doesn't mention the "--ignore" option at all. it's quite an important option to avoid wasting time on unnecessary files and also for preventing extractcode...
``` bash ❯ extractcode --all-formats libmediainfo-0.7.43.diff Extracting archives... [####################] 4 ERROR extracting: ./libmediainfo-0.7.43.diff: sequence item 0: expected str instance, bytes found Extracting done. ```
Parse source map files and treat them as if they were archives containing files. fixes https://github.com/nexB/scancode-toolkit/issues/3637
update the `requirements.txt` to include the new spdx-tools release Signed-off-by: Armin Tänzer
Patch-ng is more widely packaged nowadays.
There are some .pkg files that 7zip is able to do the extraction while extractcode fails to do so. Note that I have already ran with the `--all-formats`
The idea is to build a set containing the hashes of already extracted archives along each filesystem branch. If a new archive is detected that is already present in the...