cc-lambda
Use re2
Problem: Python's built-in re module is slow. This slows down all Common Crawl processing and increases the cost of running cc-lambda.
Solution: Use re2
The catch is that pywren does not install re2 automatically, because re2 has both Python and C parts. I asked how to solve this in the pywren repository but have not received an answer yet.
Potential solutions:
- https://markn.ca/2018/02/python-extension-modules-in-aws-lambda/
- https://anaconda.org/conda-forge/re2
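Until the packaging question is settled, the code can be written so that it runs with or without re2. The following is a minimal sketch, not the project's actual code: it assumes the Python bindings are importable as `re2` and expose an `re`-compatible `compile`/`findall`, and the names `LINK_PATTERN` and `extract_links` are hypothetical.

```python
# Prefer re2 when its compiled extension is available in the environment;
# otherwise fall back to the standard library so the job still runs (slower).
try:
    import re2 as regex  # assumption: the bindings import under the name `re2`
    USING_RE2 = True
except ImportError:
    import re as regex
    USING_RE2 = False

# Hypothetical pattern for illustration; it avoids features such as
# backreferences and lookarounds, which RE2 does not support.
LINK_PATTERN = regex.compile(r"https?://[^\s<>]+")


def extract_links(text):
    """Return all http(s) URLs found in text."""
    return LINK_PATTERN.findall(text)
```

Logging `USING_RE2` at the start of each invocation would show whether a given deployment actually picked up the faster engine.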