cc-lambda
cc-lambda copied to clipboard
Search the common crawl using lambda functions
Not sure if the author is interested in updating this project, but I just wanted to leave a note for anyone else who attempts to use it. Although the project...
pywren logs to cloudwatch every 2 seconds, this fill the log with unnecessary lines that say "Running ..." Send a PR to pywren to make the period between `Running ...`...
Problem: Python's `re` module is slow. This slows down all the common crawl processing and increases the cost of running cc-lambda. Solution: Use `re2` The problem with the solution is...