Thamme Gowda

Results 36 issues of Thamme Gowda

Closes #21 (replaced GFile with pathlib). Added CLI `--average` that outputs (mean, median) scores.
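A minimal sketch of what an `--average` flag that reports (mean, median) could look like. The parser layout, the positional `scores` argument, and the function names `summarize`, `build_parser`, and `main` are assumptions for illustration, not the actual CLI from this pull request.

```python
import argparse
import statistics


def summarize(scores):
    """Return (mean, median) for a non-empty sequence of numeric scores."""
    return statistics.mean(scores), statistics.median(scores)


def build_parser():
    # Hypothetical CLI: the real tool's arguments are not shown in the source.
    parser = argparse.ArgumentParser(description="Hypothetical scoring CLI")
    parser.add_argument("scores", nargs="+", type=float,
                        help="one or more per-segment scores")
    parser.add_argument("--average", action="store_true",
                        help="print (mean, median) instead of individual scores")
    return parser


def main(argv=None):
    args = build_parser().parse_args(argv)
    if args.average:
        mean, median = summarize(args.scores)
        print(f"{mean:.4f}\t{median:.4f}")
    else:
        for score in args.scores:
            print(score)
```

Calling `main(["0.5", "0.7", "0.9", "--average"])` would print the mean and median on one tab-separated line; without the flag, each score is printed on its own line.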

#### Issue Description

Build fails

```
[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary for sparkler-parent 0.2.2-SNAPSHOT:
[INFO]
[INFO] sparkler-parent .................................... SUCCESS [ 0.003 s]
[INFO] sparkler-tests-base ................................ SUCCESS [ 1.374 s]
[INFO]...
```

The first subtask is defining and expressing the **focus crawling** specification. The second subtask will be implementing that specification in Sparkler. Currently, we have support for URL-based focus/filters. This...

## Background

Injector uses a URLValidator utility to validate URLs before injection.

## Problem

The URL validator used in the injector is too strict and often rejects valid URLs. Example: we...

There are two FIXMEs in configuration: First, support loading `sparkler-defaults.yaml` and `sparkler-site.yaml`. The common practice is that `*-default.yaml` provides the default and recommended values from developers. The `*-site.yaml` should be used by users...
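The defaults-plus-site layering described here can be sketched as a recursive merge in which site values override developer defaults. This is only an illustration under stated assumptions: the function name `merge_config` and the example keys (`crawl`, `depth`, `delay`) are hypothetical, and the dictionaries stand in for the parsed contents of the two YAML files.

```python
from copy import deepcopy


def merge_config(defaults, site):
    """Return a new dict where values from `site` override `defaults`.

    Nested dicts are merged key by key; any other value is replaced outright.
    Neither input is modified.
    """
    merged = deepcopy(defaults)
    for key, value in site.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_config(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


# Hypothetical parsed contents of the defaults and site files.
defaults = {"crawl": {"depth": 2, "delay": 1000}, "name": "default-job"}
site = {"crawl": {"depth": 5}}

config = merge_config(defaults, site)
```

With this approach, users only list the keys they want to change in the site file, and everything else falls back to the developer-provided defaults.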

+ Review if this can be generalised as `Parser`
+ Generalise schema to fit all possible extractions that may come up in the future

This plugin shall customize the URL selection for fetching.

Sub tasks:

+ Define Scoring plugin interface
+ Port over Cosine Similarity from Nutch to Sparkler
+ Port over Naive Bayes Filter from Nutch to Sparkler
+ Integrate Domain Explorer...
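For context on the cosine similarity subtask, here is a minimal sketch of cosine similarity over whitespace-tokenized term-frequency vectors. It is a generic illustration, not the Nutch implementation being ported; the function name `cosine_similarity` and the tokenization are assumptions.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine similarity between term-frequency vectors of two texts.

    Tokenization is a plain lowercase whitespace split (an assumption made
    for brevity). Returns 0.0 when either text has no tokens.
    """
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    # Dot product over the terms the two texts share.
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

A scoring plugin could compare each fetched page against a reference text describing the target domain and rank outgoing links by the resulting score; that wiring is not shown here.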

enhancement

We have been using the pdfLaTeX compiler/engine as the default, but as we know, it isn't Unicode (non-Latin) friendly. Though the instructions suggest using XeLaTeX, the generated PDF looks different in...