hjwilli

Results 10 issues of hjwilli

Update the dataset preview page with more data analysis plots. (Notes from our initial pass at this are in issue #209, in particular some ui ideas are here https://github.com/EpistasisLab/pennai/issues/209#issuecomment-503251165 )

The serialized recommenders will need to be periodically retrained- for example as we update python packages, add new experiment configurations, or update what information the serialized recs contain. We want...

Clean up unnecessary files, references to local paths, etc. Some dirs to check: - [x] /mock_experiment - [ ] /ai - [x] /ai/metalearning/*.gz - [ ] /ai/metalearning/metalearning.ipynb (references local paths)...

The methods used to generate the metafeatures are currently calculating which columns are categorical by examining the data. For example, if file that contains some strings in a column, that...

Indicate that base.py is abstract in some way, perhaps by using the [abstract base class module](https://docs.python.org/3/library/abc.html) or raising a `NotImplementedError` if a base instance is instantiated. (see https://stackoverflow.com/questions/13646245/is-it-possible-to-make-abstract-classes-in-python)

**Service discovery** Service discovery is the automatic discovery and registration of services (in this case machine instances) that are available to use. With the current setup, during the machine initialization...

Ticket for keeping track of overall discussion/issues about the 'Open Source PennAI' milestone. --- **basic user and developer documentation** - [ ] User docs point to an intro to ML,...

Discussion

[PM2](http://pm2.keymetrics.io/) is being used to keep process alive in the `lab` ([the main server](https://github.com/EpistasisLab/pennai/blob/master/dockers/lab/files/entrypoint.sh#L8) and [ai processes](https://github.com/EpistasisLab/pennai/blob/master/dockers/lab/files/entrypoint.sh#L39)) and `machine` ([the main server process](https://github.com/EpistasisLab/pennai/blob/ff808c7565c1376f4edba40b83fc77f056de9012/dockers/lab/files/entrypoint.sh#L39)) docker container instances. PM2 has a [pm2-runtime](http://pm2.keymetrics.io/docs/usage/docker-pm2-nodejs/)...

Add an api call that performs first-pass validation on a dataset for use when uploading a new dataset, See registerDatafile() method in `lab\init\loadInitialDatasets.py`. Needed for #119 --- Proposed dataset staging...

Currently, when an ml algorithm completes result data and images are produced by skl_utils.py as files in a temporary folder. machine.js watches this folder and uses the lab API to...