atarashi
atarashi copied to clipboard
Atarashi scans for license statements in open source software, focusing on text statistics. Designed to work stand-alone and with FOSSology.
## Description Add a agent for [bm25Okapi](https://en.wikipedia.org/wiki/Okapi_BM25) agent. ## Files - **agents/bm25.py** ## How to use - Running **bm25** agent `atarashi -a bm25 /path/to/file.c` - Checking accuracy using **evaluator.py** `python3...
## Description Added a agent for [Linear Support vector machine](https://scikit-learn.org/stable/modules/generated/sklearn.svm.LinearSVC.html) model. ## Files - **agents/linearsvc.py** (Python file to run linearsvc model) ## How to use - Running **linearsvc** agent `atarashi...
# Description New Open Source License Scanning Algorithm: **Semantic Text Similarity** find similarity between documents according to its semantics. The [Gensim](https://radimrehurek.com/gensim/models/doc2vec.html) implementation of [Doc2Vec](https://cs.stanford.edu/~quocle/paragraph_vector.pdf) converts the whole document (unlike word2vec)...
## Description Added a agent for logistic regression model. ## Files - **agents/logisticRegression.py** (Python file to run logreg model) ## How to use - Running **logisticRegression** agent `atarashi -a logisticRegression...
Hello. I've been playing around with some _parameters_ of the TF-IDF agent. I've found that if we stop using a threshold (`cosine similarity >= 0.30`) to filter the match results,...
The evaluation script should - Allow to print a comparison table with all the algorithms supported by atarashi. You can find examples of comparison tables in #95 and #65. -...
Added debian folder to debianize atarashi. ## Creating Debian packages - Install build dependencies from `debian/control` - Install `code_comment` and other dependencies from requirements.txt ```sh # python3 -m pip install...
We can improve the performance of atarahi, nirjas, and others by using Numba and RAPIDS by Nvidia. Regular NumPy, pandas, and other libraries are slow. Maximum amount of time is...
# Description `argparse` is a parser for command-line options, arguments, and sub-command. Read Docs: https://docs.python.org/3/library/argparse.html Currently, `argparse` is used as command line parser in `atarashi` and we're planning to shift...
# Description Refactored to code to shift from argparse module to plac command line parser Closes #63