Dima
Dima copied to clipboard
simialrity join or search on spark core directly
我尝试着复现论文Balance-aware distributed string similarity-based query processing system里面的数据集 然而我的代码却无法读取数据,无论是hdfs还是本地文件  “val sqlContext = new org.apache.spark.sql.SQLContext(sc) // A JSON dataset is pointed to by path. // The path can be either a...
For diversity reasons, it would be nice to try to avoid 'master' and 'slave' terminology in this repository which can be associated to slavery. The master-slave terminology could be problematic...
Hi! Sorry for disturbing you with another bug report. Recently, when I tried to run Dima on the [com-LiveJournal dataset from SNAP Datasets](http://snap.stanford.edu/data/com-LiveJournal.html), I met another exception. The exception is...
1. Is there any document on how to use Dima in the program? 2. Is there any technical report on Dima? Due to the length limit, the paper on VLDB...