Dima
Failed to read the JSON dataset from HDFS or a local file
I am trying to reproduce the dataset from the paper "Balance-aware distributed string similarity-based query processing system".
However, my code cannot read the data, whether from HDFS or from a local file.

“val sqlContext = new org.apache.spark.sql.SQLContext(sc)
// A JSON dataset is pointed to by path.
// The path can be either a single text file or a directory storing text files.
val path = "hdfs://user/hni/Software.json"
val people = sqlContext.read.json(path)”

Neither memory nor permissions are the problem. Have you run into this issue before?
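One thing that may be worth checking is how the path string in the snippet above is parsed as a URI. The sketch below uses only `java.net.URI` from the JDK, so it needs neither Spark nor a cluster. The object name `HdfsPathCheck` and the helper `parts` are hypothetical names introduced for illustration, and the three-slash variant is an assumption about what might have been intended, not a confirmed fix.

```scala
import java.net.URI

object HdfsPathCheck {
  // Hypothetical helper: returns (authority, path) as java.net.URI parses the string.
  def parts(s: String): (String, String) = {
    val u = new URI(s)
    (u.getAuthority, u.getPath)
  }

  def main(args: Array[String]): Unit = {
    // Path exactly as written in the question: two slashes after the scheme,
    // so the first segment ("user") is parsed as the authority (host), not as a directory.
    println(parts("hdfs://user/hni/Software.json"))

    // Assumed alternative with three slashes: the authority is empty and the
    // whole of /user/hni/Software.json is the path.
    println(parts("hdfs:///user/hni/Software.json"))
  }
}
```

If the first call reports `user` as the authority, the file system client would probably try to contact a NameNode host called `user` instead of opening the `/user/hni` directory. Writing the NameNode host and port explicitly, or using the three-slash form so that the cluster's configured default file system applies, may be worth trying.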