Status
This is used as a status report.
7/20: Initial benchmarks for roadmap and ucsc data sources are established. The ucsc_igd is created from data source ucsc_sorted in /sfs/qumulo/qproject/shefflab/iGD. The giggle index folder for ucsc is about 640GB in size, the compressed file ucsc_index.tgz is ~120GB and located in /sfs/qumulo/qproject/shefflab/iGD. Install giggle to use it.
7/23: Large-scale data sources roadmap and ucsc for both iGD and giggle are available at: .../www/igd.
7/24-7/25: Add dynamic search functionality. Find large data sources with signal values for testing dynamic search.
7/26: iGD is ready for external test.
7/27: Add scripts for figure
8/2: Database for testing dynamic search is added to www/igd: tsbf_igd.
8/6: processing Cistrome data; double check the result difference with giggle
Finished Figure 1. Try to make iGD faster.
Found a new solution to the general interval search problem--it should be much better than the standard B+ tree algorithm. Will be implemented.