database-stream-processor

Add LDBC Benchmarks

Open Kixiron opened this issue 3 years ago • 2 comments

LDBC Benchmarks are a great source of tests that really home in on the things we need to benchmark:

  • LDBC Social Network Benchmark (LDBC-SNB)
    • The Social Network Benchmark's Interactive workload focuses on transactional graph processing, with complex read queries that access the neighbourhood of a given node in the graph and update operations that continuously insert new data into the graph
    • The Social Network Benchmark's Business Intelligence workload focuses on aggregation- and join-heavy complex queries that touch a large portion of the graph, with microbatches of insert/delete operations
  • LDBC Graphalytics Benchmark (LDBC Graphalytics) focuses on large-scale graph analysis
  • LDBC Semantic Publishing Benchmark (LDBC-SPB): SPB performance is measured by producing a workload of CRUD (Create, Read, Update, Delete) operations which are executed simultaneously. The benchmark offers a data generator that uses real reference data to produce datasets of various sizes and tests the scalability of RDF systems. The benchmark workload consists of (a) editorial operations that add new data or alter or delete existing data, and (b) aggregation operations that retrieve content according to various criteria
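
To make the shape of the SNB Interactive workload concrete, here is a minimal sketch of its two halves: update operations that insert edges, and a read query over the neighbourhood of a given node. Everything in it is an assumption for illustration. `SocialGraph`, `insert_knows`, and `neighbourhood` are hypothetical names, not part of LDBC or of this repository, and the query recomputes from scratch rather than incrementally, which is the part a real benchmark implementation would be exercising.

```rust
use std::collections::{BTreeMap, BTreeSet};

/// Hypothetical stand-in for a social graph: person id -> ids of people they know.
#[derive(Default)]
pub struct SocialGraph {
    knows: BTreeMap<u64, BTreeSet<u64>>,
}

impl SocialGraph {
    /// Update operation: insert an undirected "knows" edge between `a` and `b`.
    pub fn insert_knows(&mut self, a: u64, b: u64) {
        self.knows.entry(a).or_default().insert(b);
        self.knows.entry(b).or_default().insert(a);
    }

    /// Read query: everyone within `hops` edges of `start`, excluding `start` itself.
    pub fn neighbourhood(&self, start: u64, hops: usize) -> BTreeSet<u64> {
        let mut seen = BTreeSet::from([start]);
        let mut frontier = vec![start];
        for _ in 0..hops {
            let mut next = Vec::new();
            for person in &frontier {
                if let Some(friends) = self.knows.get(person) {
                    for &friend in friends {
                        // `insert` returns true only the first time `friend` is seen.
                        if seen.insert(friend) {
                            next.push(friend);
                        }
                    }
                }
            }
            frontier = next;
        }
        seen.remove(&start);
        seen
    }
}
```

Interleaving calls to `insert_knows` with calls to `neighbourhood` mimics, at toy scale, the mix of continuous inserts and neighbourhood reads described above. The actual SNB queries are considerably richer (filters, ordering, multiple entity types).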

These are a really good mix of both static and streaming datasets, which is exactly what we need. There is prior art from Frank McSherry with DDFlow.

Kixiron, Jul 08 '22 20:07

Is this completed?

mihaibudiu, Jan 12 '23 17:01

> Is this completed?

@Kixiron implemented a few of these benchmarks, but I think there are more we may want to add in the future. Not a high-priority issue though.

ryzhyk, Jan 12 '23 17:01