.	.
Institute	Eastern Switzerland University of Applied Science
Program	MSE Computer Science
Module	DB Seminar
Author	Roman Bögli
Supervisor	Prof. Stefan F. Keller
Date, Time	13. June 2022, 4pm
Project Page	GitHub

Declaration	Substitution
`{{.Iter}}`	Counter that starts with 1 and ends with the specified iteration count of the given benchmark.
`{{call .RandInt64}}`	Returns a random non-negative value of type Int64.
`{{call .RandFloat64}}`	Returns a random value within the interval [0.0,1.0) as Float64.
`{{call .RandIntBetween 1 42}}`	Returns a random integer between 1 and 42 (Int32).
`{{call .RandFloatBetween 0.8 9.9}}`	Returns a random float between 0.8 and 9.9 (Float64).
`{{call .RandString 1 9}}`	Returns a random string with a length between 1 and 9 characters.
`{{call .RandDate}}`	Returns a random date as string (yyyy-MM-dd) between `1970-01-01` and `2023-01-01`.

Part	Benchmark	Tasks
0	`initialize`	Drop all possibly existing data and recreate the root node called "BigBoss"
1	`insert_employee`	Inserts further nodes that are connected to randomly chosen existing nodes. The number of iterations equals 100% of the specified iteration count.
2	`select_before_index`	Subsequent query all existing nodes and return the node itself together with all its connected nodes (i.e. its subordinate employees). No index exists at this stage. The number of iterations equals 100% of the specified iteration count.
3	`create_index`	Creating a so-called BTREE index on the entity's relationship indicator (i.e. foreign key in relational DBMS, resp. relationship itself in graph-based DBMS).
4	`clear_cache`	All cached data is discarded.
5	`select_after_index`	The identical querying tasks as in Part 2 is repeated.
6	`clean`	Complete removal of existing data and index information.

References

Bechberger, D., & Perryman, J. (2020). Graph databases in Action: Examples in Gremlin. Manning.
Bush, J. (2020). Learn SQL Database Programming: Query and manipulate databases from popular relational database servers using SQL.
Chauhan, C., & Kumar, D. (2017). PostgreSQL High Performance Cookbook: Mastering query optimization, database monitoring, and performance-tuning for PostgreSQL. Packt Publishing.
Codd, E. F. (2002). A Relational Model of Data for Large Shared Data Banks. In M. Broy & E. Denert (Eds.), Software Pioneers (pp. 263–294). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-59412-0_16
Elmasri, R., & Navathe, S. (2011). Fundamentals of Database Systems (6th ed). Addison-Wesley.
Fleming, P. J., & Wallace, J. J. (1986). How not to lie with statistics: The correct way to summarize benchmark results. Communications of the ACM, 29(3), 218–221. https://doi.org/10.1145/5666.5673
Gray, J. (Ed.). (1994). The Benchmark Handbook for Database and Transaction Processing Systems (2. ed., 2. [print.]). Morgan Kaufmann.
Gregg, B. (2020). Systems Performance: Enterprise and the Cloud (Second). Addison-Wesley.
Meier, A., & Kaufmann, M. (2019). SQL & NoSQL Databases: Models, Languages, Consistency Options and Architectures for Big Data Management. Springer Vieweg.
Needham, M., & Hodler, A. E. (2019). Graph Algorithms: Practical Examples in Apache Spark and Neo4j (First edition). O’Reilly Media.
Peixoto, T. P. (n.d.). What is graph-tool? Graph-Tool. Retrieved 20 March 2022, from https://graph-tool.skewed.de/
Robinson, I., Webber, J., & Eifrem, E. (2015). Graph Databases: New Opportunities for Connected Data.
Scalzo, B. (2018). Database Benchmarking and Stress Testing: An Evidence-Based Approach to Decisions on Architecture and Technology. Springer Science+Business Media, LLC.
Stopford, B. (2012, August 17). Thinking in Graphs: Neo4J. http://www.benstopford.com/2012/08/17/thinking-in-graphs-neo4j/
Turner-Trauring, I. (2021, May 12). Docker can slow down your code and distort your benchmarks. Python=>Speed. https://pythonspeed.com/articles/docker-performance-overhead/

Automated Database Benchmarking Tool

Performance Analysis of MySQL, PostgreSQL and Neo4j using Different Data Scenarios

Content

Relational DBMS

Graph-Based DBMS

Query Languages

System Setup

Command Line Interface (CLI)

Possilbe CLI Commands

Statement Substitutions

Example

Substitution Possibilities

Custom Script (`merchant`)

Custom Script (`employees`)

Further Automation

Result Analysis

Showcase `employees`

Showcase Results (1/3)

Showcase Results (2/3)

Showcase Results (3/3)

Conclusion

Future Work

References

Thanks

Automated Database Benchmarking Tool

Performance Analysis of MySQL, PostgreSQL and Neo4j using Different Data Scenarios

Content

Relational DBMS

Graph-Based DBMS

Query Languages

System Setup

Command Line Interface (CLI)

Possilbe CLI Commands

Statement Substitutions

Example

Substitution Possibilities

Custom Script (merchant)

Custom Script (employees)

Further Automation

Result Analysis

Showcase employees

Showcase Results (1/3)

Showcase Results (2/3)

Showcase Results (3/3)

Conclusion

Future Work

References

Thanks

Custom Script (`merchant`)

Custom Script (`employees`)

Showcase `employees`