Baseline Experiments. #345

amlatyrngom · 2023-11-02T14:02:38Z

No description provided.

geoffxy

Thanks @amlatyrngom. Can you test this out on the 100 GB dataset with the scale down workload (experiments/15-e2e-scenarios-v2/scale_down/run_workload.sh)? The script assumes it is being run by a tool called Conductor. The easiest way to run it without Conductor for testing is to set the COND_OUT environment variable to the path where you want the results stored.

geoffxy · 2023-11-07T04:40:13Z

tests/test_plan_parsing.py

+@pytest.mark.skip(
+    reason="TODO(Amadou): This is failing even I haven't changed it. Flaky test?"
+)


This test should be fixed now - you can remove this decorator and the test should pass.

geoffxy · 2023-11-07T04:44:36Z

load_baseline.py

@@ -0,0 +1,100 @@
+# See workloads/cross_db_benchmark/benchmark_tools/tidb/README.md
+


Please move this file into workloads/IMDB_extended

geoffxy · 2023-11-07T04:48:24Z

workloads/IMDB_extended/workload_utils/database.py

+                cursor = self._cursor
+            # Exec
+            cursor.execute(query)
+            if cursor.rowcount is None or cursor.rowcount <= 0 or not(query.strip().lower().startswith("SELECT")):


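query.strip().lower().startswith("SELECT") is always going to be false? The query is lowercased before it is compared against the uppercase "SELECT", so the prefix can never match.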
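A minimal sketch of the issue and one possible fix. The helper name is_select is hypothetical and is not part of the PR; it only illustrates comparing against a lowercase prefix.

```python
def is_select(query: str) -> bool:
    """Hypothetical helper: True if the statement starts with SELECT,
    ignoring case and leading whitespace."""
    return query.strip().lower().startswith("select")


# The check from the diff: a lowercased string never starts with "SELECT".
original_check = "SELECT 1".strip().lower().startswith("SELECT")

# The same check against a lowercase prefix.
fixed_check = is_select("  SELECT 1")
```

With the original check, the `not(...)` term is always true, so the condition takes the `rows = []` branch for every query, including SELECTs.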

geoffxy · 2023-11-07T04:49:40Z

workloads/IMDB_extended/workload_utils/database.py

+            # Get cursor
+            if self._cursor is None:
+                had_cursor = False
+                cursor = self._conn.cursor()
+            else:
+                had_cursor = True
+                cursor = self._cursor
+            # Exec
+            cursor.execute(query)


Are these changes to support a MySQL connector? If so, please just create another Database class that is specialized for MySQL instead of modifying this class.
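One way this could look, as a rough sketch only. The class name DbApiDatabase and the method name execute_sync are assumptions for illustration, not the existing interface in database.py. The sketch relies only on the generic Python DB-API (PEP 249), and uses sqlite3 as a stand-in connection so that it runs without a MySQL server.

```python
import sqlite3
from typing import Any, List, Tuple


class DbApiDatabase:
    """Hypothetical wrapper around any PEP 249 connection (e.g. a MySQL
    connector), kept separate from the existing Database class."""

    def __init__(self, conn: Any) -> None:
        self._conn = conn

    def execute_sync(self, query: str) -> List[Tuple[Any, ...]]:
        cursor = self._conn.cursor()
        try:
            cursor.execute(query)
            # Per PEP 249, description is None for statements that do not
            # return rows, so no string matching on the query is needed.
            if cursor.description is None:
                return []
            return list(cursor.fetchall())
        finally:
            cursor.close()


# Stand-in usage with an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
db = DbApiDatabase(conn)
db.execute_sync("CREATE TABLE t (x INTEGER)")
db.execute_sync("INSERT INTO t VALUES (1), (2)")
rows = db.execute_sync("SELECT x FROM t ORDER BY x")
```

Keeping the connector-specific cursor handling in its own class leaves the existing Database class untouched.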

geoffxy · 2023-11-07T04:50:16Z

workloads/IMDB_extended/workload_utils/database.py

+            if cursor.rowcount is None or cursor.rowcount <= 0 or not(query.strip().lower().startswith("SELECT")):
+                rows = []
+            else:
+                print(f"Rows: {cursor.rowcount}. Q: {query}")


nit: Please use logger.debug() instead of print(). Otherwise this will get very verbose during a workload run.
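A minimal sketch of the suggestion, assuming a module-level logger created with the standard logging module. The function name log_row_count is hypothetical and exists only to make the example self-contained.

```python
import logging

# Assumed module-level logger; the real module may already define one.
logger = logging.getLogger(__name__)


def log_row_count(rowcount: int, query: str) -> None:
    # Emitted only when the logger is configured at DEBUG level, so a
    # normal workload run stays quiet.
    logger.debug("Rows: %d. Q: %s", rowcount, query)
```

Passing the values as arguments rather than using an f-string means the message is only formatted when debug logging is enabled.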

geoffxy · 2023-11-07T04:56:37Z

workloads/IMDB_extended/run_transactions.py

+        dataset_type = "20gb"
+    else:
+        dataset_type = "original"
+    worker = TransactionWorker(worker_idx, args.seed ^ worker_idx, args.scale_factor, dataset_type=dataset_type)


Whoops - thanks for catching this. Instead of setting dataset_type here based on flags, I'd suggest we make it an argparse argument instead.
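Something along these lines, as a sketch. The flag name --dataset-type, its help text, and the list of choices are assumptions; the choices only mirror the two values visible in the diff above, and the real script may need more.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    parser.add_argument("--scale-factor", type=int, default=1)
    # Hypothetical flag replacing the flag-derived dataset_type.
    parser.add_argument(
        "--dataset-type",
        type=str,
        choices=["original", "20gb"],
        default="original",
        help="Dataset the transactions run against.",
    )
    return parser


# Example invocation; argparse exposes the value as args.dataset_type.
args = build_parser().parse_args(["--dataset-type", "20gb"])
```

The parsed value could then be passed straight through as dataset_type=args.dataset_type when constructing the worker.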

geoffxy · 2023-11-07T04:56:51Z

workloads/IMDB_extended/run_transactions.py

    parser.add_argument(
        "--scale-factor",
        type=int,
-        default=1,
+        default=6,


nit: Please keep this at 1 for now.

geoffxy · 2023-11-07T05:00:49Z

load_baseline.py

+if __name__ == "__main__":
+    yaml_main()
+    sys.exit(0)


Is this still needed?

geoffxy · 2023-11-07T05:04:04Z

workloads/IMDB_extended/run_transactions.py

+    parser.add_argument(
+        "--output-dir",
+        type=str,
+        default=".",
+        help="Environment variable that stores the output directory of tidb bench",
+    )


Is this still used? Also, the help string seems incorrect: the default is a path, not an environment variable.

geoffxy · 2024-09-11T20:49:33Z

Closing since I think the intention was to not merge given the code divergence.

amlatyrngom added 17 commits July 14, 2023 09:37

Enhance provisioning.

d971c15

Enhance Provisioning.

bf4ba31

Add TIDB Setup and Connection

a1a9603

Merge remote-tracking branch 'origin/main' into aln-provisioning

01993b7

Merge remote-tracking branch 'origin/main' into aln-provisioning

db6b608

TiDB Benchmarking

c9bc6c7

Fix linting.

bc5a137

Fix tests.

26a1e9f

Fixes

30a3eb5

Setting up baseline

9f0b3ab

Baseline txns

617f421

Fixing merge conflicts

4d1314e

Merge remote-tracking branch 'origin/main' into aln-provisioning

d7ad9d2

Fix seq num problem

9ca64ab

Adding downscale expts

47c2258

Baseline expts

4f39e8b

Reconcile baseline code.

2470049

amlatyrngom requested a review from geoffxy November 2, 2023 14:02

geoffxy reviewed Nov 7, 2023

amlatyrngom added 11 commits November 9, 2023 11:10

Manual scale down expt

b5e4054

Add tidb and daylong

868417f

Add sine wave to txns.

eb49c1e

Linting fixes

e98c2d3

Modifying expts

d338d8e

Baseline expts

75c8d64

Add ad hoc queries and scale up.

23ad0dd

Downscale

d0dd0e9

Fix ad hoc path

3e4d9de

Add specialized expts

377d367

Update specialized workload.

b7f44a2

amlatyrngom added 13 commits December 13, 2023 18:13

Checking in scale up expts.

ad44ee2

Starting daylong.

7044c4f

Starting daylong.

d55904e

Merge remote-tracking branch 'origin/main' into aln-provisioning

1392296

Fix merge issues

5f20b33

Update expts

c59ba05

Update expts

92a2fe0

Update expts

d4c3856

Update expts

a8d31b5

Fix expts

f0b73ca

Fix expts

619efb0

Cause redshift scaleup

7e088d8

Change issue slots

07f634d

amlatyrngom force-pushed the aln-provisioning branch from 662f55c to 07f634d Compare January 25, 2024 07:12

amlatyrngom added 10 commits January 26, 2024 08:09

Merge remote-tracking branch 'origin/main' into aln-provisioning

fb47d92

Add new trace

3ecc61f

Add new trace

225a6dc

Merge remote-tracking branch 'origin/main' into aln-provisioning

9d68ec6

Add slo change expt.

4ca9d6e

New scale down

6891b5f

Adding chbench

2948b5e

Merge remote-tracking branch 'origin/main' into aln-provisioning

0792ddd

Running chbench

97765f0

Fix tidb chbench

7433714

geoffxy closed this Sep 11, 2024
