chore: Pycasbin benchmark #199

sheny1xuan · 2022-04-10T05:32:52Z

Add a benchmark for the pycasbin binding: Benchmarking pycasbin's performance to compare it with Python's casbin implementation #181.
Pycasbin binding results on my local machine: the stddev is greater than the mean, so the numbers are invalid.

pybind11 heavily relies on a template matching mechanism to convert parameters and return values that are constructed from STL data types such as vectors, linked lists, hash tables, etc. This even works in a recursive manner, for instance to deal with lists of hash maps of pairs of elementary and custom types, etc.
However, a fundamental limitation of this approach is that internal conversions between Python and C++ types involve a copy operation that prevents pass-by-reference semantics.

The current pybind11 binding copies each Python list into a vector, which is inefficient. I think wrapping DataVector and DataList for Python would help. We can refer to https://pybind11.readthedocs.io/en/stable/advanced/cast/stl.html.
One more thing: casbin-cpp's current efficiency is similar to casbin-golang's. Can we make it better? I think I need some help with the deeper optimization. @EmperorYP7 @hsluoyz

Signed-off-by: stonex <[email protected]>

casbin-bot · 2022-04-10T05:32:57Z

@EmperorYP7 @divy9881 @noob20000405 please review

hsluoyz · 2022-04-10T08:41:52Z

@EmperorYP7 plz review

EmperorYP7 · 2022-04-12T07:02:50Z

pycasbin/README.md

@@ -82,6 +82,12 @@ def isAuthorized(req):

 Rest of the method's name is on par with `casbin-cpp`.

+### Benchmark
+
+Pycasbin use `pytest` for benchmark.


Suggested change

Pycasbin use `pytest` for benchmark.

Pycasbin use `pytest` for benchmark.

`pip3 install -U pytest`

EmperorYP7 · 2022-04-12T07:05:01Z

pycasbin/README.md

+
+Pycasbin use `pytest` for benchmark.
+
+Install `pytest` and `pycasbin` in your local machine, then run the benchmark by `python3 -m pytest --benchmark-verbose --benchmark-columns=mean,stddev,iqr,ops,rounds casbin-cpp/pycasbin/benchmarks/benchmark_model.py`.


Do this instead:

python3 -m pytest --benchmark-verbose --benchmark-columns=mean,stddev,iqr,ops,rounds casbin-cpp/pycasbin/benchmarks/benchmark_model.py

EmperorYP7 · 2022-04-12T07:18:13Z

pycasbin/benchmarks/benchmark_model.py

+        e.enforce("user501", "data9", "read")
+
+
+# TODO: pycasbin cost too much time in large model


Does it crash often?

How about we set up a flag through an OS env variable for intensive testing?

It takes too much time on my machine; it doesn't crash.
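
A minimal sketch of how the environment-variable flag suggested above might look. The variable name `PYCASBIN_BENCHMARK_INTENSIVE` and the helper `intensive_enabled` are hypothetical; nothing in this thread fixes a name, and this is not part of pycasbin.

```python
import os

# Hypothetical variable name; the thread does not specify one.
INTENSIVE_ENV_VAR = "PYCASBIN_BENCHMARK_INTENSIVE"


def intensive_enabled(environ=None):
    """Return True when the opt-in variable is set to a truthy value."""
    env = os.environ if environ is None else environ
    value = env.get(INTENSIVE_ENV_VAR, "").strip().lower()
    return value in {"1", "true", "yes", "on"}


# In a benchmark module, the slow large-model cases could then be guarded
# with pytest's standard skip marker, for example:
#
#   @pytest.mark.skipif(not intensive_enabled(),
#                       reason="set PYCASBIN_BENCHMARK_INTENSIVE=1 to run")
#   def test_benchmark_large_model(benchmark): ...
```

With this approach the default run stays fast, and the large-model benchmarks run only when someone opts in, for instance by exporting the variable before invoking `pytest`.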

EmperorYP7 · 2022-04-12T07:32:07Z

> The current pybind11 binding copies each Python list into a vector, which is inefficient. I think wrapping DataVector and DataList for Python would help.

Flagging these data types as opaque might be one way to go.

> One more thing: casbin-cpp's current efficiency is similar to casbin-golang's. Can we make it better? I think I need some help with the deeper optimization.

Let's profile the current casbin-cpp and pycasbin setup. I suspect casbin-cpp's performance depends largely on the performance of the underlying expression library (ExprTk).

There is scope for some micro-optimizations that I can think of, but we'd need a thorough look into the code to make a significant difference in performance.

hsluoyz · 2022-04-12T13:16:30Z

@sheny1xuan

ArashPartow · 2022-04-14T03:24:42Z

@sheny1xuan in the first set of timing numbers, the stddev is greater than the mean.

This usually indicates invalid stats or the possibility of at least a bimodal (or more) distribution.
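
To illustrate the point, here is a small sketch with made-up timings (not the numbers from this benchmark). It assumes a hypothetical run where most calls are fast and a few are very slow, which is one way a two-mode distribution can push the stddev above the mean.

```python
import statistics

# Hypothetical timings in seconds: 95 fast calls and 5 slow outliers.
fast = [0.001] * 95
slow = [0.200] * 5
timings = fast + slow

mean = statistics.mean(timings)
stddev = statistics.stdev(timings)   # sample standard deviation
median = statistics.median(timings)

# The few slow calls dominate both the mean and the stddev,
# while the median still reflects the typical (fast) call.
print(f"mean={mean:.5f}s stddev={stddev:.5f}s median={median:.5f}s")
print("stddev > mean:", stddev > mean)
```

In a case like this the median and IQR describe the typical call better than the mean does, which is why it helps to look at more than one column of the benchmark output.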

hsluoyz · 2022-04-14T05:44:20Z

@sheny1xuan

sheny1xuan · 2022-04-14T07:08:35Z

> @sheny1xuan in the first set of timing numbers, the stddev is greater than the mean.
>
> This usually indicates invalid stats or the possibility of at least a bimodal (or more) distribution.

Oh, I only paid attention to the mean, but the stddev is also very important in a benchmark. The above comparison of pycasbin and the casbin-cpp binding is invalid: both sets of benchmark data come from the casbin-cpp binding, because I forgot to change the package install environment.
I reran it on my local machine, and the stddev is still greater than the mean. A stddev greater than the mean suggests the measurements are unstable. I think there may be something wrong in the binding or benchmarking code.

stonex added 2 commits April 10, 2022 10:49

chore: add benchmark for pycasbin

b7c2c7b

Signed-off-by: stonex <[email protected]>

chore: add usage of pycasbin benchmark.

c2baac2

casbin-bot requested review from divy9881, EmperorYP7 and noob20000405 April 10, 2022 05:32

EmperorYP7 requested changes Apr 12, 2022


sheny1xuan closed this Apr 26, 2022
