Skip to content

Latest commit

 

History

History
87 lines (53 loc) · 3.48 KB

readme.md

File metadata and controls

87 lines (53 loc) · 3.48 KB

rho_reduce

Purpose: Testing algorithms for fast computation of partial trace operations, and benchmarks thereof.

Outcome: Partial traces are expensive. Extensive tensor reshaping is unavoidable in the mathematically elegant solution. Implementing a more complex algorithm involves fewer floating point operations, but fails to take advantage of hardware acceleration for linear algebra. If you need to repeatedly compute partial traces of a large tensor, doing so recursively can be very advantageous. I suspect there may be another, potentially large, speedup, that takes advantage of the best of both worlds.

Getting Started

Prerequisites

  • MATLAB/Octave (Todo: Verify .m files work in octave)
  • python 3.6

Installing

Install with

git clone --recursive git://github.com/groundhogstate/rho_reduce.git

Update with

git submodule update --recursive --remote

Running the tests

See TrC_examples.m linkfor examples in MATLAB syntax.

The alternative function ptrace() will allow the same syntax.

The two methods are compared in ptrace_vs_TrX.m, with MATLAB results

trace_profile

From my i7 quad-core in factory standard HP Probook 360.

Usage

About

See for Jonas Maziero's paper (\doc\mazerio_computing_partial_traces.pdf) which describes the implementation in ptrace.m.

The function TrX(rho,sys,dim) accepts the density matrix (rho) of a multipartite system with N elements each with dimension d_i, i=1:N. The total Hilbert space has therefore prod(d_i,i=1:N) dimensions. The function computes the trace 'over' the systems specified by index in (sys), and returns a density matrix of the remaning systems ~(idx \cap sys). The vector (dim) specifies the dimensions, as required for the permutation of the density matrix into a product of non-square matrices, reducing the partial trace to partial inner product.

The function ptrace(rho,sys,dim) performs the same operation, but does so by computing the reduced state one matrix element at a time. Each element is produced in (I suspect) the optimal number of operations for promise-free well-conditioned matrices. It looks suspiciously vectorizable, and easily parallelized.

In the end, Toby's vectorized code beats my implementation of the iterative algorithm that Jonas describes. Toby's executes quickly in MATLAB because of the zero-cost commands reshape and permute, and calling LAPACK to compute the linear algebra.

However, there are many fewer operations in the iterative algorithm.

Built With

Workflow

  • Atom extensions:
    • Hydrogen
    • markdown-preview-plus

Contributing

Fork, edit, submit pull request. Don't break stuff.

Versioning

Authors

Jacob Ross groundhogstate

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments

TODO

  • Port to python
  • Port to C++
  • Scaling analysis
  • Test compiled versions
  • Recursive many-body traces
  • Benefits of parallelism?