POT: Python Optimal Transport ============================= |PyPI version| |Anaconda Cloud| |Build Status| |Codecov Status| |Downloads| |Anaconda downloads| |License| This open source Python library provide several solvers for optimization problems related to Optimal Transport for signal, image processing and machine learning. Website and documentation: https://PythonOT.github.io/ Source Code (MIT): https://github.com/PythonOT/POT POT provides the following generic OT solvers (links to examples): - `OT Network Simplex solver `__ for the linear program/ Earth Movers Distance [1] . - `Conditional gradient `__ [6] and `Generalized conditional gradient `__ for regularized OT [7]. - Entropic regularization OT solver with `Sinkhorn Knopp Algorithm `__ [2] , stabilized version [9] [10], greedy Sinkhorn [22] and `Screening Sinkhorn [26] `__ with optional GPU implementation (requires cupy). - Bregman projections for `Wasserstein barycenter `__ [3], `convolutional barycenter `__ [21] and unmixing [4]. - Sinkhorn divergence [23] and entropic regularization OT from empirical data. - `Smooth optimal transport solvers `__ (dual and semi-dual) for KL and squared L2 regularizations [17]. - Non regularized `Wasserstein barycenters [16] `__) with LP solver (only small scale). - `Gromov-Wasserstein distances `__ and `GW barycenters `__ (exact [13] and regularized [12]) - `Fused-Gromov-Wasserstein distances solver `__ and `FGW barycenters `__ [24] - `Stochastic solver `__ for Large-scale Optimal Transport (semi-dual problem [18] and dual problem [19]) - Non regularized `free support Wasserstein barycenters `__ [20]. - `Unbalanced OT `__ with KL relaxation and `barycenter `__ [10, 25]. - `Partial Wasserstein and Gromov-Wasserstein `__ (exact [29] and entropic [3] formulations). POT provides the following Machine Learning related solvers: - `Optimal transport for domain adaptation `__ with `group lasso regularization `__, `Laplacian regularization `__ [5] [30] and `semi supervised setting `__. - `Linear OT mapping `__ [14] and `Joint OT mapping estimation `__ [8]. - `Wasserstein Discriminant Analysis `__ [11] (requires autograd + pymanopt). - `JCPOT algorithm for multi-source domain adaptation with target shift `__ [27]. Some other examples are available in the `documentation `__. Using and citing the toolbox ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ If you use this toolbox in your research and find it useful, please cite POT using the following reference: :: Rémi Flamary and Nicolas Courty, POT Python Optimal Transport library, Website: https://pythonot.github.io/, 2017 In Bibtex format: :: @misc{flamary2017pot, title={POT Python Optimal Transport library}, author={Flamary, R{'e}mi and Courty, Nicolas}, url={https://pythonot.github.io/}, year={2017} } Installation ------------ The library has been tested on Linux, MacOSX and Windows. It requires a C++ compiler for building/installing the EMD solver and relies on the following Python modules: - Numpy (>=1.16) - Scipy (>=1.0) - Cython (>=0.23) - Matplotlib (>=1.5) Pip installation ^^^^^^^^^^^^^^^^ Note that due to a limitation of pip, ``cython`` and ``numpy`` need to be installed prior to installing POT. This can be done easily with :: pip install numpy cython You can install the toolbox through PyPI with: :: pip install POT or get the very latest version by running: :: pip install -U https://github.com/PythonOT/POT/archive/master.zip # with --user for user install (no root) Anaconda installation with conda-forge ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ If you use the Anaconda python distribution, POT is available in `conda-forge `__. To install it and the required dependencies: :: conda install -c conda-forge pot Post installation check ^^^^^^^^^^^^^^^^^^^^^^^ After a correct installation, you should be able to import the module without errors: .. code:: python import ot Note that for easier access the module is name ot instead of pot. Dependencies ~~~~~~~~~~~~ Some sub-modules require additional dependences which are discussed below - **ot.dr** (Wasserstein dimensionality reduction) depends on autograd and pymanopt that can be installed with: :: pip install pymanopt autograd - **ot.gpu** (GPU accelerated OT) depends on cupy that have to be installed following instructions on `this page `__. obviously you need CUDA installed and a compatible GPU. Examples -------- Short examples ~~~~~~~~~~~~~~ - Import the toolbox .. code:: python import ot - Compute Wasserstein distances .. code:: python # a,b are 1D histograms (sum to 1 and positive) # M is the ground cost matrix Wd=ot.emd2(a,b,M) # exact linear program Wd_reg=ot.sinkhorn2(a,b,M,reg) # entropic regularized OT # if b is a matrix compute all distances to a and return a vector - Compute OT matrix .. code:: python # a,b are 1D histograms (sum to 1 and positive) # M is the ground cost matrix T=ot.emd(a,b,M) # exact linear program T_reg=ot.sinkhorn(a,b,M,reg) # entropic regularized OT - Compute Wasserstein barycenter .. code:: python # A is a n*d matrix containing d 1D histograms # M is the ground cost matrix ba=ot.barycenter(A,M,reg) # reg is regularization parameter Examples and Notebooks ~~~~~~~~~~~~~~~~~~~~~~ The examples folder contain several examples and use case for the library. The full documentation with examples and output is available on https://PythonOT.github.io/. Acknowledgements ---------------- This toolbox has been created and is maintained by - `Rémi Flamary `__ - `Nicolas Courty `__ The contributors to this library are - `Alexandre Gramfort `__ (CI, documentation) - `Laetitia Chapel `__ (Partial OT) - `Michael Perrot `__ (Mapping estimation) - `Léo Gautheron `__ (GPU implementation) - `Nathalie Gayraud `__ (DA classes) - `Stanislas Chambon `__ (DA classes) - `Antoine Rolet `__ (EMD solver debug) - Erwan Vautier (Gromov-Wasserstein) - `Kilian Fatras `__ (Stochastic solvers) - `Alain Rakotomamonjy `__ - `Vayer Titouan `__ (Gromov-Wasserstein -, Fused-Gromov-Wasserstein) - `Hicham Janati `__ (Unbalanced OT) - `Romain Tavenard `__ (1d Wasserstein) - `Mokhtar Z. Alaya `__ (Screenkhorn) - `Ievgen Redko `__ (Laplacian DA, JCPOT) This toolbox benefit a lot from open source research and we would like to thank the following persons for providing some code (in various languages): - `Gabriel Peyré `__ (Wasserstein Barycenters in Matlab) - `Nicolas Bonneel `__ ( C++ code for EMD) - `Marco Cuturi `__ (Sinkhorn Knopp in Matlab/Cuda) Contributions and code of conduct --------------------------------- Every contribution is welcome and should respect the `contribution guidelines `__. Each member of the project is expected to follow the `code of conduct `__. Support ------- You can ask questions and join the development discussion: - On the `POT Slack channel `__ - On the POT `mailing list `__ You can also post bug reports and feature requests in Github issues. Make sure to read our `guidelines `__ first. References ---------- [1] Bonneel, N., Van De Panne, M., Paris, S., & Heidrich, W. (2011, December). `Displacement interpolation using Lagrangian mass transport `__. In ACM Transactions on Graphics (TOG) (Vol. 30, No. 6, p. 158). ACM. [2] Cuturi, M. (2013). `Sinkhorn distances: Lightspeed computation of optimal transport `__. In Advances in Neural Information Processing Systems (pp. 2292-2300). [3] Benamou, J. D., Carlier, G., Cuturi, M., Nenna, L., & Peyré, G. (2015). `Iterative Bregman projections for regularized transportation problems `__. SIAM Journal on Scientific Computing, 37(2), A1111-A1138. [4] S. Nakhostin, N. Courty, R. Flamary, D. Tuia, T. Corpetti, `Supervised planetary unmixing with optimal transport `__, Whorkshop on Hyperspectral Image and Signal Processing : Evolution in Remote Sensing (WHISPERS), 2016. [5] N. Courty; R. Flamary; D. Tuia; A. Rakotomamonjy, `Optimal Transport for Domain Adaptation `__, in IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.PP, no.99, pp.1-1 [6] Ferradans, S., Papadakis, N., Peyré, G., & Aujol, J. F. (2014). `Regularized discrete optimal transport `__. SIAM Journal on Imaging Sciences, 7(3), 1853-1882. [7] Rakotomamonjy, A., Flamary, R., & Courty, N. (2015). `Generalized conditional gradient: analysis of convergence and applications `__. arXiv preprint arXiv:1510.06567. [8] M. Perrot, N. Courty, R. Flamary, A. Habrard (2016), `Mapping estimation for discrete optimal transport `__, Neural Information Processing Systems (NIPS). [9] Schmitzer, B. (2016). `Stabilized Sparse Scaling Algorithms for Entropy Regularized Transport Problems `__. arXiv preprint arXiv:1610.06519. [10] Chizat, L., Peyré, G., Schmitzer, B., & Vialard, F. X. (2016). `Scaling algorithms for unbalanced transport problems `__. arXiv preprint arXiv:1607.05816. [11] Flamary, R., Cuturi, M., Courty, N., & Rakotomamonjy, A. (2016). `Wasserstein Discriminant Analysis `__. arXiv preprint arXiv:1608.08063. [12] Gabriel Peyré, Marco Cuturi, and Justin Solomon (2016), `Gromov-Wasserstein averaging of kernel and distance matrices `__ International Conference on Machine Learning (ICML). [13] Mémoli, Facundo (2011). `Gromov–Wasserstein distances and the metric approach to object matching `__. Foundations of computational mathematics 11.4 : 417-487. [14] Knott, M. and Smith, C. S. (1984).`On the optimal mapping of distributions `__, Journal of Optimization Theory and Applications Vol 43. [15] Peyré, G., & Cuturi, M. (2018). `Computational Optimal Transport `__ . [16] Agueh, M., & Carlier, G. (2011). `Barycenters in the Wasserstein space `__. SIAM Journal on Mathematical Analysis, 43(2), 904-924. [17] Blondel, M., Seguy, V., & Rolet, A. (2018). `Smooth and Sparse Optimal Transport `__. Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics (AISTATS). [18] Genevay, A., Cuturi, M., Peyré, G. & Bach, F. (2016) `Stochastic Optimization for Large-scale Optimal Transport `__. Advances in Neural Information Processing Systems (2016). [19] Seguy, V., Bhushan Damodaran, B., Flamary, R., Courty, N., Rolet, A.& Blondel, M. `Large-scale Optimal Transport and Mapping Estimation `__. International Conference on Learning Representation (2018) [20] Cuturi, M. and Doucet, A. (2014) `Fast Computation of Wasserstein Barycenters `__. International Conference in Machine Learning [21] Solomon, J., De Goes, F., Peyré, G., Cuturi, M., Butscher, A., Nguyen, A. & Guibas, L. (2015). `Convolutional wasserstein distances: Efficient optimal transportation on geometric domains `__. ACM Transactions on Graphics (TOG), 34(4), 66. [22] J. Altschuler, J.Weed, P. Rigollet, (2017) `Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration `__, Advances in Neural Information Processing Systems (NIPS) 31 [23] Aude, G., Peyré, G., Cuturi, M., `Learning Generative Models with Sinkhorn Divergences `__, Proceedings of the Twenty-First International Conference on Artficial Intelligence and Statistics, (AISTATS) 21, 2018 [24] Vayer, T., Chapel, L., Flamary, R., Tavenard, R. and Courty, N. (2019). `Optimal Transport for structured data with application on graphs `__ Proceedings of the 36th International Conference on Machine Learning (ICML). [25] Frogner C., Zhang C., Mobahi H., Araya-Polo M., Poggio T. (2015). `Learning with a Wasserstein Loss `__ Advances in Neural Information Processing Systems (NIPS). [26] Alaya M. Z., Bérar M., Gasso G., Rakotomamonjy A. (2019). `Screening Sinkhorn Algorithm for Regularized Optimal Transport `__, Advances in Neural Information Processing Systems 33 (NeurIPS). [27] Redko I., Courty N., Flamary R., Tuia D. (2019). `Optimal Transport for Multi-source Domain Adaptation under Target Shift `__, Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics (AISTATS) 22, 2019. [28] Caffarelli, L. A., McCann, R. J. (2010). `Free boundaries in optimal transport and Monge-Ampere obstacle problems `__, Annals of mathematics, 673-730. [29] Chapel, L., Alaya, M., Gasso, G. (2019). `Partial Gromov-Wasserstein with Applications on Positive-Unlabeled Learning `__, arXiv preprint arXiv:2002.08276. [30] Flamary R., Courty N., Tuia D., Rakotomamonjy A. (2014). `Optimal transport with Laplacian regularization: Applications to domain adaptation and shape matching `__, NIPS Workshop on Optimal Transport and Machine Learning OTML, 2014. .. |PyPI version| image:: https://badge.fury.io/py/POT.svg :target: https://badge.fury.io/py/POT .. |Anaconda Cloud| image:: https://anaconda.org/conda-forge/pot/badges/version.svg :target: https://anaconda.org/conda-forge/pot .. |Build Status| image:: https://github.com/PythonOT/POT/workflows/build/badge.svg :target: https://github.com/PythonOT/POT/actions .. |Codecov Status| image:: https://codecov.io/gh/PythonOT/POT/branch/master/graph/badge.svg :target: https://codecov.io/gh/PythonOT/POT .. |Downloads| image:: https://pepy.tech/badge/pot :target: https://pepy.tech/project/pot .. |Anaconda downloads| image:: https://anaconda.org/conda-forge/pot/badges/downloads.svg :target: https://anaconda.org/conda-forge/pot .. |License| image:: https://anaconda.org/conda-forge/pot/badges/license.svg :target: https://github.com/PythonOT/POT/blob/master/LICENSE