Distributed Machine Learning

Distributed Optimization

2016 ICDM Efficient Distributed SGD with Variance Reduction
2016 KDD Robust Large-Scale Machine Learning in the Cloud
2015 KDD Netowrk Lasso: Clustering and Optimization in Large Graphs
2012 JMLR Distributed Learning, Communication Complexity and Privacy
2012 AISTATS Protocols for Learning Classifiers on Distributed Data
2010 NIPS Parallelized Stochastic Gradient Descent | video (One-Short)
2010 NAACL Distributed Training Strategies for the Structured Perceptron
2009 NIPS Efficient Large-Scale Distributed Training of Conditional Maximum Entropy Models
2009 NIPS Slow Learners are Fast

2014 ATC Exploiting bounded staleness to speed up Big Data analytics
2014 NIPS Fundamental Limits of Online and Distributed Algorithms for Statistical Learning and Estimation
2014 ICML Communication-Efficient Distributed Optimization using an Approximate Newton-type Method
2013 NIPS Information-theoretic lower bounds for distributed statistical estimation with communication constraints
2013 NIPS Optimistic Concurrency Control for Distributed Unsupervised Learning
2013 SDM Butterfly Mixing: Accelerating Incremental-Update Algorithms on Clusters
2012 NIPS Communication-Efficient Algorithms for Statistical Optimization
2011 NIPS Distributed Delayed Stochastic Optimization

2016 ICLRW Revisiting Distributed Synchronous SGD
2014 Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers (ADMM)
2012 IEEE Trans. on Automatic Control Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling
2010 NIPS Distributed Dual Averaging in Networks
2009 IEEE Trans. on Automatic Control Distributed subgradient methods for multi-agent optimization | slides
2008 Convex Optimization in Signal Processing and Communications Cooperative Distributed Multi-Agent Optimization

2014 APSys A Scalable and Topology Configurable Protocol for Distributed Parameter Synchronization
2014 ICML Tutorial Emerging System for Large-Scale Machine Learning
2013 Distributed Computing When distributed computation is communication expensive

2014 OSDI Project Adam: Building an Efficient and Scalable Deep Learning Training System
2014 OSDI Scaling Distributed Machine Learning with the Parameter Server
2014 NIPS Communication Efficient Distributed Machine Learning with the Parameter Server
2013 NIPSW Parameter Server for Distributed Machine Learning
2013 NIPSW Distributed Delayed Proximal Gradient Methods
2012 NIPS Large Scale Distributed Deep Networks (DistBelief)
2010 VLDB An Architecture for Parallel Topic Models