Publication & Citation Trends
Publications
36 total
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Cited by 348
OpenAlex
Automatic Cross-Replica Sharding of Weight Update in Data-Parallel Training
Cited by 52
Semantic Scholar
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Cited by 183
OpenAlex
Scale MLPerf-0.6 models on Google TPU-v3 Pods
Cited by 43
Semantic Scholar
TensorPipe: Easy Scaling with Micro-Batch Pipeline Parallelism
Cited by 0
Semantic Scholar
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism
Cited by 882
OpenAlex
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism
Cited by 236
OpenAlex
Mesh-TensorFlow: Deep Learning for Supercomputers
Cited by 431
Semantic Scholar
Research Topics
Parallel Computing and Optimization Techniques
(28)
Embedded Systems Design Techniques
(9)
Advanced Neural Network Applications
(8)
Logic, programming, and type systems
(7)
Distributed and Parallel Computing Systems
(5)
Affiliations
Google (United States)
Palo Alto University
Parallel Consulting (United States)
École Polytechnique Fédérale de Lausanne
Stanford University