David R So
Jane Street Capital
Verified email at janestreet.com
Title
Cited by
Year
PaLM 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
Cited by: 1438; Year: 2023
Towards a human-like open-domain chatbot
D Adiwardana, MT Luong, DR So, J Hall, N Fiedel, R Thoppilan, Z Yang, ...
arXiv preprint arXiv:2001.09977, 2020
Cited by: 1132; Year: 2020
Carbon emissions and large neural network training
D Patterson, J Gonzalez, Q Le, C Liang, LM Munguia, D Rothchild, D So, ...
arXiv preprint arXiv:2104.10350, 2021
Cited by: 815; Year: 2021
Pay attention to MLPs
H Liu, Z Dai, D So, QV Le
Advances in neural information processing systems 34, 9204-9215, 2021
Cited by: 642; Year: 2021
The evolved transformer
D So, Q Le, C Liang
International conference on machine learning, 5877-5886, 2019
Cited by: 550; Year: 2019
AutoML-Zero: Evolving machine learning algorithms from scratch
E Real, C Liang, D So, Q Le
International conference on machine learning, 8007-8019, 2020
Cited by: 340; Year: 2020
The carbon footprint of machine learning training will plateau, then shrink
D Patterson, J Gonzalez, U Hölzle, Q Le, C Liang, LM Munguia, ...
Computer 55 (7), 18-28, 2022
Cited by: 330; Year: 2022
Searching for efficient transformers for language modeling
D So, W Mańke, H Liu, Z Dai, N Shazeer, QV Le
Advances in neural information processing systems 34, 6010-6022, 2021
Cited by: 165; Year: 2021
Classification of crystallization outcomes using deep convolutional neural networks
AE Bruno, P Charbonneau, J Newman, EH Snell, DR So, V Vanhoucke, ...
PLOS one 13 (6), e0198883, 2018
Cited by: 86; Year: 2018
Transcending scaling laws with 0.1% extra compute
Y Tay, J Wei, HW Chung, VQ Tran, DR So, S Shakeri, X Garcia, HS Zheng, ...
arXiv preprint arXiv:2210.11399, 2022
Cited by: 69; Year: 2022
EvoPrompting: language models for code-level neural architecture search
A Chen, D Dohan, D So
Advances in Neural Information Processing Systems 36, 2024
Cited by: 64; Year: 2024
Mufasa: Multimodal fusion architecture search for electronic health records
Z Xu, DR So, AM Dai
Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10532 …, 2021
Cited by: 59; Year: 2021
Towards a human-like open-domain chatbot. arXiv 2020
D Adiwardana, MT Luong, DR So, J Hall, N Fiedel, R Thoppilan, Z Yang, ...
arXiv preprint arXiv:2001.09977, 2020
Cited by: 42; Year: 2020
Brainformers: Trading simplicity for efficiency
Y Zhou, N Du, Y Huang, D Peng, C Lan, D Huang, S Shakeri, D So, ...
International Conference on Machine Learning, 42531-42542, 2023
Cited by: 21; Year: 2023
Computationally efficient neural network architecture search
DM Dohan, DR So, C Liang, QV Le
US Patent 10,997,503, 2021
Cited by: 17; Year: 2021
Towards a human-like open-domain chatbot
A Kulshreshtha, DDF Adiwardana, DR So, G Nemade, J Hall, N Fiedel, ...
arXiv preprint arXiv:2001.09977, 2020
Cited by: 9; Year: 2020
Improving image generative models with human interactions
AK Lampinen, D So, D Eck, F Bertsch
arXiv preprint arXiv:1709.10459, 2017
Cited by: 5; Year: 2017
Unified functional hashing in automatic machine learning
R Gillard, S Jonany, Y Miao, M Munn, C de Souza, J Dungay, C Liang, ...
arXiv preprint arXiv:2302.05433, 2023
Cited by: 2; Year: 2023
Multi-modal neural network architecture search
Z Xu, DR So, AM Dai
US Patent App. 17/915,796, 2023
Cited by: 1; Year: 2023
Granular neural network architecture search over low-level primitives
DR So, QV Le Jr, H Liu, WA Manke, Z Dai, NM Shazeer
US Patent App. 17/827,362, 2022
Cited by: 1; Year: 2022